
Posts

May, 25

Speculative Execution on GPU: An Exploratory Study

We explore the possibility of using GPUs for speculative execution: we implement software value prediction techniques to accelerate programs with limited parallelism, and software speculation techniques to accelerate programs that contain runtime parallelism and are therefore hard to parallelize statically. Our experimental results show that due to the relatively high overhead, mapping software value prediction techniques […]
May, 25

Performance Analysis of a Hybrid MPI/CUDA Implementation of the NAS-LU Benchmark

The emergence of Graphics Processing Units (GPUs) as a potential alternative to conventional general-purpose processors has led to significant interest in these architectures by both the academic community and the High Performance Computing (HPC) industry. While GPUs look likely to deliver unparalleled levels of performance, the publication of studies claiming performance improvements in excess of […]
May, 25

Optimizing Sweep3D for Graphic Processor Unit

As a powerful and flexible processor, the Graphic Processing Unit (GPU) offers great capability for many high-performance computing applications. Sweep3D, which deterministically simulates single-group, time-independent discrete ordinates (Sn) neutron transport on a 3D Cartesian geometry, represents the key part of a real ASCI application. The wavefront process for parallel computation in […]
May, 25

GPU-Based Parallel Computing: A New Computational Approach and its Applications to Nuclear Engineering

Faster speed and higher accuracy are, and always will be, pursued by computational scientists and engineers. To achieve these goals, single-core microprocessors were assembled to arrive at "parallel computing". This parallelism has recently been extended to the chip level with the emergence of multi- and many-core architectures (that is, roughly, adding more cores onto a single chip) in both Central […]
May, 25

Accelerating the Sweep3D for a Graphic Processor Unit

As a powerful and flexible processor, the Graphic Processing Unit (GPU) offers great capability for many high-performance computing applications. Sweep3D, which deterministically simulates single-group, time-independent discrete ordinates (Sn) neutron transport on a 3D Cartesian geometry, represents the key part of a real ASCI application. The wavefront process for parallel computation […]
May, 25

Fast, large volume, GPU enabled simulations for the Ly-alpha forest: power spectrum forecasts for baryon acoustic oscillation experiments

High redshift measurements of the baryonic acoustic oscillation scale (BAO) from large Ly-alpha forest surveys represent the next frontier of dark energy studies. As part of this effort, efficient simulations of the BAO signature from the Ly-alpha forest will be required. We construct a model for producing fast, large volume simulations of the Ly-alpha forest […]
May, 25

Hierarchical fractional-step approximations and parallel kinetic Monte Carlo algorithms

We present a mathematical framework for constructing and analyzing parallel algorithms for lattice Kinetic Monte Carlo (KMC) simulations. The resulting algorithms have the capacity to simulate a wide range of spatio-temporal scales in spatially distributed, non-equilibrium physicochemical processes with complex chemistry and transport micro-mechanisms. The algorithms can be tailored to specific hierarchical parallel architectures such […]
May, 24

A Framework for 3D Model-Based Visual Tracking Using a GPU-Accelerated Particle Filter

A novel framework for acceleration of particle filtering approaches to 3D model-based, markerless visual tracking in monocular video is described. Specifically, we present a methodology for partitioning and mapping the computationally expensive weight-update stage of a particle filter to a graphics processing unit (GPU) to achieve particle- and pixel-level parallelism. Nvidia CUDA and Direct3D are […]
May, 24

A new representation of intensity atlas for GPU-accelerated instance generation

Fast instance generation is a key requirement in atlas-based registration and other problems that need a large number of atlas instances. This paper describes a new method to represent and construct intensity atlases. Both geometry and intensity information are represented using B-spline deformation lattices; intensities are approximated using the multi-level B-spline approximation algorithm during model […]
May, 24

SSE Vectorized and GPU Implementations of Arakawa’s Formula for Numerical Integration of Equations of Fluid Motion

The numerical method presented by Arakawa in 1966 [3] implements a finite difference scheme of the Jacobian for the solution of the equation of motion for two-dimensional incompressible flows, which diminishes nonlinear computational instability and permits long-term numerical integrations. This paper presents an efficient implementation of Arakawa’s formula using vectorized Streaming SIMD Extensions (SSE) and Advanced […]
May, 24

Parallel Computing Model of Multiple Dimensions Data Streams Canonical Correlation Analysis with GPU

To satisfy real-time requirements under resource constraints, a specific and practical architecture for high-dimensional data streams is proposed. Within this framework, canonical correlation analysis between two multi-dimensional data streams is carried out on CUDA (Compute Unified Device Architecture) using a data cube pattern and a dimensionality-reduction technique. The theoretical […]
May, 24

GPU Rendering of the Thin Film on Paints with Full Spectrum

Spectrum-based rendering uses spectral distributions instead of just three RGB colors to represent light sources and surface properties in the rendering equation. Since a spectrum has a value at every visible wavelength, spectrum-based rendering gives much more accurate color computation than RGB-based rendering, and it gives us the opportunity to simulate wavelength-dependent phenomena and effects […]

* * *


HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors
