Posts
Oct, 29
Parallel Processing of Matrix Multiplication in a CPU and GPU Heterogeneous Environment
GPUs for numerical computations are becoming an attractive alternative in research. In this paper, we propose a new parallel processing environment for matrix multiplications that uses both CPUs and GPUs. Our method can decrease the execution time of matrix multiplications to 40.1%, compared with the faster of either the CPU-only case or […]
Oct, 29
Ultrasound goes GPU: real-time simulation using CUDA
Despite the increasing adoption of other imaging modalities, ultrasound guidance is widely used for surgical procedures and clinical imaging due to its low cost, non-invasiveness, and real-time visual feedback. Many ultrasound-guided procedures require extensive training, and where possible, training on simulations should be preferred over training on patients. Computational resources for existing approaches to ultrasound simulation are […]
Oct, 29
High-order finite-element seismic wave propagation modeling with MPI on a large GPU cluster
We implement a high-order finite-element application, which performs the numerical simulation of seismic wave propagation resulting, for instance, from earthquakes at the scale of a continent or from active seismic acquisition experiments in the oil industry, on a large cluster of NVIDIA Tesla graphics cards using the CUDA programming environment and non-blocking message passing based […]
Oct, 29
Accelerating Image Retrieval Using Factorial Correspondence Analysis on GPU
We are interested in the intensive use of Factorial Correspondence Analysis (FCA) for large-scale content-based image retrieval. FCA is a useful method for analyzing textual data, and we adapt it to images using SIFT local descriptors. FCA is used to reduce dimensions and to limit the number of images to be considered […]
Oct, 29
SIMD-Based Large-Scale Transient Stability Simulation on the Graphics Processing Unit
This paper presents a single-instruction-multiple-data (SIMD) based implementation of the transient stability simulation on the Graphics Processing Unit (GPU). Two programming models to implement the standard method of the transient stability simulation are proposed and implemented on a single GPU. In the first model the CPU is responsible for part of the simulation, while the […]
Oct, 29
CUSA and CUDE: GPU-accelerated methods for estimating solvent accessible surface area and desolvation
It is well-established that a linear correlation exists between accessible surface areas and experimentally measured solvation energies. Combining this knowledge with an analytic formula for calculation of solvent accessible surfaces, we derive a simple model of desolvation energy as a differentiable function of atomic positions. Additionally, we find that this algorithm is particularly well suited […]
Oct, 29
Performance study of interference on GPU and CPU resources with multiple applications
In recent years, the performance and capabilities of Graphics Processing Units (GPUs) have improved drastically, mostly due to the demands of the entertainment market, with consumers and companies alike pushing for improvements in the level of visual fidelity, which is only achieved with high-performing GPU solutions. Besides the entertainment market, there is an ongoing […]
Oct, 29
Current performance gains from utilizing the GPU or the ASIC MDGRAPE-3 within an enhanced Poisson Boltzmann approach
Scientific applications frequently suffer from limited compute performance. In this article, we investigate the suitability of specialized computer chips to overcome this limitation. An enhanced Poisson Boltzmann program is ported to the graphics processing unit and the application-specific integrated circuit MDGRAPE-3, and the resulting execution times are compared to the conventional performance obtained on […]
Oct, 29
GPU-based image manipulation and enhancement techniques for dynamic volumetric medical image visualization
An important part of an image-guided surgical system is the display component, and seamless interactivity is critical to its successful application in a clinical environment. In this paper, we present several novel techniques for 4D medical image manipulation and enhancement that employ a graphics processing unit (GPU) to accelerate image processing. We describe three types […]
Oct, 29
An analytical model for a GPU architecture with memory-level and thread-level parallelism awareness
GPU architectures are increasingly important in the multi-core era due to their high number of parallel processors. Programming thousands of massively parallel threads is a big challenge for software engineers, but understanding the performance bottlenecks of those parallel programs on GPU architectures to improve application performance is even more difficult. Current approaches rely on programmers […]
Oct, 29
GPU-based Video Feature Tracking and Matching
This paper describes novel implementations of the KLT feature tracking and SIFT feature extraction algorithms that run on the graphics processing unit (GPU) and are suitable for video analysis in real-time vision systems. While significant acceleration over standard CPU implementations is obtained by exploiting parallelism provided by modern programmable graphics hardware, the CPU is […]
Oct, 28
GPU acceleration of cutoff pair potentials for molecular modeling applications
The advent of systems biology requires the simulation of ever-larger biomolecular systems, demanding a commensurate growth in computational power. This paper examines the use of the NVIDIA Tesla C870 graphics card programmed through the CUDA toolkit to accelerate the calculation of cutoff pair potentials, one of the most prevalent computations required by many different molecular […]