1749

Posts

Nov, 23

Acceleration of a QM/MM-QMC simulation using GPU

We accelerated an ab-initio molecular QMC calculation by using GPGPU. Only the bottle-neck part of the calculation is replaced by CUDA subroutine and performed on GPU, getting 23.5 (9.7) times faster performance in single (double) precision. The energy deviation caused by the single precision treatment was found to be within the accuracy required in the […]
Nov, 22

GPU-based cone beam computed tomography

The use of cone beam computed tomography (CBCT) is growing in the clinical arena due to its ability to provide 3D information during interventions, its high diagnostic quality (sub-millimeter resolution), and its short scanning times (60 s). In many situations, the short scanning time of CBCT is followed by a time-consuming 3D reconstruction. The standard […]
Nov, 22

Efficient and Accurate Sound Propagation Using Adaptive Rectangular Decomposition

Accurate sound rendering can add significant realism to complement visual display in interactive applications, as well as facilitate acoustic predictions for many engineering applications, like accurate acoustic analysis for architectural design. Numerical simulation can provide this realism most naturally by modeling the underlying physics of wave propagation. However, wave simulation has traditionally posed a tough […]
Nov, 22

3D nonrigid registration via optimal mass transport on the GPU

In this paper, we present a new computationally efficient numerical scheme for the minimizing flow approach for optimal mass transport (OMT) with applications to non-rigid 3D image registration. The approach utilizes all of the gray-scale data in both images, and the optimal mapping from image A to image B is the inverse of the optimal […]
Nov, 22

vCUDA: GPU accelerated high performance computing in virtual machines

This paper describes vCUDA, a GPGPU (General Purpose Graphics Processing Unit) computing solution for virtual machines. vCUDA allows applications executing within virtual machines (VMs) to leverage hardware acceleration, which can be beneficial to the performance of a class of high performance computing (HPC) applications. The key idea in our design is: API call interception and […]
Nov, 22

Optimal rotation alignment of 3D objects using a GPU-based similarity function

In this paper, we address the challenging task of finding the best alignment between two 3D objects by solving a global optimization problem in the space of rotations SO(3). The objective function to be optimized is a newly developed rotation-variant similarity measure, which is obtained directly from the object’s geometry and is entirely implemented on […]
Nov, 22

Tuned and wildly asynchronous stencil kernels for hybrid CPU/GPU systems

We describe heterogeneous multi-CPU and multi-GPU implementations of Jacobi’s iterative method for the 2-D Poisson equation on a structured grid, in both single- and double-precision. Properly tuned, our best implementation achieves 98% of the empirical streaming GPU bandwidth (66% of peak) on a NVIDIA C1060, and 78% on a C870. Motivated to find a still […]
Nov, 22

Parallel multi-objective evolutionary algorithms on graphics processing units

Most real-life optimization problems or decision-making problems are multi-objective in nature, since they normally have several (possibly conflicting) objectives that must be satisfied at the same time. Multi-Objective Evolutionary Algorithms (MOEAs) have been gaining increasing attention among researchers and practitioners. However, they may execute for a long time for some difficult problems, because several evaluations […]
Nov, 22

GPU computing with Kaczmarz’s and other iterative algorithms for linear systems

The graphics processing unit (GPU) is used to solve large linear systems derived from partial differential equations. The differential equations studied are strongly convection-dominated, of various sizes, and common to many fields, including computational fluid dynamics, heat transfer, and structural mechanics. The paper presents comparisons between GPU and CPU implementations of several well-known iterative methods, […]
Nov, 22

A Parallel Algorithm for Error Correction in High-Throughput Short-Read Data on CUDA-Enabled Graphics Hardware

Emerging DNA sequencing technologies open up exciting new opportunities for genome sequencing by generating read data with a massive throughput. However, produced reads are significantly shorter and more error-prone compared to the traditional Sanger shotgun sequencing method. This poses challenges for de novo DNA fragment assembly algorithms in terms of both accuracy (to deal with […]
Nov, 22

Inverse scattering and refraction corrected reflection for breast cancer imaging

Reflection ultrasound (US) has been utilized as an adjunct imaging modality for over 30 years. TechniScan, Inc. has developed unique, transmission and concomitant reflection algorithms which are used to reconstruct images from data gathered during a tomographic breast scanning process called Warm Bath Ultrasound (WBU). The transmission algorithm yields high resolution, 3D, attenuation and speed […]
Nov, 22

Interventional 4-D Motion Estimation and Reconstruction of Cardiac Vasculature without Motion Periodicity Assumption

Anatomical and functional information of cardiac vasculature is a key component in the field of interventional cardiology. With the technology of C-arm CT it is possible to reconstruct static intraprocedural 3-D images from angiographic projection data. Current approaches attempt to add the temporal dimension (4-D). In the assumption of periodic heart motion, ECG-gating techniques can […]

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: