1804

Posts

Nov, 26

On the energy efficiency of graphics processing units for scientific computing

The graphics processing unit (GPU) has emerged as a computational accelerator that dramatically reduces the time to discovery in high-end computing (HEC). However, while today’s state-of-the-art GPU can easily reduce the execution time of a parallel code by many orders of magnitude, it arguably comes at the expense of significant power and energy consumption. For […]
Nov, 26

Computing Prestack Kirchhoff Time Migration on General Purpose GPU

This paper introduces how to optimize a practical Prestack Kirchhoff Time Migration program by Compute Unified Device Architecture (CUDA) on General Purpose GPU (GPGPU). A few useful optimization methods on GPGPU are demostrated, such as how to increase the kernel thread numbers on GPU cores, and how to utilize the memory streams to overlap GPU […]
Nov, 25

Molecular structural mechanics approach to carbon nanotubes on graphics processing units

A molecular structural mechanics approach to carbon nanotubes on graphics processing units (GPUs) is reported. As a powerful parallel and relatively low cost processor, the GPU is used to accelerate the computations of the molecular structural mechanics approach. The data structures, matrix-vector multiplication algorithm, texture reduction algorithm, and ICCG method on the GPU are presented. […]
Nov, 25

Compute Unified Device Architecture Application Suitability

Graphics processing units (GPUs) can provide excellent speedups on some, but not all, general-purpose workloads. Using a set of computational GPU kernels as examples, the authors show how to adapt kernels to utilize the architectural features of a GeForce 8800 GPU and what finally limits the achievable performance.
Nov, 25

Dynamic real-time 4D cardiac MDCT image display using GPU-accelerated volume rendering

Intraoperative cardiac monitoring, accurate preoperative diagnosis, and surgical planning are important components of minimally-invasive cardiac therapy. Retrospective, electrocardiographically (ECG) gated, multidetector computed tomographical (MDCT), four-dimensional (3D + time), real-time, cardiac image visualization is an important tool for the surgeon in such procedure, particularly if the dynamic volumetric image can be registered to, and fused with […]
Nov, 25

LBM based flow simulation using GPU computing processor

Graphics Processing Units (GPUs), originally developed for computer games, now provide computational power for scientific applications. In this paper, we develop a general purpose Lattice Boltzmann code that runs entirely on a single GPU. The results show that: (1) simple precision floating point arithmetic is sufficient for LBM computation in comparison to double precision; (2) […]
Nov, 25

Towards real-time radiation therapy: GPU accelerated superposition/convolution

We demonstrate the use of highly parallel graphics processing units (GPUs) to accelerate the superposition/convolution (S/C) algorithm to interactive rates while reducing the number of approximations. S/C first transports the incident fluence to compute the total energy released per unit mass (TERMA) grid. Dose is then calculated by superimposing the dose deposition kernel at each […]
Nov, 25

GPU-based interactive visualization framework for ultrasound datasets

Ultrasound imaging is widely used in medical areas. By transmitting ultrasound signals into the human body, their echoed signals can be rendered to represent the shape of internal organs. Although its image quality is inferior to that of CT or MR, ultrasound is widely used for its speed and reasonable cost. Volume rendering techniques provide […]
Nov, 25

A Superresolution Framework for High-Accuracy Multiview Reconstruction

We present a variational approach to jointly estimate a displacement map and a superresolution texture for a 3D model from multiple calibrated views. The superresolution image formation model leads to an energy functional defined in terms of an integral over the object surface. This functional can be minimized by alternately solving a deblurring PDE and […]
Nov, 25

Novel Computing Architectures

We can make smaller transistors, but we have a hard time making them run faster. Yet, the demand for computing power is growing. This challenge has led to a renewed interest in novel computing architectures that can take advantage of shrinking transistor size and deliver the necessary application performance improvements.
Nov, 25

Fast face tracking using parallel particle filter algorithm

This paper proposed a multi-cue based face tracking algorithm with the help of parallel multi-core processing. Due to illumination and occlusion problems, face tracking usually does not work stably based on a single cue. Three different visual cues, color histogram, edge orientation histogram and wavelet feature, are integrated under the framework of particle filter to […]
Nov, 25

A particle-based method for viscoelastic fluids animation

We present a particle-based method for viscoelastic fluids simulation. In the method, based on the traditional Navier-Stokes equation, an additional elastic stress term is introduced to achieve viscoelastic flow behaviors, which have both fluid and solid features. Benefiting from the Lagrangian nature of Smoothed Particle Hydrodynamics, large flow deformation can be handled more easily and […]

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: