3501

Posts

Mar, 29

CuHMMer: A load-balanced CPU-GPU cooperative bioinformatics application

GPUs have recently been used to accelerate data-parallel applications for they provide easier programmability and increased generality while maintaining the tremendous memory bandwidth and computational power. Most of those applications use CPU as a controller who decides when GPUs run the computing-intensive tasks. This CPU-control-GPU-compute pattern wastes much of CPU’s computational power. In this paper, […]
Mar, 29

GPU accelerated statistical image reconstruction for Compton cameras

We propose GPU (graphics processing unit) accelerated methods that can dramatically improve the computational performance of statistical image reconstruction algorithms for Compton cameras. Since the conventional ray-based backprojection method is inefficient for GPU, we develop a fully voxel-based backprojection method which can maximize the performance of GPU. In this method, the cone surface is sampled […]
Mar, 29

Multigrid on GPU: Tackling Power Grid Analysis on parallel SIMT platforms

The challenging task of analyzing on-chip power (ground) distribution networks with multi-million node complexity and beyond is key to todaypsilas large chip designs. For the first time, we show how to exploit recent massively parallel single-instruction multiple-thread (SIMT) based graphics processing unit (GPU) platforms to tackle power grid analysis with promising performance. Several key enablers […]
Mar, 29

Fast view synthesis using GPU for 3D display

In this paper, we develop a fast view synthesis method that generates multiple intermediate views in real time for a 3D display system when the camera geometry and the depth map of the reference views are given. The proposed method achieves a faster view synthesis than previous approaches by processing in parallel the entire computations […]
Mar, 29

5.6: GPU enhancement of FDTD-PIC plasma-wave simulations

Simple models of major CPU-intensive MAGIC electromagnetic (EM) plasma code portions using the CUDA language run on the graphical processing unit (GPU) indicate 12x computing rate compared to the same calculations run on the CPU only. MAGIC is being modified for performance speedup of large-scale plasma-wave EM calculations using GPU processing. Results to-date from MAGIC […]
Mar, 29

SAR raw signal simulation based on GPU parallel computation

In this paper we present a raw signal simulator based on GPU parallel computation for synthetic aperture radar. We describe a mathematical model of SAR simulation based on FFT in detail and implement it through GPU parallel computation. GPU has a better performance in complex calculation than CPU. It supports parallel computation and raises the […]
Mar, 29

An improved visual inspection system using visual servo

In this paper we present an improved automatic visual inspection system. In this system, homography based visual servo is used to accurately locate the camera position and attitude so that a template matching inspection can be realized. To improve the visual servo system’s performance, we propose a combination strategy of a GPU based Efficient Second-order […]
Mar, 29

Rendering of 3D Dynamic Virtual Environments

In this paper we present a framework for the rendering of dynamic 3D virtual environments which can be integrated in the development of videogames. It includes methods to manage sounds and particle effects, paged static geometries, the support of a physics engine and various input systems. It has been designed with a modular structure to […]
Mar, 29

Scandalously Parallelizable Mesh Generation

We propose a novel approach which employs random sampling to generate an accurate non-uniform mesh for numerically solving Partial Differential Equation Boundary Value Problems (PDE-BVP’s). From a uniform probability distribution U over a 1D domain, we sample M discretizations of size N where M>>N. The statistical moments of the solutions to a given BVP on […]
Mar, 29

Multi-mass solvers for lattice QCD on GPUs

Graphical Processing Units (GPUs) are more and more frequently used for lattice QCD calculations. Lattice studies often require computing the quark propagators for several masses. These systems can be solved using multi-shift inverters but these algorithms are memory intensive which limits the size of the problem that can be solved using GPUs. In this paper, […]
Mar, 28

GPU-Based Shooting and Bouncing Ray Method for Fast RCS Prediction

The shooting and bouncing ray (SBR) method is highly effective in the radar cross section (RCS) prediction. For electrically large and complex targets, computing scattered fields is still time-consuming in many applications like range profile and ISAR simulation. In this paper, we propose a GPU-based SBR that is fully implemented on the graphics processing unit […]
Mar, 28

An Empirically Optimized Radix Sort for GPU

In this paper, we propose an empirical optimization technique for one of the most important sorting routines on GPU, the radix sort, that generates highly efficient code for a number of representative NVIDIA GPUs with a wide variety of architectural specifications. Our study has been focused on the algorithmic parameters of radix sort that can […]

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: