1162

Posts

Oct, 28

Compute Pairwise Manhattan Distance and Pearson Correlation Coefficient of Data Points with GPU

Graphics processing units (GPUs) are powerful computational devices tailored towards the needs of the 3-D gaming industry for high-performance, real-time graphics engines. Nvidia Corporation released a new generation of GPUs designed for general-purpose computing in 2006, and it released a GPU programming language called CUDA in 2007. The DNA microarray technology is a high throughput […]
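The excerpt above does not show the paper's GPU kernels, so the following is only a CPU reference sketch of the two measures named in the title. The function names `manhattan`, `pearson`, and `pairwise` are hypothetical, chosen for illustration. Each entry of the pairwise matrix is independent of the others, which is what makes this kind of computation a natural fit for a GPU.

```python
from math import sqrt

def manhattan(x, y):
    """Manhattan (L1) distance: sum of absolute coordinate differences."""
    return sum(abs(a - b) for a, b in zip(x, y))

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length vectors.

    Assumes neither vector is constant; a constant vector has zero
    variance and would cause a division by zero here.
    """
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    dev_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    dev_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (dev_x * dev_y)

def pairwise(points, metric):
    """Full n-by-n matrix of metric(points[i], points[j])."""
    n = len(points)
    return [[metric(points[i], points[j]) for j in range(n)] for i in range(n)]
```

For example, `pairwise(rows, manhattan)` on a list of expression profiles `rows` would return a symmetric matrix with zeros on the diagonal.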
Oct, 28

Motion Compensation and Reconstruction of H.264/AVC Video Bitstreams using the GPU

Most modern computers are equipped with powerful yet cost-effective graphics processing units (GPUs) to accelerate graphics operations. Although programmable shaders on these GPUs were designed for the creation of 3-D rendering effects, they can also be used as generic processing units for vector data. This paper proposes a hardware renderer capable of executing motion compensation, […]
Oct, 28

GPU-based object-order ray-casting for large datasets

We propose a GPU-based object-order ray-casting algorithm for the rendering of large volumetric datasets, such as the Visible Human CT datasets. A volumetric dataset is decomposed into small sub-volumes, which are then organized using a min-max octree structure. The small sub-volumes are stored in the leaf nodes of the min-max octree, which are also called […]
Oct, 28

Accelerating Kirchhoff Migration by CPU and GPU Cooperation

We discuss the performance of Petrobras production Kirchhoff prestack seismic migration on a cluster of 64 GPUs and 256 CPU cores. Porting and optimization of the application hot spot (98.2% of a single CPU core execution time) to a single GPU reduces total execution time by a factor of 36 on a control run. We […]
Oct, 28

Hybrid GPU-Based Single- and Double-Bounce SAR Simulation

In this paper, a new hybrid graphics-processing-unit (GPU)-based real-time synthetic aperture radar (SAR) simulation system is presented. Previous real-time SAR simulators only supported single-bounce simulation in real time. The new hybrid system uses the rasterization approach for real-time single-bounce simulation and a new image-based GPU ray-tracing approach for monostatic SAR double-bounce simulation. This approach provides […]
Oct, 28

The Heisenberg spin glass model on GPU: myths and actual facts

We describe different implementations of the 3D Heisenberg spin glass model for Graphics Processing Units (GPU). The results show that the fast shared memory gives better performance with respect to the slow global memory only if a multi-hit technique is used.
Oct, 28

Accelerating astrophysical particle simulations with programmable hardware (FPGA and GPU)

In a previous paper we have shown that direct gravitational N-body simulations in astrophysics scale very well for moderately parallel supercomputers (order 10–100 nodes). The best balance between computation and communication is reached if the nodes are accelerated by special purpose hardware; in this paper we describe the implementation of particle based astrophysical simulation codes […]
Oct, 28

Analyzing CUDA workloads using a detailed GPU simulator

Modern graphic processing units (GPUs) provide sufficiently flexible programming models that understanding their performance can provide insight in designing tomorrow’s manycore processors, whether those are GPUs or otherwise. The combination of multiple, multithreaded, SIMD cores makes studying these GPUs useful in understanding tradeoffs among memory, data, and thread level parallelism. While modern GPUs offer orders […]
Oct, 28

CUDA compatible GPU cards as efficient hardware accelerators for Smith-Waterman sequence alignment

BACKGROUND: Searching for similarities in protein and DNA databases has become a routine procedure in Molecular Biology. The Smith-Waterman algorithm has been available for more than 25 years. It is based on a dynamic programming approach that explores all the possible alignments between two sequences; as a result it returns the optimal local alignment. Unfortunately, the […]
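The dynamic programming approach described above can be sketched on the CPU as follows. This is a minimal illustration, not the paper's CUDA implementation: the function name `smith_waterman`, the linear gap penalty, and the scoring values (`match=2`, `mismatch=-1`, `gap=-1`) are assumptions made for the example, whereas real protein searches typically use substitution matrices and affine gap penalties.

```python
def smith_waterman(a, b, match=2, mismatch=-1, gap=-1):
    """Return (best_score, aligned_a, aligned_b) for the best local alignment.

    Scoring values are illustrative assumptions, not taken from the paper.
    """
    rows, cols = len(a) + 1, len(b) + 1
    # H[i][j] holds the best score of a local alignment ending at a[i-1], b[j-1].
    H = [[0] * cols for _ in range(rows)]
    best, best_pos = 0, (0, 0)

    for i in range(1, rows):
        for j in range(1, cols):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            # Clamping at 0 lets an alignment restart anywhere (local, not global).
            H[i][j] = max(0, H[i - 1][j - 1] + sub, H[i - 1][j] + gap, H[i][j - 1] + gap)
            if H[i][j] > best:
                best, best_pos = H[i][j], (i, j)

    # Trace back from the highest-scoring cell until a zero cell is reached.
    i, j = best_pos
    out_a, out_b = [], []
    while i > 0 and j > 0 and H[i][j] > 0:
        sub = match if a[i - 1] == b[j - 1] else mismatch
        if H[i][j] == H[i - 1][j - 1] + sub:
            out_a.append(a[i - 1]); out_b.append(b[j - 1])
            i, j = i - 1, j - 1
        elif H[i][j] == H[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-")
            i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1])
            j -= 1
    return best, "".join(reversed(out_a)), "".join(reversed(out_b))
```

The table fill costs time proportional to the product of the two sequence lengths, which is the cost that motivates hardware acceleration for database-scale searches.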
Oct, 28

cuInspiral: prototype gravitational waves detection pipeline fully coded on GPU using CUDA

In this paper we report the prototype of the first coalescing binary detection pipeline fully implemented on NVIDIA GPU hardware accelerators. The code has been embedded in a GPU library called cuInspiral and has been developed under the CUDA framework. The library contains, for example, a PN gravitational wave signal generator, matched filtering/FFT and detection algorithms […]
Oct, 28

Matched Filter Computation on FPGA, Cell and GPU

The matched filter is an important kernel in the processing of hyperspectral data. The filter enables researchers to sift useful data from instruments that span large frequency bands and can produce Gigabytes of data in seconds. In this work, we evaluate the performance of a matched filter algorithm implementation on an FPGA-accelerated co-processor (Cray XD-1), […]
Oct, 28

Automatic Dynamic Task Distribution between CPU and GPU for Real-Time Systems

The increase of computational power of programmable GPU (graphics processing unit) brings new concepts for using these devices for generic processing. Hence, with the use of the CPU and the GPU for data processing come new ideas that deal with distribution of tasks among CPU and GPU, such as automatic distribution. The importance of the […]

* * *


HGPU group © 2010-2017 hgpu.org

All rights belong to the respective authors
