1159

Posts

Oct, 28

Accelerating Kirchhoff Migration by CPU and GPU Cooperation

We discuss the performance of Petrobras production Kirchhoff prestack seismic migration on a cluster of 64 GPUs and 256 CPU cores. Porting and optimization of the application hot spot (98.2% of a single CPU core execution time) to a single GPU reduces total execution time by a factor of 36 on a control run. We […]
Oct, 28

Hybrid GPU-Based Single- and Double-Bounce SAR Simulation

In this paper, a new hybrid graphics-processing-unit (GPU)-based real-time synthetic aperture radar (SAR) simulation system is presented. Previous real-time SAR simulators only supported single-bounce simulation in real time. The new hybrid system uses the rasterization approach for real-time single-bounce simulation and a new image-based GPU ray-tracing approach for monostatic SAR double-bounce simulation. This approach provides […]
Oct, 28

The Heisenberg spin glass model on GPU: myths and actual facts

We describe different implementations of the 3D Heisenberg spin glass model for Graphics Processing Units (GPU). The results show that the fast shared memory gives better performance with respect to the slow global memory only if a multi-hit technique is used.
Oct, 28

Accelerating astrophysical particle simulations with programmable hardware (FPGA and GPU)

In a previous paper we have shown that direct gravitational N-body simulations in astrophysics scale very well for moderately parallel supercomputers (order 10–100 nodes). The best balance between computation and communication is reached if the nodes are accelerated by special purpose hardware; in this paper we describe the implementation of particle based astrophysical simulation codes […]
Oct, 28

Analyzing CUDA workloads using a detailed GPU simulator

Modern graphic processing units (GPUs) provide sufficiently flexible programming models that understanding their performance can provide insight in designing tomorrow’s manycore processors, whether those are GPUs or otherwise. The combination of multiple, multithreaded, SIMD cores makes studying these GPUs useful in understanding tradeoffs among memory, data, and thread level parallelism. While modern GPUs offer orders […]
Oct, 28

CUDA compatible GPU cards as efficient hardware accelerators for Smith-Waterman sequence alignment

BACKGROUND:Searching for similarities in protein and DNA databases has become a routine procedure in Molecular Biology. The Smith-Waterman algorithm has been available for more than 25 years. It is based on a dynamic programming approach that explores all the possible alignments between two sequences; as a result it returns the optimal local alignment. Unfortunately, the […]
Oct, 28

cuInspiral: prototype gravitational waves detection pipeline fully coded on GPU using CUDA

In this paper we report the prototype of the first coalescing binary detection pipeline fully implemented on NVIDIA GPU hardware accelerators. The code has been embedded in a GPU library, called cuInspiral and has been developed under CUDA framework. The library contains for example a PN gravitational wave signal generator, matched filtering/FFT and detection algorithms […]
Oct, 28

Matched Filter Computation on FPGA, Cell and GPU

The matched filter is an important kernel in the processing of hyperspectral data. The filter enables researchers to sift useful data from instruments that span large frequency bands and can produce Gigabytes of data in seconds. In this work, we evaluate the performance of a matched filter algorithm implementation on an FPGA-accelerated co-processor (Cray XD-1), […]
Oct, 28

Automatic Dynamic Task Distribution between CPU and GPU for Real-Time Systems

The increase of computational power of programmable GPU (graphics processing unit) brings new concepts for using these devices for generic processing. Hence, with the use of the CPU and the GPU for data processing come new ideas that deals with distribution of tasks among CPU and GPU, such as automatic distribution. The importance of the […]
Oct, 28

An improved scheme of an interactive finite element model for 3D soft-tissue cutting and deformation

As a safe and feasible alternative to enriching and enhancing traditional surgical training, virtual-reality-based surgical simulators have been investigated for a long time. But it is still a challenge for researchers to accurately depict the behavior of human tissue without losing the flexibility of simulation. In this paper, we propose an improved scheme of an […]
Oct, 28

GpuCV: A GPU-Accelerated Framework for Image Processing and Computer Vision

This paper presents briefly the state of the art of accelerating image processing with graphics hardware (GPU) and discusses some of its caveats. Then it describes GpuCV, an open source multi-platform library for GPU-accelerated image processing and Computer Vision operators and applications. It is meant for computer vision scientist not familiar with GPU technologies. GpuCV […]
Oct, 28

A hybrid condensed finite element model with GPU acceleration for interactive 3D soft tissue cutting

To meet the requirement of computer-aided medical operations, apart from the real-time deformation, it is also necessary in the design to simulate the tissue cutting and suturing in a surgery simulation. In this paper, we present a model on topology change and deformation of soft tissue, referred to as the hybrid condensed finite element model, […]

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: