Posts
Jul, 9
A massively parallel implementation of QC-LDPC decoder on GPU
The graphics processor unit (GPU) is able to provide a low-cost and flexible software-based multi-core architecture for high performance computing. However, it is still very challenging to efficiently map the real-world applications to GPU and fully utilize the computational power of GPU. As a case study, we present a GPU-based implementation of a real-world digital […]
Jul, 9
Utilization of GPU for real-time vision in robotics
The paper focuses on the FraDIA vision subsystem part responsible for GPU-based image processing. The developed set of classes encapsulates the OpenCL subroutines and utilizes GPU to fulfill the robotic requirements for real-time visual data processing. The class structure reflects the proposed classification of image processing algorithms.
Jul, 9
GPU volume rendering in 3D echocardiography: Real-time pre-processing and ray-casting
Since real-time acquisition of 3D echocardiographic data is achievable in practice, many volume rendering algorithms have been proposed for visualization purposes. However, due to the large amounts of data and computations involved a tradeoff between image quality and computational efficiency has to be made. The main goal of our study was to generate high quality […]
Jul, 9
Frameworks for GPU Accelerators: A comprehensive evaluation using 2D/3D image registration
In the last decade, there has been a dramatic growth in research and development of massively parallel many-core architectures like graphics hardware, both in academia and industry. This changed also the way programs are written in order to leverage the processing power of a multitude of cores on the same hardware. In the beginning, programmers […]
Jul, 8
Acceleration of the 3D ADI-FDTD method using graphics processor units
We present preliminary results of the acceleration of the three-dimensional (3D) alternating direction implicit finite-difference time-domain (ADI-FDTD) method on graphics processor units (GPUs). Although the ADI-FDTD iteration comprises two substeps, which each require solving a tridiagonal matrix system of equations over xy, xz, yz planes of the domain, the application of this scheme frees the […]
Jul, 8
Binary Mesh Partitioning for Cache-Efficient Visualization
One important bottleneck when visualizing large data sets is the data transfer between processor and memory. Cacheaware (CA) and cache-oblivious (CO) algorithms take into consideration the memory hierarchy to design cache efficient algorithms. CO approaches have the advantage to adapt to unknown and varying memory hierarchies. Recent CA and CO algorithms developed for 3D mesh […]
Jul, 8
Automatic code generation for solvers of cardiac cellular membrane dynamics in GPUs
The modeling of the electrical activity of the heart is of great medical and scientific interest, as it provides a way to get a better understanding of the related biophysical phenomena, allows the development of new techniques for diagnoses and serves as a platform for drug tests. However, due to the multi-scale nature of the […]
Jul, 8
SCGPSim: A fast SystemC simulator on GPUs
The main objective of this paper is to speed up the simulation performance of SystemC designs at the RTL abstraction level by exploiting the high degree of parallelism afforded by today’s general purpose graphics processors (GPGPUs). Our approach parallelizes SystemC’s discrete-event simulation (DES) on GPGPUs by transforming the model of computation of DES into a […]
Jul, 8
Implementability of shading models for current game engines
With the advances in the processor technology, todaypsilas graphical processing unit (GPU) architectures have evolved tremendously. Their speed and computational power has increased to the giga-flops levels. This has brought about a new architectural innovation called Shaders, which are programmable processing units that make all of the resources of the GPUs available to the game […]
Jul, 8
Parallel implementation of a spiking neuronal network model of unsupervised olfactory learning on NVidia CUDA
In this work I present the parallel implementation of a spiking neuronal network model with biologically realistic morphology, elements, and function on a graphical processing unit (GPU) using the NVidia CUDA framework. The comparison to a well-designed C/C++ implementation of the same model reveals a 24x speedup when using an NVidia Tesla C870 device for […]
Jul, 8
Hybrid Core Acceleration of UWB SIRE Radar Signal Processing
To move High-Performance Computing (HPC) closer to forward operating environments and missions, the Army Research Laboratory is developing approaches using hybrid, asymmetric core computing. By blending capabilities found in Graphics Processing Units (GPUs) and traditional von Neumann multicore Central Processing Units (CPUs), approaches are being developed and optimized to provide at or near real-time processing […]
Jul, 8
Visualizing Multiwavelength Astrophysical Data
With recent advances in the measurement technology for allsky astrophysical imaging, our view of the sky is no longer limited to the tiny visible spectral range over the 2D Celestial sphere. We now can access a third dimension corresponding to a broad electromagnetic spectrum with a wide range of allsky surveys; these surveys span frequency […]