Posts

Jul, 9

Power and Performance Characterization of Computational Kernels on the GPU

Graphics Processing Units (GPUs) are gaining popularity in high performance computing (HPC). While modern GPUs can offer much more computational power than CPUs, they also consume much more power. Energy efficiency is one of the most important factors that will affect a broader adoption of GPUs in HPC. In this paper, we systematically […]
Jul, 9

The use of overlapping subgrids to accelerate the FDTD on GPU devices

The Finite Difference Time Domain (FDTD) method is widely used in electromagnetic simulations to solve problems in microwave tomography, radar and telecommunications. Because the method is both data intensive and computation intensive, there are many initiatives to improve the scalability and performance of FDTD. Despite the progress, performance in FDTD […]
Jul, 9

Accelerating data clustering on GPU-based clusters under shared memory abstraction

Many-core graphics processors play an important role today in the advancement of modern highly concurrent processors. Their ability to accelerate computation is being explored in several scientific fields. In this paper we present the acceleration of a widely used data clustering algorithm, K-means, in the context of high performance GPU clusters. As opposed […]
Jul, 9

Numerical Parallel Processing Based on GPU with CUDA Architecture

Modern graphics processing units (GPUs) are programmable and fast, and they offer a favorable price/performance ratio, which makes them well suited to parallel computation. Based on this, the article studies the general method of GPU computing and uses the Compute Unified Device Architecture (CUDA) to design a new parallel algorithm to accelerate the […]
Jul, 9

A massively parallel implementation of QC-LDPC decoder on GPU

The graphics processing unit (GPU) is able to provide a low-cost and flexible software-based multi-core architecture for high performance computing. However, it is still very challenging to efficiently map real-world applications to the GPU and fully utilize its computational power. As a case study, we present a GPU-based implementation of a real-world digital […]
Jul, 9

Utilization of GPU for real-time vision in robotics

The paper focuses on the FraDIA vision subsystem part responsible for GPU-based image processing. The developed set of classes encapsulates the OpenCL subroutines and utilizes GPU to fulfill the robotic requirements for real-time visual data processing. The class structure reflects the proposed classification of image processing algorithms.
Jul, 9

GPU volume rendering in 3D echocardiography: Real-time pre-processing and ray-casting

Since real-time acquisition of 3D echocardiographic data is achievable in practice, many volume rendering algorithms have been proposed for visualization purposes. However, due to the large amounts of data and computation involved, a tradeoff between image quality and computational efficiency has to be made. The main goal of our study was to generate high quality […]
Jul, 9

Frameworks for GPU Accelerators: A comprehensive evaluation using 2D/3D image registration

In the last decade, there has been dramatic growth in the research and development of massively parallel many-core architectures like graphics hardware, both in academia and industry. This also changed the way programs are written in order to leverage the processing power of a multitude of cores on the same hardware. In the beginning, programmers […]
Jul, 8

Acceleration of the 3D ADI-FDTD method using graphics processor units

We present preliminary results of the acceleration of the three-dimensional (3D) alternating direction implicit finite-difference time-domain (ADI-FDTD) method on graphics processor units (GPUs). Although the ADI-FDTD iteration comprises two substeps, each of which requires solving a tridiagonal matrix system of equations over the xy, xz, and yz planes of the domain, the application of this scheme frees the […]
Jul, 8

Binary Mesh Partitioning for Cache-Efficient Visualization

One important bottleneck when visualizing large data sets is the data transfer between processor and memory. Cache-aware (CA) and cache-oblivious (CO) algorithms take the memory hierarchy into consideration to design cache-efficient algorithms. CO approaches have the advantage of adapting to unknown and varying memory hierarchies. Recent CA and CO algorithms developed for 3D mesh […]
Jul, 8

Automatic code generation for solvers of cardiac cellular membrane dynamics in GPUs

The modeling of the electrical activity of the heart is of great medical and scientific interest, as it provides a way to get a better understanding of the related biophysical phenomena, allows the development of new techniques for diagnoses and serves as a platform for drug tests. However, due to the multi-scale nature of the […]
Jul, 8

SCGPSim: A fast SystemC simulator on GPUs

The main objective of this paper is to speed up the simulation performance of SystemC designs at the RTL abstraction level by exploiting the high degree of parallelism afforded by today’s general purpose graphics processors (GPGPUs). Our approach parallelizes SystemC’s discrete-event simulation (DES) on GPGPUs by transforming the model of computation of DES into a […]

* * *


HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors
