high performance computing on graphics processing units: hgpu.org

Posts

Jul, 23

LBCL: multi-device automatic load balancing

This paper presents the Load Balancing for OpenCL (lbcl) library, devoted to automatically solve load balancing issues on both multi-platform and heterogeneous environments. Using this library, a single kernel can be executed on a set of heterogeneous devices, giving each device an amount of work proportional to its computing power. A wrapper has been developed […]

OpenCL

Jul, 23

A Comparative Study of OpenACC Implementations

GPUs and other accelerators are available on many different devices, while GPGPU has been massively adopted by the HPC research community. Although a plethora of libraries and applications providing GPU support are available, the need of implementing new algorithms from scratch, or adapting sequential programs to accelerators, will always exist. Writing CUDA or OpenCL codes, […]

CUDA

•

OpenCL

Jul, 23

GPU-based cellular automata simulations of laser dynamics

We present a parallel implementation for Graphics Processing Units (GPUs) of a model based on cellular automata (CA) to simulate laser dynamics. A cellular automaton is an inherent parallel type of algorithm that is very suitable to simulate complex systems formed by many individual components which give rise to emergent behaviours. We exploit the parallel […]

CUDA

Jul, 23

A Parallel Edge Preserving Algorithm for Salt and Pepper Image Denoising

In this paper a two-phase filter for removing "salt and pepper" noise is proposed. In the first phase, an adaptive median filter is used to identify the set of the noisy pixels; in the second phase, these pixels are restored according to a regularization method, which contains a data-fidelity term reflecting the impulse noise characteristics. […]

CUDA

Jul, 23

Real time data analysis using GPU for High energy physics experiments

The use of the Graphical Processing Unit (GPU) as a general purpose processor is becoming popular. This thesis describes how GPU Computing can be used and can be beneficial in High Energy Physics (HEP) online computation or real time data analysis. This thesis explains that HEP computing is embarrassingly parallel problem therefore by using GPU […]

CUDA

Jul, 22

Parallel-META: efficient metagenomic data analysis based on high-performance computation

BACKGROUND: Metagenomics method directly sequences and analyses genome information from microbial communities. There are usually more than hundreds of genomes from different microbial species in the same community, and the main computational tasks for metagenomic data analyses include taxonomical and functional component examination of all genomes in the microbial community. Metagenomic data analysis is both […]

CUDA

Jul, 22

Dynamic Overset Grid Computations for CFD Applications on Graphics Processing Units

The objective of the present work is to discuss the development of a 3D Unstructured-Overset grid Computational Fluid Dynamics (CFD) solver on General Purpose Graphics Processing Units (GPGPUs). As an extension of our previous work on 2D/3D overset grid computations for compressible/incompressible flows on static grids[1][2], the current paper focuses on moving overset grids with […]

CUDA

Jul, 22

Sparse Approximate Inverse Preconditioners for Iterative Solvers on GPUs

For the solution of large systems of linear equations, iterative solvers with preconditioners are typically employed. However, the design of preconditioners for the black-box case, in which no additional information about the underlying problem is known, is very difficult. The most commonly employed method of incomplete LU factorizations is a serial algorithm and thus not […]

OpenCL

Jul, 22

Space-Time Finite Element Analysis on Graphics Processing Unit Computing Platform

Space-time finite element method provides a robust and accurate alternative to the traditional FEM based on semi-discrete schemes due to its extended capability in establishing approximations in both space and time. The extended capability, however, requires the simultaneous discretization of spatial and temporal domains. This subsequently results in a system of equations that is considerably […]

CUDA

Jul, 22

Efficient Cross-Device Query Processing

The increasing diversity of hardware within a single system promises large performance gains but also poses a challenge for data management systems. Strategies for the efficient use of hardware with large performance differences are still lacking. For example, existing research on GPU supported data management largely handles the GPU in isolation from the system’s CPU […]

CUDA

Jul, 20

Optimized Private Information Retrieval Protocol Using Graphics Processing Unit With Reduced Accessibility

Database outsourcing as a service is a new trend emerging in the computing industry instead of managing database in-house. This introduces several security issues related to database. One of the important security requirement is privacy. A Private Information Retrieval protocol (PIR) allows user to retrieve an element from the database in such way that identity […]

CUDA

Jul, 20

Implementation of the r.cuda.los module in the open source GRASS GIS by using parallel computation on the NVIDIA CUDA graphic cards

Parallel computing is in expanding phase in GIS applications. A very attractive solution for parallel computing are the NVIDIA graphic cards, with a parallel computing platform and the CUDA (Compute Unified Device Architecture) programming model. The basis for this paper is the r.los module used to calculate optical visibility (LOS – Line of Sight), which […]

CUDA