Posts
Jul, 24
A Reconfigurable GPU Implementation for Tomlinson-Harashima Precoding
Fast parallel processing capability of general purpose Graphic Processing Units (GPU) can be exploited to accelerate the precoding calculation needed in spatially multiplexed wireless communication systems. In this paper, a GPU-based implementation of the well-known multiuser TomlinsonHarashima precoding (THP) scheme combined with a latticereduction (LR) stage is presented. The proposed approach allows the LR stage […]
Jul, 23
LBCL: multi-device automatic load balancing
This paper presents the Load Balancing for OpenCL (lbcl) library, devoted to automatically solve load balancing issues on both multi-platform and heterogeneous environments. Using this library, a single kernel can be executed on a set of heterogeneous devices, giving each device an amount of work proportional to its computing power. A wrapper has been developed […]
Jul, 23
A Comparative Study of OpenACC Implementations
GPUs and other accelerators are available on many different devices, while GPGPU has been massively adopted by the HPC research community. Although a plethora of libraries and applications providing GPU support are available, the need of implementing new algorithms from scratch, or adapting sequential programs to accelerators, will always exist. Writing CUDA or OpenCL codes, […]
Jul, 23
GPU-based cellular automata simulations of laser dynamics
We present a parallel implementation for Graphics Processing Units (GPUs) of a model based on cellular automata (CA) to simulate laser dynamics. A cellular automaton is an inherent parallel type of algorithm that is very suitable to simulate complex systems formed by many individual components which give rise to emergent behaviours. We exploit the parallel […]
Jul, 23
A Parallel Edge Preserving Algorithm for Salt and Pepper Image Denoising
In this paper a two-phase filter for removing "salt and pepper" noise is proposed. In the first phase, an adaptive median filter is used to identify the set of the noisy pixels; in the second phase, these pixels are restored according to a regularization method, which contains a data-fidelity term reflecting the impulse noise characteristics. […]
Jul, 23
Real time data analysis using GPU for High energy physics experiments
The use of the Graphical Processing Unit (GPU) as a general purpose processor is becoming popular. This thesis describes how GPU Computing can be used and can be beneficial in High Energy Physics (HEP) online computation or real time data analysis. This thesis explains that HEP computing is embarrassingly parallel problem therefore by using GPU […]
Jul, 22
Parallel-META: efficient metagenomic data analysis based on high-performance computation
BACKGROUND: Metagenomics method directly sequences and analyses genome information from microbial communities. There are usually more than hundreds of genomes from different microbial species in the same community, and the main computational tasks for metagenomic data analyses include taxonomical and functional component examination of all genomes in the microbial community. Metagenomic data analysis is both […]
Jul, 22
Dynamic Overset Grid Computations for CFD Applications on Graphics Processing Units
The objective of the present work is to discuss the development of a 3D Unstructured-Overset grid Computational Fluid Dynamics (CFD) solver on General Purpose Graphics Processing Units (GPGPUs). As an extension of our previous work on 2D/3D overset grid computations for compressible/incompressible flows on static grids[1][2], the current paper focuses on moving overset grids with […]
Jul, 22
Sparse Approximate Inverse Preconditioners for Iterative Solvers on GPUs
For the solution of large systems of linear equations, iterative solvers with preconditioners are typically employed. However, the design of preconditioners for the black-box case, in which no additional information about the underlying problem is known, is very difficult. The most commonly employed method of incomplete LU factorizations is a serial algorithm and thus not […]
Jul, 22
Space-Time Finite Element Analysis on Graphics Processing Unit Computing Platform
Space-time finite element method provides a robust and accurate alternative to the traditional FEM based on semi-discrete schemes due to its extended capability in establishing approximations in both space and time. The extended capability, however, requires the simultaneous discretization of spatial and temporal domains. This subsequently results in a system of equations that is considerably […]
Jul, 22
Efficient Cross-Device Query Processing
The increasing diversity of hardware within a single system promises large performance gains but also poses a challenge for data management systems. Strategies for the efficient use of hardware with large performance differences are still lacking. For example, existing research on GPU supported data management largely handles the GPU in isolation from the system’s CPU […]
Jul, 20
Optimized Private Information Retrieval Protocol Using Graphics Processing Unit With Reduced Accessibility
Database outsourcing as a service is a new trend emerging in the computing industry instead of managing database in-house. This introduces several security issues related to database. One of the important security requirement is privacy. A Private Information Retrieval protocol (PIR) allows user to retrieve an element from the database in such way that identity […]