high performance computing on graphics processing units: hgpu.org

Posts

Jul, 23

A Parallel Edge Preserving Algorithm for Salt and Pepper Image Denoising

In this paper, a two-phase filter for removing "salt and pepper" noise is proposed. In the first phase, an adaptive median filter is used to identify the set of noisy pixels; in the second phase, these pixels are restored according to a regularization method, which contains a data-fidelity term reflecting the impulse noise characteristics. […]

CUDA

Jul, 23

Real time data analysis using GPU for High energy physics experiments

The use of the Graphics Processing Unit (GPU) as a general-purpose processor is becoming popular. This thesis describes how GPU computing can be used, and how it can be beneficial, in High Energy Physics (HEP) online computation or real-time data analysis. It explains that HEP computing is an embarrassingly parallel problem; therefore, by using GPU […]

CUDA

Jul, 22

Parallel-META: efficient metagenomic data analysis based on high-performance computation

BACKGROUND: The metagenomics method directly sequences and analyses genome information from microbial communities. There are usually hundreds of genomes or more from different microbial species in the same community, and the main computational tasks in metagenomic data analysis include taxonomical and functional component examination of all genomes in the microbial community. Metagenomic data analysis is both […]

CUDA

Jul, 22

Dynamic Overset Grid Computations for CFD Applications on Graphics Processing Units

The objective of the present work is to discuss the development of a 3D Unstructured-Overset grid Computational Fluid Dynamics (CFD) solver on General Purpose Graphics Processing Units (GPGPUs). As an extension of our previous work on 2D/3D overset grid computations for compressible/incompressible flows on static grids[1][2], the current paper focuses on moving overset grids with […]

CUDA

Jul, 22

Sparse Approximate Inverse Preconditioners for Iterative Solvers on GPUs

For the solution of large systems of linear equations, iterative solvers with preconditioners are typically employed. However, the design of preconditioners for the black-box case, in which no additional information about the underlying problem is known, is very difficult. The most commonly employed method of incomplete LU factorizations is a serial algorithm and thus not […]

OpenCL

Jul, 22

Space-Time Finite Element Analysis on Graphics Processing Unit Computing Platform

The space-time finite element method provides a robust and accurate alternative to the traditional FEM based on semi-discrete schemes, owing to its extended capability in establishing approximations in both space and time. This extended capability, however, requires the simultaneous discretization of the spatial and temporal domains. This subsequently results in a system of equations that is considerably […]

CUDA

Jul, 22

Efficient Cross-Device Query Processing

The increasing diversity of hardware within a single system promises large performance gains but also poses a challenge for data management systems. Strategies for the efficient use of hardware with large performance differences are still lacking. For example, existing research on GPU supported data management largely handles the GPU in isolation from the system’s CPU […]

CUDA

Jul, 20

Optimized Private Information Retrieval Protocol Using Graphics Processing Unit With Reduced Accessibility

Database outsourcing as a service is an emerging trend in the computing industry, replacing in-house database management. This introduces several database security issues. One important security requirement is privacy. A Private Information Retrieval (PIR) protocol allows a user to retrieve an element from the database in such a way that identity […]

CUDA

Jul, 20

Implementation of the r.cuda.los module in the open source GRASS GIS by using parallel computation on the NVIDIA CUDA graphic cards

Parallel computing is expanding in GIS applications. A very attractive solution for parallel computing is NVIDIA graphics cards, which offer a parallel computing platform and the CUDA (Compute Unified Device Architecture) programming model. The basis for this paper is the r.los module used to calculate optical visibility (LOS – Line of Sight), which […]

CUDA

Jul, 20

Scaling CUDA for Distributed Heterogeneous Processors

The mainstream acceptance of heterogeneous computing and cloud computing is prompting a future of distributed heterogeneous systems. With current software development tools, programming such complex systems is difficult and requires an extensive knowledge of network and processor architectures. Providing an abstraction of the underlying network, message-passing interface (MPI) has been the standard tool for developing […]

CUDA

Jul, 20

Projectile Monte-Carlo Trajectory Analysis Using a Graphics Processing Unit

Monte Carlo trajectory simulation is a key element in the design and evaluation process for smart weapons development. Graphics processing units (GPUs) are powerful, massively parallel computing devices that are increasingly being used for general-purpose computing. This paper explores the use of graphics processing units for Monte Carlo trajectory prediction with the goal of […]

CUDA

Jul, 20

GPU-Accelerated Point-Based Color Bleeding

Traditional global illumination lighting techniques like Radiosity and Monte Carlo sampling are computationally expensive. This has prompted the development of the Point-Based Color Bleeding (PBCB) algorithm by Pixar in order to approximate complex indirect illumination while meeting the demands of movie production; namely, reduced memory usage, surface shading independent run time, and faster renders than […]

OpenGL

Recent source codes

HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration

chemtrain: Training Molecular Dynamics Potentials in JAX

microSYCL: SYCL micro-benchmarks repository

XaaS containers

SYCL Container

CASS: Cuda-Amd aSSembly

Cluser of smartphones for edge computing application using TensorFlow

CFAL-bench

Efficient Graph Embedding at Scale: Optimizing CPU-GPU-SSD Integration

Can Large Language Models Predict Parallel Code Performance?
