high performance computing on graphics processing units: hgpu.org

Posts

Jan, 22

Revision of Relational Joins for Multi-Core and Many-Core Architectures

Actual trend set by CPU manufacturers and recent developement in the field of graphical processing units (GPUs) offered us the computational power of multi-core and many-core architectures. Database applications can benefit greatly from parallelism; however, many algorithms need to be redesigned and many technical issues need to be solved. In this paper, we have focused […]

CUDA

Jan, 22

Accelerating the Simulations of the Ising Model by the GPU under the CUDA Environment

With the rapid development of the graphics processing unit (GPU), a recent GPU offers incredible resources for general purpose computing. We apply this technology to Monte Carlo simulations of the 2D and 3D lattice Ising models. By implementing the checkerboard algorithm, results are obtained up to 54, 62 and 68 times faster on the GPU […]

CUDA

Jan, 21

Automatic Code Generation and Adaptive Grid Scheduling for GPU Cluster Computing

Recent advances in GPUs (graphics processing units) lead to massively parallel hardware that is easily programmable and widely applied in areas which require intensive computation besides graphics acceleration. The appearance of GPU clusters gains popularity in the scientific computing community, and the study on GPU clusters becomes an increasingly hot issue. While extending a singleGPU […]

CUDA

Jan, 21

GPGPU calculations of gas thermodynamic quantities

Computational processors NVIDIA Tesla GPU based on the new Fermi generation of CUDA architecture are intended to perform massively parallel calculations applicable to various parts of the scientific and technical research, including the area of fluid dynamics modeling, in particular the simulation of real gas flow. In this paper we show that a significant acceleration […]

CUDA

Jan, 21

OpenCL for Database Query Processing

In recent years, graphics processing units (GPUs) have evolved into powerful devices with significant computational performance and memory throughput. Efforts to exploit their potential to tackle problems from various scientific domains with high computational requirements have proven quite successful. In addition, previous research suggests that database query processing algorithms can be accelerated with the utilisation […]

OpenCL

Jan, 21

Markov Chain Monte Carlo on the GPU

Markov chains are a useful tool in statistics that allow us to sample and model a large population of individuals. We can extend this idea to the challenge of sampling solutions to problems. Using Markov chain Monte Carlo (MCMC) techniques we can also attempt to approximate the number of solutions with a certain confidence based […]

OpenCL

Jan, 21

A Practical Visualization Strategy for Large-Scale Supernovae CFD Simulations

Simulating the expansion of a Type II supernova using an adaptive computational fluid dynamics (CFD) engine yields a complex mixture of turbulent flow with dozens of physical properties. The dataset shown in this sketch was initially simulated on iVEC’s EPIC supercomputer (a 9600 core Linux cluster) using FLASH [Fryxell et al. 2000] to model the […]

OpenCL

•

OpenGL

Jan, 21

Parallel FEM Simulation Using GPUs

This paper deals with a research concept of parallel finite element (FE) simulation for moving boundary and adaptive refinement problems using graphics processing unit (GPU). The main concern in this study is to improve the numerical performance of continuous FE simulation using recent data-parallel computing technology (GPU-CUDA). The computational time for our existing simulations is […]

CUDA

Jan, 21

EASEA: A Generic Optimization Tool for GPU Machines in Asynchronous Island Model

Very recently, we presented an efficient implementation of Evolutionary Algorithms (EAs) using Graphics Processing Units (GPU) for solving microporous crystal structures. Because of both the inherent complexity of zeolitic materials and the constant pressure to accelerate R&D solutions, an asynchronous island model running on clusters of machines equipped with GPU cards, i.e. the current trend […]

CUDA

Jan, 21

Plenoptic Rendering With Interactive Performance Using GPUs

Processing and rendering of plenoptic camera data requires significant computational power and memory bandwidth. At the same time, real-time rendering performance is highly desirable so that users can interactively explore the infinite variety of images that can be rendered from a single plenoptic image. In this paper we describe a GPU-based approach for lightfield processing […]

OpenGL

Jan, 21

Direct Visualization of Particle-Partition of Unity Data

Direct visualization of higher-order data provides manifold advantages over the traditional approach, which is based on resampling and subsequent visualization by interpolation-based techniques. Most important, it avoids excessive computation and consumption of memory, and prevents artifacts by pixel-accurate visualization at interactive rates. This work addresses particle-partition of unity simulation data, where fields are modeled both […]

OpenGL

Jan, 21

The State of the Art in Interactive Global Illumination

The interaction of light and matter in the world surrounding us is of striking complexity and beauty. Since the very beginning of computer graphics, adequate modeling of these processes and efficient computation is an intensively studied research topic and still not a solved problem. The inherent complexity stems from the underlying physical processes as well […]

HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration

chemtrain: Training Molecular Dynamics Potentials in JAX

chemtrain-deploy: A parallel and scalable framework for machine learning potentials in million-atom MD simulations

microSYCL: SYCL micro-benchmarks repository

Exploring SYCL as a Portability Layer for High-Performance Computing on CPUs

See all packages

* * *

high performance computing on graphics processing units: hgpu.org

Posts

Revision of Relational Joins for Multi-Core and Many-Core Architectures

Accelerating the Simulations of the Ising Model by the GPU under the CUDA Environment

Automatic Code Generation and Adaptive Grid Scheduling for GPU Cluster Computing

GPGPU calculations of gas thermodynamic quantities

OpenCL for Database Query Processing

Markov Chain Monte Carlo on the GPU

A Practical Visualization Strategy for Large-Scale Supernovae CFD Simulations

Parallel FEM Simulation Using GPUs

EASEA: A Generic Optimization Tool for GPU Machines in Asynchronous Island Model

Plenoptic Rendering With Interactive Performance Using GPUs

Direct Visualization of Particle-Partition of Unity Data

The State of the Art in Interactive Global Illumination

Recent source codes

Efficient GPU Implementation of Multi-Precision Integer Division

ParEval: A Parallel Code Evaluation Benchmark

FlashSparse: Minimizing Computation Redundancy for Fast Sparse Matrix Multiplications on Tensor Cores

exa-AMD: Exascale Accelerated Materials Discovery

WiLLM: An Open Wireless LLM Communication System

Vcc: the Vulkan Clang Compiler

hpcbench: A set of benchmarking utilities for biomolecular simulation tools

HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration

chemtrain: Training Molecular Dynamics Potentials in JAX

microSYCL: SYCL micro-benchmarks repository

Most viewed papers (last 30 days)