high performance computing on graphics processing units: hgpu.org

Posts

Dec, 15

Compiler support for general-purpose computation on GPUs

In recent years, the GPU (graphics processing unit) has evolved into an extremely powerful and flexible processor, with it now representing an attractive platform for general-purpose computation. Moreover, changes to the design and programmability of GPUs provide the opportunity to perform general-purpose computation on a GPU (GPGPU). Even though many programming languages, software tools, and […]

OpenGL

Dec, 15

A Parallel Mediated Reality Platform

Realtime image processing provides a general framework for robust mediated reality problems. This paper presents a realtime mediated reality system that is built upon realtime image processing algorithms. It has been shown that the graphics processing unit (GPU) is capable of efficiently performing image processing tasks. The system presented uses a parallel GPU architecture for […]

OpenGL

Dec, 15

A scalable GPU-based approach to shading and shadowing for photorealistic real-time augmented reality

Visually realistic Augmented Reality (AR) entails addressing several difficult problems. The most difficult problem is that of rendering the virtual objects with illumination which is consistent with the illumination of the real scene. The paper describes a complete AR rendering system centered around the use of High Dynamic Range environment maps for representing the real […]

Dec, 15

Accelerating a three-dimensional finite-difference wave propagation code using GPU graphics cards

We accelerate a 3-D finite-difference in the time domain wave propagation code by a factor between about 20 and 60 compared to a serial implementation using graphics processing unit computing on NVIDIA graphics cards with the CUDA programming language. We describe the implementation of the code in CUDA to simulate the propagation of seismic waves […]

CUDA

Dec, 15

Distributed GPU Volume Rendering of ASKAP Spectral Data Cubes

The Australian SKA Pathfinder (ASKAP) will be producing 2.2 terabyte HI spectral-line cubes for each 8 hours of observation by 2013. Global views of spectral data cubes are vital for the detection of instrumentation errors, the identification of data artefacts and noise characteristics, and the discovery of strange phenomena, unexpected relations, or unknown patterns. We […]

Dec, 14

gProximity: Hierarchical GPU-based Operations for Collision and Distance Queries

We present novel parallel algorithms for collision detection and separation distance computation for rigid and deformable models that exploit the computational capabilities of many-core GPUs. Our approach uses thread and data parallelism to perform fast hierarchy construction, updating, and traversal using tight-fitting bounding volumes such as oriented bounding boxes (OBB) and rectangular swept spheres (RSS). […]

CUDA

Dec, 14

Fast Ray Sorting and Breadth-First Packet Traversal for GPU Ray Tracing

We present a novel approach to ray tracing execution on commodity graphics hardware using CUDA. We decompose a standard ray tracing algorithm into several data-parallel stages that are mapped efficiently to the massively parallel architecture of modern GPUs. These stages include: ray sorting into coherent packets, creation of frustums for packets, breadth-first frustum traversal through […]

CUDA

Dec, 14

GPU-Based Spherical Light Field Rendering with Per-Fragment Depth Correction

Image-based rendering techniques are a powerful alternative to traditional polygon-based computer graphics. This paper presents a novel light field rendering technique which performs per-pixel depth correction of rays for high-quality reconstruction. Our technique stores combined RGB and depth values in a parabolic 2D texture for every light field sample acquired at discrete positions on a […]

Dec, 14

High Performance GPU-based Proximity Queries using Distance Fields

Proximity queries such as closest point computation and collision detection have many applications in computer graphics, including computer animation, physics-based modelling, augmented and virtual reality. We present efficient algorithms for proximity queries between a closed rigid object and an arbitrary, possibly deformable, polygonal mesh. Using graphics hardware to densely sample the distance field of the […]

OpenGL

Dec, 14

Coherence aware GPU-based ray casting for virtual colonoscopy

In this paper, we propose a GPU-based volume ray casting for virtual colonoscopy to generate high-quality rendering images with a large screen size. Using the temporal coherence for ray casting, the empty space leaping can be efficiently done by reprojecting first-hit points of the previous frame; however, these approaches could produce artifacts such as holes […]

Dec, 14

Fast Isosurface Rendering on a GPU by Cell Rasterization

This paper presents a fast, high-quality, GPU-based isosurface rendering pipeline for implicit surfaces defined by a regular volumetric grid. GPUs are designed primarily for use with polygonal primitives, rather than volume primitives, but here we directly treat each volume cell as a single rendering primitive by designing a vertex program and fragment program on a […]

Dec, 14

Fast GPU-based Adaptive Tessellation with CUDA

Compact surface descriptions like higher-order surfaces are popular representations for both modeling and animation. However, for fast graphics-hardware-assisted rendering, they usually need to be converted to triangle meshes. In this paper, we introduce a new framework for performing on-the-fly crack-free adaptive tessellation of surface primitives completely on the GPU. Utilizing CUDA and its flexible memory […]

CUDA

* * *

high performance computing on graphics processing units: hgpu.org

Posts

Compiler support for general-purpose computation on GPUs

A Parallel Mediated Reality Platform

A scalable GPU-based approach to shading and shadowing for photorealistic real-time augmented reality

Accelerating a three-dimensional finite-difference wave propagation code using GPU graphics cards

Distributed GPU Volume Rendering of ASKAP Spectral Data Cubes

gProximity: Hierarchical GPU-based Operations for Collision and Distance Queries

Fast Ray Sorting and Breadth-First Packet Traversal for GPU Ray Tracing

GPU-Based Spherical Light Field Rendering with Per-Fragment Depth Correction

High Performance GPU-based Proximity Queries using Distance Fields

Coherence aware GPU-based ray casting for virtual colonoscopy

Fast Isosurface Rendering on a GPU by Cell Rasterization

Fast GPU-based Adaptive Tessellation with CUDA

Recent source codes

SYCL Container

CASS: Cuda-Amd aSSembly

Cluser of smartphones for edge computing application using TensorFlow

CFAL-bench

Efficient Graph Embedding at Scale: Optimizing CPU-GPU-SSD Integration

Can Large Language Models Predict Parallel Code Performance?

PELSI: Power-Efficient Layer-Switched Inference

Ouroboros: Virtualized Queues for dynamic memory management

MSCCL++: A GPU-driven communication stack for scalable AI applications

Benchmark compute shader of Unity against InteropUnityCUDA

Most viewed papers (last 30 days)