high performance computing on graphics processing units: hgpu.org

Posts

Nov, 30

Scientific computation for simulations on programmable graphics hardware

Graphics processor units (GPUs) have emerged as powerful parallel processors in recent years. Although floating point computations and high level programming languages are now available, the efficient use of the enormous computing power of GPUs still requires a significant amount of graphics specific knowledge. The paper explains how to use GPUs for scientific computations without […]

Nov, 30

Blister: GPU-based rendering of Boolean combinations of free-form triangulated shapes

By combining depth peeling with a linear formulation of a Boolean expression called Blist, the Blister algorithm renders an arbitrary CSG model of n primitives in at most k steps, where k is the number of depth-layers in the arrangement of the primitives. Each step starts by rendering each primitive to produce candidate surfels on […]

Nov, 30

The application of GPU particle tracing to diffusion tensor field visualization

In this paper we introduce GPU particle tracing for the visualization of 3D diffusion tensor fields. For about half a million particles, reconstruction of diffusion directions from the tensor field, time integration and rendering can be done at interactive rates. Different visualization options like oriented particles of diffusion-dependent shape, stream lines or stream tubes facilitate […]

Nov, 30

Large steps in GPU-based deformable bodies simulation

The interactive deformation and visualization of volumetric objects is still a challenging problem for many application areas. We present a novel integrated system which implements physically-based deformation and volume visualization of tetrahedral meshes on modern graphics hardware by exploiting the last features of vertex and fragment shaders. We achieve fast and stable deformation of tetrahedral […]

OpenGL

Nov, 30

A GPU accelerated spring mass system for surgical simulation

There is a growing demand for surgical simulators to do fast and precise calculations of tissue deformation to simulate increasingly complex morphology in real-time. Unfortunately, even fast spring-mass based systems have slow convergence rates for large models. This paper presents a method to accelerate computation of a spring-mass system in order to simulate a complex […]

OpenGL

Nov, 30

GPU accelerated surgical simulators for complex morphology

Surgical training in virtual environments, surgical simulation in other words, has previously had difficulties in simulating deformation of complex morphology in real-time. Even fast spring-mass based systems had slow convergence rates for large models. This paper presents two methods to accelerate a spring-mass system in order to simulate a complex organ such as the heart. […]

OpenGL

Nov, 30

GPU-BLAST: Using graphics processors to accelerate protein sequence alignment

MOTIVATION: The Basic Local Alignment Search Tool (BLAST) is one of the most widely used bioinformatics tools. The widespread impact of BLAST is reflected in over 53,000 citations that this software has received in the past two decades, and the use of the word "blast" as a verb referring to biological sequence comparison. Any improvement […]

CUDA

Nov, 29

Have GPUs made FPGAs redundant in the field of video processing?

Since the 1990s FPGAs have been popular for accelerating video processing applications. This paper presents GPUs (graphics processing units) as another viable solution, comparing their throughput with that of FPGAs. Previous work on using FPGAs and GPUs for video processing is analysed, grounds for comparison formulated and then exemplified through case studies of primary colour […]

Nov, 29

GPU-based Volume Rendering for Medical Image Visualization

During the quick advancements of medical image visualization and augmented virtual reality application, the low performance of the volume rendering algorithm is still a “bottle neck”. To facilitate the usage of well developed hardware resource, a novel graphics processing unit (GPU)-based volume ray-casting algorithm is proposed in this paper. Running on a normal PC, it […]

OpenGL

Nov, 29

Dynamic particle coupling for gpu-based fluid simulation

The main research approaches in Computational Fluid Dynamics (CFD) are grid-based (Euler) or based on particle motion (Lagrange). For interactive fluid simulation, techniques have been developed to use the Graphics Processing Unit (GPU) to speed up the computation using an Eulerian approach. This paper describes an approach for setting up a Lagrangian particle based fluid […]

OpenGL

Nov, 29

The GeForce 6 series GPU architecture

The previous chapter described how GPU architecture has changed as a result of computational and communications trends in microprocessing. This chapter describes the architecture of the GeForce 6 Series GPUs from NVIDIA, which owe their formidable computational power to their ability to take advantage of these trends. Most notably, we focus on the GeForce 6800 […]

Nov, 29

LU-GPU: Efficient Algorithms for Solving Dense Linear Systems on Graphics Hardware

We present a novel algorithm to solve dense linear systems using graphics processors (GPUs). We reduce matrix decomposition and row operations to a series of rasterization problems on the GPU. These include new techniques for streaming index pairs, swapping rows and columns and parallelizing the computation to utilize multiple vertex and fragment processors. We also […]

HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration

* * *

high performance computing on graphics processing units: hgpu.org

Posts

Scientific computation for simulations on programmable graphics hardware

Blister: GPU-based rendering of Boolean combinations of free-form triangulated shapes

The application of GPU particle tracing to diffusion tensor field visualization

Large steps in GPU-based deformable bodies simulation

A GPU accelerated spring mass system for surgical simulation

GPU accelerated surgical simulators for complex morphology

GPU-BLAST: Using graphics processors to accelerate protein sequence alignment

Have GPUs made FPGAs redundant in the field of video processing?

GPU-based Volume Rendering for Medical Image Visualization

Dynamic particle coupling for gpu-based fluid simulation

The GeForce 6 series GPU architecture

LU-GPU: Efficient Algorithms for Solving Dense Linear Systems on Graphics Hardware

Recent source codes

WiLLM: An Open Wireless LLM Communication System

Vcc: the Vulkan Clang Compiler

hpcbench: A set of benchmarking utilities for biomolecular simulation tools

HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration

chemtrain: Training Molecular Dynamics Potentials in JAX

microSYCL: SYCL micro-benchmarks repository

XaaS containers

CASS: Cuda-Amd aSSembly

Cluser of smartphones for edge computing application using TensorFlow

SYCL Container

Most viewed papers (last 30 days)