high performance computing on graphics processing units: hgpu.org

Posts

Jul, 2

Visualizing and Analyzing the Mona Lisa

As technologies for acquiring 3D data and algorithms for constructing integrated models evolve, very large data sets representing objects or environments are emerging in various application areas. As a result, significant research in computer graphics has aimed to interactively render such models on affordable commodity computers. Interest is growing in the possibility of integrating real-time […]

Jul, 1

Size-based Transfer Functions: A New Volume Exploration Technique

The visualization of complex 3D images remains a challenge, a fact that is magnified by the difficulty to classify or segment volume data. In this paper, we introduce size-based transfer functions, which map the local scale of features to color and opacity. Features in a data set with similar or identical scalar values can be […]

OpenGL

Jul, 1

Adaptive proxy geometry for direct volume manipulation

This paper introduces a new design to allow interactive, direct manipulation of volume data on volumetrically rendered images. We present an adaptive volume proxy mesh which serves not to define surfaces, but to encode the geometry and physical state of the volume. This system performs a modeling-free form of direct volume deformation by adaptively constructing […]

OpenGL

Jul, 1

Graphics processing unit accelerated non-uniform fast Fourier transform for ultrahigh-speed, real-time Fourier-domain OCT

We implemented fast Gaussian gridding (FGG)-based non-uniform fast Fourier transform (NUFFT) on the graphics processing unit (GPU) architecture for ultrahigh-speed, real-time Fourier-domain optical coherence tomography (FD-OCT). The Vandermonde matrix-based non-uniform discrete Fourier transform (NUDFT) as well as the linear/cubic interpolation with fast Fourier transform (InFFT) methods are also implemented on GPU to compare their performance […]

CUDA

Jul, 1

Increasing Realism and Supporting Content Planning for Dynamic Scenes in a Mixed Reality System incorporating a Time-of-Flight Camera

Mixed reality is the combination of real and virtual scene content. Besides correct alignment of the two modalities and correct occlusion handling the core issues to be tackled are the degree of realism and the ease of use. For a convincing perception correct occlusion handling and shadowing is mandatory. We present a system for mixed […]

OpenGL

Jul, 1

GPU-assisted positive mean value coordinates for mesh deformations

In this paper we introduce positive mean value coordinates (PMVC) for mesh deformation. Following the observations of Joshi et al. [JMD*07] we show the advantage of having positive coordinates. The control points of the deformation are the vertices of a "cage" enclosing the deformed mesh. To define positive mean value coordinates for a given vertex, […]

Jul, 1

Graphics processing unit implementations of relative expression analysis algorithms enable dramatic computational speedup

SUMMARY: The top-scoring pair (TSP) and top-scoring triplet (TST) algorithms are powerful methods for classification from expression data, but analysis of all combinations across thousands of human transcriptome samples is computationally intensive, and has not yet been achieved for TST. Implementation of these algorithms for the graphics processing unit results in dramatic speedup of two […]

CUDA

Jul, 1

CAMPAIGN: An open-source Library of GPU-accelerated Data Clustering Algorithms

MOTIVATION: Data clustering techniques are an essential component of a good data analysis toolbox. Many current bioinformatics applications are inherently compute-intense and work with very large data sets. Sequential algorithms are inadequate for providing the necessary performance. For this reason, we have created CAMPAIGN, a central resource for data clustering algorithms and tools that are […]

CUDA

Jul, 1

Unified Parallel C for GPU Clusters: Language Extensions and Compiler Implementation

Unified Parallel C (UPC), a parallel extension to ANSI C, is designed for high performance computing on large-scale parallel machines. With General-purpose graphics processing units (GPUs) becoming an increasingly important high performance computing platform, we propose new language extensions to UPC to take advantage of GPU clusters. We extend UPC with hierarchical data distribution, revise […]

CUDA

Jul, 1

EASEA: specification and execution of evolutionary algorithms on GPGPU

EASEA is a framework designed to help non-expert programmers to optimize their problems by evolutionary computation. It allows to generate code targeted for standard CPU architectures, GPGPU-equipped machines as well as distributed memory clusters. In this paper, EASEA is presented by its underlying algorithms and by some example problems. Achievable speedups are also shown onto […]

CUDA

Jul, 1

Computing trends using graphic processor in high energy physics

One of the main challenges in Heavy Energy Physics is to make fast analysis of high amount of experimental and simulated data. At LHC-CERN one p-p event is approximate 1 Mb in size. The time taken to analyze the data and obtain fast results depends on high computational power. The main advantage of using GPU(Graphic […]

Jun, 30

On the technology roadmap of Free-Viewpoint 3DTV receivers

This paper presents the architecture of an innovative 3DTV receiver system, enabling Free-ViewPoint (FVP) interpolation and rendering functionality. We outline the hardware architecture of the receiver, and specify how the design decisions address the extremely high processing requirements of the system. Based on the experience and quantitative data obtained during the receiver prototyping, we present […]

* * *

high performance computing on graphics processing units: hgpu.org

Posts

Visualizing and Analyzing the Mona Lisa

Size-based Transfer Functions: A New Volume Exploration Technique

Adaptive proxy geometry for direct volume manipulation

Graphics processing unit accelerated non-uniform fast Fourier transform for ultrahigh-speed, real-time Fourier-domain OCT

Increasing Realism and Supporting Content Planning for Dynamic Scenes in a Mixed Reality System incorporating a Time-of-Flight Camera

GPU-assisted positive mean value coordinates for mesh deformations

Graphics processing unit implementations of relative expression analysis algorithms enable dramatic computational speedup

CAMPAIGN: An open-source Library of GPU-accelerated Data Clustering Algorithms

Unified Parallel C for GPU Clusters: Language Extensions and Compiler Implementation

EASEA: specification and execution of evolutionary algorithms on GPGPU

Computing trends using graphic processor in high energy physics

On the technology roadmap of Free-Viewpoint 3DTV receivers

Recent source codes

UniCoder: Unified Visual-to-Code Generation via Symbolic Rewards and Reference-Guided Code Optimization

CuFuzz: An API-Knowledge-Graph Coverage-Driven Fuzzing Framework for CUDA Libraries

AutoPass: Evidence-Guided LLM Agents for Compiler Performance Tuning

Probe-and-Refine Tuning of Repository Guidance for AI Coding Agents

CUDAnalyst (CUDA + Analyst)

CodegenBench

KernelBenchX: A Comprehensive Benchmark for Evaluating LLM-Generated GPU Kernels

CUDA Kernel Fusion Benchmarks

IntelliKit: Agent-first tooling for AMD hardware

DITRON: Distributed Compiler based on Triton for Parallel Systems

Most viewed papers (last 30 days)