high performance computing on graphics processing units: hgpu.org

Posts

Aug, 1

Mapping of a film grain removal algorithm to a heterogeneous reconfigurable architecture

Despite recent advances in FPGA, GPU, and general purpose processor technologies, the challenges posed by real-time digital image processing at high resolutions cannot be fully overcome due to insufficient processing capability, inadequate data transport and control mechanisms, and often prohibitively high costs. To address these issues, we proposed a two-phase solution for a real-time film […]

Aug, 1

Real-time Volumetric Haptic and Visual Burrhole Simulation

This paper describes real-time volumetric haptic and visual algorithms developed to simulate burrhole creation for a virtual reality-based craniotomy surgical simulator. A modified Voxmap point-shell algorithm (McNeely et al., 1999), (Renz et al., 2001) is created to simulate haptic interactions between bone cutting tools and voxel-based bone. New surface boundary detection and force feedback calculation […]

OpenGL

Aug, 1

Towards an embedded biologically-inspired machine vision processor

Biologically-inspired machine vision algorithms – those that attempt to capture aspects of the computational architecture of the brain – have proven to be a promising class of algorithms for performing a variety of object and face recognition tasks. However these algorithms typically require a large number of arithmetic operations per image frame evaluated. Meanwhile, the […]

Aug, 1

Continual surface-based multi-projector blending for moving objects

We introduce a general technique for blending imagery from multiple projectors on a tracked, moving, non-planar object. Our technique continuously computes visibility of pixels over the surfaces of the object and dynamically computes the per-pixel weights for each projector. This approach supports smooth transitions between areas of the object illuminated by different number of projectors, […]

OpenGL

Aug, 1

Image-Space Caustics and Curvatures

Caustics are important visual phenomena, as well as challenging global illumination effects in computer graphics. Physically caustics can be interpreted from one of two perspectives: in terms of photons gathered on scene geometry, or in terms of a pair of caustic surfaces. These caustic surfaces are swept by the foci of light rays. In this […]

Aug, 1

Silhouette Smoothing for Real-Time Rendering of Mesh Surfaces

Coarse piecewise linear approximation of surfaces causes the undesirable polygonal appearance of silhouettes. We present an efficient method for smoothing the silhouettes of coarse triangle meshes using efficient 3D curve reconstruction and simple local remeshing. It does not assume the availability of a fine mesh and generates only a moderate amount of additional data at […]

Jul, 31

Noise-resistant fitting for spherical harmonics

Spherical harmonic (SH) basis functions have been widely used for representing spherical functions in modeling various illumination properties. They can compactly represent low-frequency spherical functions. However, when the unconstrained least square method is used for estimating the SH coefficients of a hemispherical function, the magnitude of these SH coefficients could be very large. Hence, the […]

Jul, 31

Blind image deconvolution algorithm on NVIDIA CUDA platform

Advanced image processing algorithms usually require high computing performance. Today’s personal computers (PCs) offer satisfying resources for implementation of image processing tasks. However, as the image processing techniques are becoming more and more complex other implementation possibilities have to be searched. Since image processing algorithms usually comply with the Single Instruction Multiple Data (SIMD) model, […]

CUDA

Jul, 31

An efficient stochastic approach to groupwise non-rigid image registration

The groupwise approach to non-rigid image registration, solving the dense correspondence problem, has recently been shown to be a useful tool in many applications, including medical imaging, automatic construction of statistical models of appearance and analysis of facial dynamics. Such an approach overcomes limitations of traditional pairwise methods but at a cost of having to […]

Jul, 31

Interactive Rendering of Dynamic Geometry

Fluid simulations typically produce complex three-dimensional (3D) isosurfaces whose geometry and topology change over time. The standard way of representing such "dynamic geometry" is by a set of isosurfaces that are extracted individually at certain time steps. An alternative strategy is to represent the whole sequence as a four-dimensional (4D) tetrahedral mesh. The isosurface at […]

OpenGL

Jul, 31

Real-time foreground segmentation on GPUs using local online learning and global graph cut optimization

This paper is to address the problem of foreground separation from the background modeling perspective. In particular, we deal with the difficult scenarios where the background texture might change spatially and temporally. A novel approach is proposed that incorporates a pixel-based online learning method to adapt to temporal background changes promptly, together with a graph […]

Jul, 31

Fully 3-D List-Mode OSEM Accelerated by Graphics Processing Units

Advanced list-mode image reconstruction algorithms such as fully 3D list-mode ordered-subset expectation maximization (OSEM) are needed to exploit the potential performance of high-resolution PET systems with depth-of-interaction capabilities. However, such algorithms are computationally intensive. With the aim to accelerate list-mode 3D-OSEM, we investigated the use of graphics processing units (GPUs). Primarily designed to deliver high-definition […]

OpenGL

high performance computing on graphics processing units: hgpu.org

Posts

Mapping of a film grain removal algorithm to a heterogeneous reconfigurable architecture

Real-time Volumetric Haptic and Visual Burrhole Simulation

Towards an embedded biologically-inspired machine vision processor

Continual surface-based multi-projector blending for moving objects

Image-Space Caustics and Curvatures

Silhouette Smoothing for Real-Time Rendering of Mesh Surfaces

Noise-resistant fitting for spherical harmonics

Blind image deconvolution algorithm on NVIDIA CUDA platform

An efficient stochastic approach to groupwise non-rigid image registration

Interactive Rendering of Dynamic Geometry

Real-time foreground segmentation on GPUs using local online learning and global graph cut optimization

Fully 3-D List-Mode OSEM Accelerated by Graphics Processing Units

Recent source codes

HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration

chemtrain: Training Molecular Dynamics Potentials in JAX

microSYCL: SYCL micro-benchmarks repository

XaaS containers

SYCL Container

CASS: Cuda-Amd aSSembly

Cluser of smartphones for edge computing application using TensorFlow

CFAL-bench

Efficient Graph Embedding at Scale: Optimizing CPU-GPU-SSD Integration

Can Large Language Models Predict Parallel Code Performance?

Most viewed papers (last 30 days)