high performance computing on graphics processing units: hgpu.org

Posts

May, 16

CUDA Based GPU Programming to Simulate 3D Tissue Deformation

The medical training systems based on virtual simulation are highly desired since minimally invasive surgical techniques have become popular to patients. The training system helps surgeon trainees to acquire, practice and evaluate their surgical skills, and the key component of such a system is to simulate the dynamic procedure such as 3D biological tissue deformation […]

CUDA

May, 16

Shape-merging and interpolation using class estimation for unseen voxels with a GPU-based efficient implementation

The merging of multiple range images obtained by 3D measurement systems for generating a single polygon mesh, and processing for filling holes caused by unmeasured data or insufficient range images are essential processes for CAD, digital archiving of shapes, and CG rendering. Many of the existing processes that have been proposed for merging and interpolating […]

May, 16

The GPU enters computing’s mainstream

The Siggraph/Eurographics Graphics Hardware 2003 workshop, held in San Diego, will likely be remembered as a turning point in modern computing. In one of those rare moments when a new paradigm visibly begins changing general-purpose computing’s course, what has traditionally been a graphics-centric workshop shifted its attention to the nongraphics applications of the graphics processing […]

May, 16

Incremental Raycasting of Piecewise Quadratic Surfaces on the GPU

To overcome the limitations of triangle and point based surfaces several authors have recently investigated surface representations that are based on higher order primitives. Among these are MPU, SLIM surfaces, dynamic skin surfaces and higher order iso-surfaces. Up to now these representations were not suitable for interactive applications because of the lack of an efficient […]

OpenGL

May, 15

High dimensional pricing of exotic European contracts on a GPU Cluster, and comparison to a CPU cluster

The aim of this paper is the efficient use of CPU and GPU clusters for a general path-dependent exotic European pricing, and their comparison in terms of speed and energy consumption. To reach our goal, we propose a parallel random number generator which is well suited to the parallelization paradigm, then, we implement a multidimensional […]

CUDA

May, 15

A parallel Ant Colony Optimization algorithm with GPU-acceleration based on All-In-Roulette selection

Ant Colony Optimization is computationally expensive when it comes to complex problems. The Jacket toolbox allows implementation of MATLAB programs in Graphics Processing Unit (GPU). This paper presents and implements a parallel MAX-MIN Ant System (MMAS) based on a GPU+CPU hardware platform under the MATLAB environment with Jacket toolbox to solve Traveling Salesman Problem (TSP). […]

May, 15

K3 Moore’s Law in the Era of GPU Computing

The history of humanity is that we strive to use better tools and knowledge to build even better tools, and extend further the border of knowledge. In the past 50 years, CPU, as a dominant paradigm for computing, has provided exponential growth as predicted by Moore’s Law with remarkable accuracy. We have been leveraging CPUs […]

May, 15

Object oriented framework for real-time image processing on GPU

In this paper, we present a framework for efficiently integrating programming resources of both GPU and CPU. We introduce an object oriented framework for GPGPU-based image processing. We illustrate a set of classes exploiting the design and programming advantages of an object oriented language, such as code reusability/extensibility, flexibility, information hiding, and complexity hiding. This […]

CUDA

May, 15

Fermi GF100 GPU Architecture

The Fermi GF100 is a GPU architecture that provides several new capabilities beyond the Nvidia GT200 or Tesla architecture. The Fermi architecture offers up to 512 CUDA cores and special features for gaming and high-performance computing. This article describes the GPU’s new capabilities for tessellation, physics processing, and computational graphics.

CUDA

May, 15

Investigating the use of GPU-accelerated nodes for SAR image formation

The computation of an electromagnetic reflectivity image from a set of radar returns is a computationally intensive process. Therefore, the use of high performance computing is required to form images from radar signals in a short time frame. This paper explores the use of distributed memory cluster computers and accelerator technologies such as GPUs for […]

May, 15

A GPU Algorithm for IC Floorplanning: Specification, Analysis and Optimization

In this paper, we propose a novel floor planning algorithm for GPUs. Floor planning is an inherently sequential algorithm, far from the typical programs suitable for Single Instruction Multiple Thread (SIMT) style concurrency in a GPU. We propose a fundamentally different approach of exploring the floor plan solution space, where we evaluate concurrent moves on […]

May, 15

Automated pose estimation in 3D point clouds applying annealing particle filters and inverse kinematics on a GPU

Current experiments with HCIs have shown a high demand for more natural interaction paradigms. Gestures are thereby considered the most important cue besides speech. In order to recognize gestures it is necessary to extract meaningful motion features from the body. Up to now mostly marker based tracking systems are used in virtual reality environments, since […]

CUDA

* * *

high performance computing on graphics processing units: hgpu.org

Posts

CUDA Based GPU Programming to Simulate 3D Tissue Deformation

Shape-merging and interpolation using class estimation for unseen voxels with a GPU-based efficient implementation

The GPU enters computing’s mainstream

Incremental Raycasting of Piecewise Quadratic Surfaces on the GPU

High dimensional pricing of exotic European contracts on a GPU Cluster, and comparison to a CPU cluster

A parallel Ant Colony Optimization algorithm with GPU-acceleration based on All-In-Roulette selection

K3 Moore’s Law in the Era of GPU Computing

Object oriented framework for real-time image processing on GPU

Fermi GF100 GPU Architecture

Investigating the use of GPU-accelerated nodes for SAR image formation

A GPU Algorithm for IC Floorplanning: Specification, Analysis and Optimization

Automated pose estimation in 3D point clouds applying annealing particle filters and inverse kinematics on a GPU

Recent source codes

SYCL Container

CASS: Cuda-Amd aSSembly

Cluser of smartphones for edge computing application using TensorFlow

CFAL-bench

Efficient Graph Embedding at Scale: Optimizing CPU-GPU-SSD Integration

Can Large Language Models Predict Parallel Code Performance?

PELSI: Power-Efficient Layer-Switched Inference

Ouroboros: Virtualized Queues for dynamic memory management

MSCCL++: A GPU-driven communication stack for scalable AI applications

Benchmark compute shader of Unity against InteropUnityCUDA

Most viewed papers (last 30 days)