high performance computing on graphics processing units: hgpu.org

Posts

Jul, 1

Increasing Realism and Supporting Content Planning for Dynamic Scenes in a Mixed Reality System incorporating a Time-of-Flight Camera

Mixed reality is the combination of real and virtual scene content. Besides correct alignment of the two modalities and correct occlusion handling, the core issues to be tackled are the degree of realism and the ease of use. For a convincing perception, correct occlusion handling and shadowing are mandatory. We present a system for mixed […]

OpenGL

Jul, 1

GPU-assisted positive mean value coordinates for mesh deformations

In this paper we introduce positive mean value coordinates (PMVC) for mesh deformation. Following the observations of Joshi et al. [JMD*07] we show the advantage of having positive coordinates. The control points of the deformation are the vertices of a "cage" enclosing the deformed mesh. To define positive mean value coordinates for a given vertex, […]

Jul, 1

Graphics processing unit implementations of relative expression analysis algorithms enable dramatic computational speedup

SUMMARY: The top-scoring pair (TSP) and top-scoring triplet (TST) algorithms are powerful methods for classification from expression data, but analysis of all combinations across thousands of human transcriptome samples is computationally intensive, and has not yet been achieved for TST. Implementation of these algorithms for the graphics processing unit results in dramatic speedup of two […]

CUDA

Jul, 1

CAMPAIGN: An open-source Library of GPU-accelerated Data Clustering Algorithms

MOTIVATION: Data clustering techniques are an essential component of a good data analysis toolbox. Many current bioinformatics applications are inherently compute-intense and work with very large data sets. Sequential algorithms are inadequate for providing the necessary performance. For this reason, we have created CAMPAIGN, a central resource for data clustering algorithms and tools that are […]

CUDA

Jul, 1

Unified Parallel C for GPU Clusters: Language Extensions and Compiler Implementation

Unified Parallel C (UPC), a parallel extension to ANSI C, is designed for high performance computing on large-scale parallel machines. With General-purpose graphics processing units (GPUs) becoming an increasingly important high performance computing platform, we propose new language extensions to UPC to take advantage of GPU clusters. We extend UPC with hierarchical data distribution, revise […]

CUDA

Jul, 1

EASEA: specification and execution of evolutionary algorithms on GPGPU

EASEA is a framework designed to help non-expert programmers optimize their problems by evolutionary computation. It can generate code targeted at standard CPU architectures, GPGPU-equipped machines, and distributed memory clusters. In this paper, EASEA is presented by its underlying algorithms and by some example problems. Achievable speedups are also shown onto […]

CUDA

Jul, 1

Computing trends using graphic processor in high energy physics

One of the main challenges in High Energy Physics is the fast analysis of large amounts of experimental and simulated data. At LHC-CERN one p-p event is approximately 1 Mb in size. The time taken to analyze the data and obtain fast results depends on high computational power. The main advantage of using GPU (Graphic […]

Jun, 30

On the technology roadmap of Free-Viewpoint 3DTV receivers

This paper presents the architecture of an innovative 3DTV receiver system, enabling Free-ViewPoint (FVP) interpolation and rendering functionality. We outline the hardware architecture of the receiver, and specify how the design decisions address the extremely high processing requirements of the system. Based on the experience and quantitative data obtained during the receiver prototyping, we present […]

Jun, 30

Fast variational static IR-drop analysis on the graphical processing unit

Due to large power grid sizes, IR-drop analysis is a computationally challenging design flow step that is commonly used in integrated circuit design. Variability in silicon and circuit operating conditions makes IR-drop analysis even more challenging. We introduce a flow that takes advantage of a graphical processing unit (GPU). We introduce variability for the power […]

Jun, 30

Dynamic adaptation of broad phase collision detection algorithms

In this paper we present a new technique to dynamically adapt the first step (broad phase) of the collision detection process on hardware architecture during simulation. Our approach makes it possible to handle the unpredictable evolution of the simulation scenario (including the addition of complex objects, deletion, splitting into several objects, …). Our technique of dynamic adaptation […]

CUDA

Jun, 30

Exploiting Graphic Processing Units Parallelism to Improve Intelligent Data Acquisition System Performance in JET’s Correlation Reflectometer

The performance of intelligent data acquisition systems relies heavily on their processing capabilities and local bus bandwidth, especially in applications with high sample rates or a high number of channels. This is the case for the self-adaptive sampling rate data acquisition system installed as a pilot experiment in the KG8B correlation reflectometer at JET. The system, […]

CUDA

Jun, 30

High Resolution Program Flow Visualization of Hardware Accelerated Hybrid Multi-core Applications

The advent of multi-core processors has made parallel computing techniques mandatory on mainstream systems. With the recent rise of hardware accelerators, hybrid parallelism adds yet another dimension of complexity to the process of software development. This article presents a tool for graphical program flow analysis of hardware accelerated parallel programs. It monitors the hybrid […]

CUDA
