high performance computing on graphics processing units: hgpu.org

Posts

Jan, 18

Scan Test Power Simulation on GPGPUs

The precise estimation of dynamic power consumption, power droop and temperature development during scan test require a very large number of time-aware gate-level logic simulations. Until now, such characterizations have been feasible only for rather small designs or with reduced precision due to the high computational demands. We propose a new, throughput-optimized timing simulator on […]

CUDA

Jan, 18

Flip-Flop: Convex Hull Construction via Star-Shaped Polyhedron in 3D

Flipping is a local and efficient operation to construct the convex hull in an incremental fashion. However, it is known that the traditional flip algorithm is not able to compute the convex hull when applied to a polyhedron in R3. Our novel Flip-Flop algorithm is a variant of the flip algorithm. It overcomes the deficiency […]

CUDA

Jan, 18

Fast Sparse Level Sets on Graphics Hardware

The level-set method is one of the most popular techniques for capturing and tracking deformable interfaces. Although level sets have demonstrated great potential in visualization and computer graphics applications, such as surface editing and physically based modeling, their use for interactive simulations has been limited due to the high computational demands involved. In this paper, […]

CUDA

Jan, 18

Rethinking resampling in the particle filter on graphics processing units

Modern parallel computing devices such as the graphics processing unit (GPU) have gained significant traction in scientific computing, and are particularly well-suited to data-parallel algorithms such as the particle filter. Of the components of the particle filter, the resampling step is the most difficult to implement well on such devices, as it often requires a […]

CUDA

Jan, 17

The 20th IEEE International Conference on High Performance Computing, HiPC 2013

Authors are invited to submit original unpublished research manuscripts that demonstrate current research in all areas of high performance computing including design and analysis of parallel and distributed systems, embedded systems, and their applications in scientific, engineering, and commercial areas. Topics of interest include but are not limited to: * High-Performance Computing * Parallel and […]

Jan, 17

2013 IEEE High Performance Extreme Computing Conference, HPEC ’13

The HPEC charter is to be the premier conference in the world on the confluence of HPC and Embedded Computing. The High Performance Extreme Computing (HPEC) technical committee announces the Seventeenth Annual HPEC conference to be held 24–26 September 2013 at the Westin Hotel in Waltham, Massachusetts. The technical committee seeks new presentations that clearly […]

Jan, 17

Performance Analysis and Tuning For: General-Purpose Graphics Processing Units (GPGPU)

General-purpose graphics processing units (GPGPU) have emerged as an important class of shared memory parallel processing architectures, with widespread deployment in every computer class from high-end supercomputers to embedded mobile platforms. Relative to more traditional multicore systems of today, GPGPUs have distinctly higher degrees of hardware multithreading (hundreds of hardware thread contexts vs. tens), a […]

CUDA

•

OpenCL

Jan, 17

Inter-Warp Instruction Temporal Locality in Deep-Multithreaded GPUs

GPUs employ thousands of threads per core to achieve high throughput. These threads exhibit localities in control-flow, instruction and data addresses and values. In this study we investigate inter-warp instruction temporal locality and show that during short intervals a significant share of fetched instructions are fetched unnecessarily. This observation provides several opportunities to enhance GPUs. […]

Jan, 17

A GPU Based Implementation of Side Effect Analysis

In this paper, we discuss a novel approach of improving the performance and accuracy of parallelization compilers by utilizing massively parallel processing power of GPGPU (General Purpose Graphical Processing Units). With the advent of multi core processors, it has become necessary to use parallel programming methodologies. However, parallel programmers need tools that can provide support […]

CUDA

Jan, 17

Application of GPU Smooth Particle Hydrodynamics: Wave Runup and Overtopping on Composite Slopes

Smooth Particle Hydrodynamics is a Lagrangian meshless numerical method with substantially improved capabilities in simulation of both fluid dynamics and solid mechanics due to its meshless nature. GPUSPH is an implementation of Smoothed Particle Hydrodynamics (SPH) on Nvidia CUDA-enabled (graphics) cards. In this paper the GPUSPH is applied to runup and overtopping applications and compared […]

CUDA

Jan, 17

Bouncing Behavior of Microscopic Dust Aggregates

CONTEXT: Bouncing collisions of dust aggregates within the protoplanetary may have a significant impact on the growth process of planetesimals. Yet, the conditions that result in bouncing are not very well understood. Existing simulations studying the bouncing behavior used aggregates with an artificial, very regular internal structure. Aims: Here, we study the bouncing behavior of […]

Jan, 16

Burrows-Wheeler Aligner: A Parallel Approach

The advent of mainframe computing brought about a fundamentally different way of approaching problems for many branches of science. But none has transformed quite like the science of biology. With genome sequencing now commonplace, an organism may be completely represented as a sequence of numbers. Harnessing the power of computers, sequences of genomes (and thus […]

CUDA

HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration

* * *

high performance computing on graphics processing units: hgpu.org

Posts

Scan Test Power Simulation on GPGPUs

Flip-Flop: Convex Hull Construction via Star-Shaped Polyhedron in 3D

Fast Sparse Level Sets on Graphics Hardware

Rethinking resampling in the particle filter on graphics processing units

The 20th IEEE International Conference on High Performance Computing, HiPC 2013

2013 IEEE High Performance Extreme Computing Conference, HPEC ’13

Performance Analysis and Tuning For: General-Purpose Graphics Processing Units (GPGPU)

Inter-Warp Instruction Temporal Locality in Deep-Multithreaded GPUs

A GPU Based Implementation of Side Effect Analysis

Application of GPU Smooth Particle Hydrodynamics: Wave Runup and Overtopping on Composite Slopes

Bouncing Behavior of Microscopic Dust Aggregates

Burrows-Wheeler Aligner: A Parallel Approach

Recent source codes

WiLLM: An Open Wireless LLM Communication System

Vcc: the Vulkan Clang Compiler

hpcbench: A set of benchmarking utilities for biomolecular simulation tools

HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration

chemtrain: Training Molecular Dynamics Potentials in JAX

microSYCL: SYCL micro-benchmarks repository

XaaS containers

CASS: Cuda-Amd aSSembly

Cluser of smartphones for edge computing application using TensorFlow

SYCL Container

Most viewed papers (last 30 days)