high performance computing on graphics processing units: hgpu.org

Posts

Feb, 7

Seismic imaging based on spectral differentiation matrix and GPU implementation

Finite-difference depth migration based on the one-way wave equation uses second-order, fourth-order, or other finite-order approximations for spatial derivatives. These finite-order approximations often lead to spatial dispersion errors and low accuracy. To avoid these errors, smaller mesh spacings are used, which results in a huge increase in computational cost. In this paper, we develop a new spectral […]

CUDA

Feb, 7

Stochastic Differential Equations simulation using GPU

We discretize generic stochastic differential equations (SDEs) using Euler and Milstein schemes. We propose a GPU-based random number generator, GPURNG. Using GPURNG and the Euler and Milstein methods, we derive algorithms with which we solve the underlying SDE. As a test case, we show simulation results for European options. We show that our algorithms give greater than […]

CUDA
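As a rough CPU-side illustration of the Euler and Milstein schemes named in the abstract above, the sketch below prices a European call by Monte Carlo. It is not the paper's GPU code or its GPURNG generator: it assumes geometric Brownian motion as the underlying SDE, uses Python's standard `random` module, and every function and parameter name is hypothetical.

```python
import math
import random

def simulate_terminal(s0, r, sigma, t, steps, scheme, rng):
    """Step dS = r*S*dt + sigma*S*dW from 0 to t and return S(t)."""
    dt = t / steps
    s = s0
    for _ in range(steps):
        dw = rng.gauss(0.0, math.sqrt(dt))  # Brownian increment over dt
        s_next = s + r * s * dt + sigma * s * dw  # Euler step
        if scheme == "milstein":
            # Milstein correction: 0.5 * b * b' * (dW^2 - dt) with b = sigma*S
            s_next += 0.5 * sigma * sigma * s * (dw * dw - dt)
        s = s_next
    return s

def mc_call_price(s0, k, r, sigma, t, steps, paths, scheme, seed=0):
    """Discounted Monte Carlo average of the call payoff max(S(t) - k, 0)."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(paths):
        st = simulate_terminal(s0, r, sigma, t, steps, scheme, rng)
        total += max(st - k, 0.0)
    return math.exp(-r * t) * total / paths

def black_scholes_call(s0, k, r, sigma, t):
    """Closed-form Black-Scholes call price, used only as a reference."""
    d1 = (math.log(s0 / k) + (r + 0.5 * sigma ** 2) * t) / (sigma * math.sqrt(t))
    d2 = d1 - sigma * math.sqrt(t)
    cdf = lambda x: 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))
    return s0 * cdf(d1) - k * math.exp(-r * t) * cdf(d2)

if __name__ == "__main__":
    reference = black_scholes_call(100.0, 100.0, 0.05, 0.2, 1.0)
    for scheme in ("euler", "milstein"):
        estimate = mc_call_price(100.0, 100.0, 0.05, 0.2, 1.0, 50, 4000, scheme)
        print(scheme, round(estimate, 3), "reference", round(reference, 3))
```

Each path is independent of the others, which is what makes this kind of simulation a natural fit for one-thread-per-path GPU execution.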

Feb, 7

Aggregate Gaze Visualization with Real-time Heatmaps

A GPU implementation is given for real-time visualization of aggregate eye movements (gaze) via heatmaps. Parallelization of the algorithm leads to substantial speedup over its CPU-based implementation and, for the first time, allows real-time rendering of heatmaps atop video. GLSL shader colorization allows the choice of color ramps. Several luminance-based color maps are advocated as […]

OpenGL
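The gaze heatmap idea in the abstract above can be sketched on the CPU as follows. This is only an illustration under stated assumptions, not the paper's GLSL implementation: it assumes each gaze sample contributes a truncated Gaussian kernel, that samples have integer pixel coordinates, and that a plain grayscale ramp stands in for a luminance-based color map. All names are hypothetical.

```python
import math

def gaze_heatmap(points, width, height, sigma):
    """Accumulate a Gaussian kernel around each gaze point, then normalize to [0, 1]."""
    heat = [[0.0] * width for _ in range(height)]
    radius = int(3 * sigma)  # truncate the kernel at about three standard deviations
    for px, py in points:
        for y in range(max(0, py - radius), min(height, py + radius + 1)):
            for x in range(max(0, px - radius), min(width, px + radius + 1)):
                d2 = (x - px) ** 2 + (y - py) ** 2
                heat[y][x] += math.exp(-d2 / (2.0 * sigma * sigma))
    peak = max(max(row) for row in heat)
    if peak > 0.0:
        heat = [[v / peak for v in row] for row in heat]
    return heat

def gray_ramp(value):
    """Map a normalized intensity in [0, 1] to an (R, G, B) gray level."""
    level = int(round(255 * value))
    return (level, level, level)

if __name__ == "__main__":
    heat = gaze_heatmap([(2, 2), (3, 2)], 6, 5, 1.0)
    image = [[gray_ramp(v) for v in row] for row in heat]
    print(image[2][2])
```

The per-pixel accumulation and the color lookup are both independent across pixels, so each maps naturally to a fragment-shader pass.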

Feb, 6

Realistic Lighting Simulation for Interactive VR Applications

In the field of aircraft design, interior illumination is increasingly becoming an important design element. Different illumination scenarios inside an aircraft cabin are considered to influence the mood of air passengers, help passengers to be better prepared for time lags, and create an overall positive environment. Consequently, a physically correct and realistic lighting simulation becomes […]

OpenGL

Feb, 6

Towards High Speed Aerial Tracking of Agile Targets

In order to provide a novel perspective for videography of high speed sporting events, a highly capable trajectory tracking control methodology is developed for a custom designed Kadet Senior Unmanned Aerial Vehicle (UAV). The accompanying high fidelity system identification ensures that accurate flight models are used to design the control laws. A parallel vision based […]

OpenCL

Feb, 6

A Computational Model of Afterimages

Afterimages are optical illusions, particularly well perceived when fixating an image for an extended period of time and then looking at a neutral background, where an inverted copy of the original stimulus appears. The full mechanism that produces the perceived specific colors and shapes is complex and not entirely understood, but most of the important […]

OpenGL

Feb, 6

GPU-based Swendsen-Wang multi-cluster algorithm for the simulation of two-dimensional classical spin systems

We present the GPU calculation with the compute unified device architecture (CUDA) for the Swendsen-Wang multi-cluster algorithm of two-dimensional classical spin systems. We adapt the two connected component labeling algorithms recently proposed with CUDA for the assignment of the cluster in the Swendsen-Wang algorithm. Starting with the q-state Potts model, we extend our implementation to […]

CUDA
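To make the cluster-assignment step in the Swendsen-Wang abstract above concrete, here is a minimal serial sketch of one sweep for the q-state Potts model on a periodic square lattice. It is not the paper's CUDA code: it assumes the Hamiltonian convention H = -J Σ δ(s_i, s_j), so that bonds between equal neighbors are activated with probability 1 - exp(-βJ), and it uses a sequential union-find in place of the GPU connected component labeling algorithms. All names are hypothetical.

```python
import math
import random

def find(parent, i):
    """Return the root of site i, compressing the path as it goes."""
    while parent[i] != i:
        parent[i] = parent[parent[i]]
        i = parent[i]
    return i

def swendsen_wang_sweep(spins, size, q, beta, rng, coupling=1.0):
    """One Swendsen-Wang update of a size x size periodic Potts lattice.

    spins is a flat list of length size*size, updated in place.
    Returns the number of clusters formed in this sweep.
    """
    p_bond = 1.0 - math.exp(-beta * coupling)
    parent = list(range(size * size))
    # Step 1: activate bonds between equal neighbors and merge their clusters.
    for y in range(size):
        for x in range(size):
            i = y * size + x
            right = y * size + (x + 1) % size
            down = ((y + 1) % size) * size + x
            for j in (right, down):
                if spins[i] == spins[j] and rng.random() < p_bond:
                    ri, rj = find(parent, i), find(parent, j)
                    if ri != rj:
                        parent[ri] = rj
    # Step 2: give every cluster a new state drawn uniformly from 0..q-1.
    new_state = {}
    for i in range(size * size):
        root = find(parent, i)
        if root not in new_state:
            new_state[root] = rng.randrange(q)
        spins[i] = new_state[root]
    return len(new_state)

if __name__ == "__main__":
    rng = random.Random(42)
    lattice = [rng.randrange(3) for _ in range(8 * 8)]
    clusters = swendsen_wang_sweep(lattice, 8, 3, 1.0, rng)
    print("clusters formed:", clusters)
```

The union-find loop is the part that is hard to parallelize directly, which is why a GPU version replaces it with a parallel labeling algorithm.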

Feb, 6

A toolkit to describe and interactively display three-manifolds embedded in four-space

A data structure and toolkit are presented here that allow for the description and manipulation of mathematical models of three-manifolds and their interactive display from multiple viewpoints via the OpenGL 3D graphics package. The data structure and vector math package can be extended to support an arbitrary number of Euclidean spatial dimensions. A model in […]

OpenGL

Feb, 5

Comparing Linear and Convex Relaxations for Stereo and Motion

We provide an analysis of several linear programming relaxations for the problems of stereo disparity estimation and motion estimation. The problems are cast as integer linear programs and their relaxations are solved approximately either by block coordinate descent (TRW-S and MPLP) or by smoothing and convex optimization techniques. We include a comparison to graph cuts. […]

Feb, 5

A Fast Parallel Implementation of Queue-based Morphological Reconstruction using GPUs

In this paper we develop and experimentally evaluate a novel GPU-based implementation of the morphological reconstruction operation. This operation is commonly used in the segmentation and feature computation steps of image analysis pipelines, and often used as a component in other image processing operations. Our implementation builds on a fast hybrid CPU algorithm, which employs […]

CUDA
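For readers unfamiliar with the morphological reconstruction operation described in the abstract above, the following serial sketch shows grayscale reconstruction by dilation using a plain FIFO queue. It is a simplified stand-in, not the hybrid CPU algorithm or the GPU implementation from the paper: it assumes 4-connectivity, seeds the queue with every pixel, and requires the marker to be no greater than the mask everywhere. All names are hypothetical.

```python
from collections import deque

def reconstruct_by_dilation(marker, mask):
    """Grow marker under mask until stable; returns a new 2D list.

    A pixel p propagates min(result[p], mask[n]) to each 4-neighbor n
    whenever that raises the neighbor's current value.
    """
    height, width = len(mask), len(mask[0])
    result = [row[:] for row in marker]
    queue = deque((y, x) for y in range(height) for x in range(width))
    while queue:
        y, x = queue.popleft()
        for dy, dx in ((-1, 0), (1, 0), (0, -1), (0, 1)):
            ny, nx = y + dy, x + dx
            if 0 <= ny < height and 0 <= nx < width:
                candidate = min(result[y][x], mask[ny][nx])
                if candidate > result[ny][nx]:
                    result[ny][nx] = candidate
                    queue.append((ny, nx))  # neighbor changed, so revisit it
    return result

if __name__ == "__main__":
    mask = [[5, 5, 0, 5]]
    marker = [[5, 0, 0, 0]]
    print(reconstruct_by_dilation(marker, mask))
```

In this example the bright region connected to the marker is restored, while the bright pixel on the far side of the zero-valued gap stays dark because nothing can propagate across it.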

Feb, 5

Accelerating Outlier Detection with Uncertain Data using Graphics Processors

Outlier detection (also known as anomaly detection) is a common data mining task in which data points that lie outside expected patterns in a given dataset are identified. This is useful in areas such as fault detection, intrusion detection and in pre-processing before further analysis. There are many approaches already in use for outlier detection, […]

OpenCL

Feb, 5

Efficient Computation of SOM for Outage Database

This paper describes the use of the Self-Organizing Map (SOM) method for the analysis of power outage data. SOM, already used in many fields, is based on the Kohonen self-organizing neural network and is known to capture underlying concepts. We apply this method to a unified database of power outages to […]

CUDA

hpcbench: A set of benchmarking utilities for biomolecular simulation tools

Engineering Supercomputing Platforms for Biomolecular Applications

Recent source codes

hpcbench: A set of benchmarking utilities for biomolecular simulation tools

HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration

chemtrain: Training Molecular Dynamics Potentials in JAX

microSYCL: SYCL micro-benchmarks repository

XaaS containers

CASS: Cuda-Amd aSSembly

Cluster of smartphones for edge computing application using TensorFlow

SYCL Container

CFAL-bench

Efficient Graph Embedding at Scale: Optimizing CPU-GPU-SSD Integration
