high performance computing on graphics processing units: hgpu.org

Posts

Feb, 4

A Graphics Hardware-Based Vortex Detection and Visualization System

Feature detection in flow fields is a well-researched area, but practical application is often difficult due to the numerical complexity of the algorithms preventing interactive use and due to noise in experimental or high-resolution simulation data sets. We present an integrated system that provides interactive denoising, vortex detection, and visualization of vector data on […]

Feb, 4

Dispersion Simulation and Visualization For Urban Security

We present a system for simulating and visualizing the propagation of dispersive contaminants with an application to urban security. In particular, we simulate airborne contaminant propagation in open environments characterised by skyscrapers and deep urban canyons. Our approach is based on the multiple relaxation time lattice Boltzmann model (MRTLBM), which can efficiently handle complex boundary […]

OpenGL

Feb, 4

Depth Enhanced Panoramas

Depth enhanced panoramas are a practical solution to the difficult problem of inside-looking-out modeling. Depth enhanced panoramas extend color panoramas to support viewpoint translation, while retaining their speed, convenience, and low cost. Depth enhanced panoramas are built incrementally from same-center-of-projection dense-color and sparse-depth frames that are acquired, registered, and merged at the rate of 5 […]

OpenGL

Feb, 4

Two Level Approach to Efficient Visualization of Protein Dynamics

Proteins are highly flexible and large amplitude deformations of their structure, also called slow dynamics, are often decisive to their function. We present a two-level rendering approach that enables visualization of slow dynamics of large protein assemblies. Our approach is aligned with a hierarchical model of large scale molecules. Instead of constantly updating positions of […]

OpenGL

Feb, 4

Lattice Based Volumetric Global Illumination

We describe a novel volumetric global illumination framework based on the face-centered cubic (FCC) lattice. An FCC lattice has important advantages over a Cartesian lattice. It has higher packing density in the frequency domain, which translates to better sampling efficiency. Furthermore, it has the maximal possible kissing number (equivalent to the number of nearest neighbors […]

Feb, 4

QUDA programming for staggered quarks

We have been extending the QUDA GPU code developed at Boston University to include the case of improved staggered quarks. Improved staggered quarks such as asqtad and HISQ require both first and third nearest neighbor terms in the Dirac operator. We call the corresponding links fatlinks and longlinks. The fatlinks are not unitary, and staggered […]

CUDA

Feb, 4

Phoenix: A Runtime Environment for High Performance Computing on Chip Multiprocessors

Execution of applications on upcoming high-performance computing (HPC) systems introduces a variety of new challenges and amplifies many existing ones. These systems will be composed of a large number of "fat" nodes, where each node consists of multiple processors on a chip with symmetric multithreading capabilities, interconnected via high-performance networks. Traditional system software for parallel […]

Feb, 4

QP: A Heterogeneous Multi-Accelerator Cluster

We present a heterogeneous multi-accelerator cluster developed and deployed at NCSA. The cluster consists of 16 AMD dual-core CPU compute nodes each with four NVIDIA GPUs and one Xilinx FPGA. Cluster nodes are interconnected with both InfiniBand and Ethernet networks. The software stack consists of standard cluster tools with the addition of accelerator-specific software packages […]

CUDA

Feb, 3

On testing GPU memory for hard and soft errors

NVIDIA GPUs are becoming increasingly popular in scientific computation as a way to accelerate the execution of computationally demanding codes. The graphics memory used in GPUs is not protected against soft errors that may be caused by cosmic radiation and thus is a source of concern for the scientific computing community. In this short paper […]

CUDA

Feb, 3

Quantifying the Impact of GPUs on Performance and Energy Efficiency in HPC Clusters

We present an inexpensive hardware system for monitoring power usage of individual CPU hosts and externally attached GPUs in HPC clusters and the software stack for integrating the power usage data streamed in real-time by the power monitoring hardware with the cluster management software tools. We introduce a measure for quantifying the overall improvement in […]

CUDA

Feb, 3

MILC on GPUs

The MIMD Lattice Computation (MILC) code, a Quantum Chromodynamics (QCD) application used to simulate four-dimensional SU(3) lattice gauge theory, is one of the largest compute cycle users at many supercomputing centers. Previously we have investigated how one of the MILC applications can be accelerated on the Cell Broadband Engine. We currently investigate how this code can […]

CUDA

Feb, 3

3I: A tool for visualizing and processing in parallel 2D & 3D images

We present a tool for intensive processing of digital images based on graphics processing units (GPUs) and multi-core CPUs. The tool incorporates innovative filters for the denoising and estimation of missing information in three-dimensional digital images. Both processes are integrated into a pipeline that repeatedly evaluates the image until a given convergence. Finally, 3D images […]

CUDA