high performance computing on graphics processing units: hgpu.org

Ashwin M. Aji, Lokendra S. Panwar, Feng Ji, Milind Chabbi, Karthik Murthy, Pavan Balaji, Keith R. Bisset, James Dinan, Wu-chun Feng, John Mellor-Crummey, Xiaosong Ma, Rajeev Thakur

Tags: Computer science, CUDA, GPU cluster, Heterogeneous systems, MPI, nVidia, Optimization, Seismology, Tesla M2050

April 22, 2013 by hgpu

High-dimensional wave atoms and compression of seismic datasets

Matti Leinonen, Russell J. Hewett, Xiangxiong Zhang, Lexing Ying, Laurent Demanet

Tags: Compression, CUDA, nVidia, Seismology, Tesla C2075

April 16, 2013 by hgpu

GPGPU-Aided 3D Staggered-grid Finite-difference Seismic Wave Modeling

Chang Cai, Haiqing Chen, Ze Deng, Dan Chen, Samee U. Khan, Ke Zeng, Minxiao Wu

Tags: CUDA, Earth and Space Sciences, Finite difference, nVidia, nVidia GeForce GTX 680, Seismic modeling, Seismology

December 8, 2012 by hgpu

Seismic damage simulation for urban buildings based on high-performance GPU computing

Bo Han, Xinzheng Lu, Zhen Xu, Yi Li

Tags: CUDA, Earth and Space Sciences, nVidia, nVidia GeForce GTX 460, Seismic modeling, Seismology

September 17, 2012 by hgpu

SimSYCL: Synchronous, single-threaded, library-only SYCL implementation for debugging and verification

SimSYCL: A SYCL Implementation Targeting Development, Debugging, Simulation and Conformance

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

gpu_tracker: Python package for tracking and profiling GPU utilization in both desktop and high-performance computing environments

CIFAR-10 Airbench: 94% on CIFAR-10 in 3.29 seconds

94% on CIFAR-10 in 3.29 Seconds on a Single GPU

LOOPer: a polyhedral compiler for expressing fast and portable data parallel algorithms

LOOPer: A Learned Automatic Code Optimizer For Polyhedral Compilers

OpenMC Monte Carlo Code

Performance Portable Monte Carlo Particle Transport on Intel, NVIDIA, and AMD GPUs

Polygeist: C/C++ frontend for MLIR

Retargeting and Respecializing GPU Workloads for Performance Portability

Parallel Gaussian process with kernel approximation in CUDA

* * *

Fast GPU-Based Seismogram Simulation from Microseismic Events in Marine Environments Using Heterogeneous Velocity Models

A Data Parallel Algorithm for Seismic Raytracing

Automatic Detection and Denoising of Signals in Large Geophysical Datasets

GPU performance analysis of a nodal discontinuous Galerkin method for acoustic and elastic models

Automatic Command Queue Scheduling for Task-Parallel Workloads in OpenCL

Performance Modeling, Optimization, and Characterization on Heterogeneous Architectures

Towards Porting a Real-World Seismological Application to the Intel MIC Architecture

Heterogeneous FTDT for Seismic Processing

On the Efficacy of GPU-Integrated MPI for Scientific Applications

Recent source codes

SimSYCL: Synchronous, single-threaded, library-only SYCL implementation for debugging and verification

GPU plugin for PySCF

QArray

Celerity: High-level C++ for Accelerator Clusters

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

CIFAR-10 Airbench: 94% on CIFAR-10 in 3.29 seconds

LOOPer: a polyhedral compiler for expressing fast and portable data parallel algorithms

OpenMC Monte Carlo Code

Polygeist: C/C++ frontend for MLIR

Parallel Gaussian process with kernel approximation in CUDA
