high performance computing on graphics processing units: hgpu.org

hgpu.org » Performance prediction

Predicting GPUDirect Benefits for HPC Workloads

Harsh Khetawat, Nikhil Jain, Abhinav Bhatele, Frank Mueller

View

Download (PDF)

Tags: Computer science, HPC, Performance prediction

March 18, 2024 by hgpu

A Practical Performance Model for Compute and Memory Bound GPU Kernels

Elias Konstantinidis, Yiannis Cotronis

View

Download (PDF)

Source codes

Tags: Benchmarking, CUDA, OpenCL, Performance prediction

May 23, 2016 by ekondis

Performance models for CPU-GPU data transfers

B. van Werkhoven, J. Maassen, F.J. Seinstra, H.E. Bal

View

Download (PDF)

Tags: Computer science, CUDA, nVidia GeForce GTX 680, nVidia GeForce GTX Titan, PCIe, Performance, Performance prediction, Tesla K20

June 5, 2014 by bennotsi

A Performance Analysis Framework for Identifying Potential Benefits in GPGPU Applications

Jaewoong Sim, Aniruddha Dasgupta, Hyesoon Kim, and Richard Vuduc

View

Download (PDF)

Tags: Analytical model, CUDA, GPGPU architecture, nVidia, Performance benefit prediction, Performance prediction, Tesla C2050

March 30, 2012 by Moaddeli

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

gpu_tracker: Python package for tracking and profiling GPU utilization in both desktop and high-performance computing environments

LOOPer: a polyhedral compiler for expressing fast and portable data parallel algorithms

LOOPer: A Learned Automatic Code Optimizer For Polyhedral Compilers

OpenMC Monte Carlo Code

Performance Portable Monte Carlo Particle Transport on Intel, NVIDIA, and AMD GPUs

Polygeist: C/C++ frontend for MLIR

Retargeting and Respecializing GPU Workloads for Performance Portability

Parallel Gaussian process with kernel approximation in CUDA

Optical flow algorithms for SYCL

SYCL in the edge: performance and energy evaluation for heterogeneous acceleration

OpenMP5-Offload-OpenMC-Intel-PVC

Distributed OpenMP Offloading of OpenMC on Intel GPU MAX Accelerators

See all packages

* * *

high performance computing on graphics processing units: hgpu.org

Predicting GPUDirect Benefits for HPC Workloads

A Practical Performance Model for Compute and Memory Bound GPU Kernels

Performance models for CPU-GPU data transfers

A Performance Analysis Framework for Identifying Potential Benefits in GPGPU Applications

Recent source codes

QArray

Celerity: High-level C++ for Accelerator Clusters

CIFAR-10 Airbench: 94% on CIFAR-10 in 3.29 second

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

LOOPer: a polyhedral compiler for expressing fast and portable data parallel algorithms

OpenMC Monte Carlo Code

Polygeist: C/C++ frontend for MLIR

Parallel Gaussian process with kernel approximation in CUDA

Optical flow algorithms for SYCL

OpenMP5-Offload-OpenMC-Intel-PVC

Most viewed papers (last 30 days)