high performance computing on graphics processing units: hgpu.org

Tag: ATI Radeon HD 6950

Contributions to the Efficient Use of General Purpose Coprocessors: Kernel Density Estimation as Case Study

Unai Lopez Novoa

Tags: Algorithms, ATI, ATI Radeon HD 6950, Computational Complexity, Computer science, Heterogeneous systems, Intel Xeon Phi, nVidia, nVidia GeForce GTX 650, OpenCL, Thesis

September 8, 2015 by hgpu

Enabling CP2K Application for Exascale Computing with Accelerators using OpenACC and OpenCL

Mariusz Uchronski, Agnieszka Kwiecien, Marcin Gebarowski

Tags: ATI, ATI Radeon HD 6950, Chemistry, CUDA, Matrix multiplication, Molecular simulation, MPI, nVidia, OpenACC, OpenCL, Sparse matrix, Tesla M2075

May 16, 2014 by hgpu

High-Performance GPGPU Programming with OCaml

Mathias Bourgoin, Emmanuel Chailloux, Jean-Luc Lamotte

Tags: ATI, ATI Radeon HD 6950, Computer science, CUDA, High-level Languages, nVidia, nVidia GeForce GTX 680, OpenCL, Tesla C2070

October 15, 2013 by hgpu

SPOC: GPGPU Programming Through Stream Processing With OCaml

Mathias Bourgoin, Emmanuel Chailloux, Jean-Luc Lamotte

Tags: ATI, ATI Radeon HD 6950, Computer science, CUDA, nVidia, OpenCL, Package, Tesla C2070

May 19, 2012 by hgpu

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

gpu_tracker: Python package for tracking and profiling GPU utilization in both desktop and high-performance computing environments

LOOPer: a polyhedral compiler for expressing fast and portable data parallel algorithms

LOOPer: A Learned Automatic Code Optimizer For Polyhedral Compilers

OpenMC Monte Carlo Code

Performance Portable Monte Carlo Particle Transport on Intel, NVIDIA, and AMD GPUs

Polygeist: C/C++ frontend for MLIR

Retargeting and Respecializing GPU Workloads for Performance Portability

Parallel Gaussian process with kernel approximation in CUDA

Optical flow algorithms for SYCL

SYCL in the edge: performance and energy evaluation for heterogeneous acceleration

OpenMP5-Offload-OpenMC-Intel-PVC

Distributed OpenMP Offloading of OpenMC on Intel GPU MAX Accelerators

* * *

Recent source codes

QArray

Celerity: High-level C++ for Accelerator Clusters

CIFAR-10 Airbench: 94% on CIFAR-10 in 3.29 second
