high performance computing on graphics processing units: hgpu.org

Trevor Bekolay, James Bergstra, Eric Hunsberger, Travis DeWolf, Terrence C. Stewart, Daniel Rasmussen, Xuan Choo, Aaron Russell Voelker, Chris Eliasmith

View

Download (PDF)

Source codes

Tags: ATI, ATI Radeon HD 7970, Biology, Neuroscience, nVidia, nVidia GeForce GTX 280, OpenCL, Package, PyOpenCL, Python

January 9, 2014 by hgpu

Computing finite models using free Boolean generators

Zarko Mijajlovic, Aleksandar Pejovic

View

Download (PDF)

Source codes

Tags: Computer science, Logic in Computer Science, OpenCL, Package, PyOpenCL

October 29, 2013 by hgpu

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

gpu_tracker: Python package for tracking and profiling GPU utilization in both desktop and high-performance computing environments

LOOPer: a polyhedral compiler for expressing fast and portable data parallel algorithms

LOOPer: A Learned Automatic Code Optimizer For Polyhedral Compilers

OpenMC Monte Carlo Code

Performance Portable Monte Carlo Particle Transport on Intel, NVIDIA, and AMD GPUs

Polygeist: C/C++ frontend for MLIR

Retargeting and Respecializing GPU Workloads for Performance Portability

Parallel Gaussian process with kernel approximation in CUDA

Optical flow algorithms for SYCL

SYCL in the edge: performance and energy evaluation for heterogeneous acceleration

OpenMP5-Offload-OpenMC-Intel-PVC

Distributed OpenMP Offloading of OpenMC on Intel GPU MAX Accelerators

See all packages

* * *

high performance computing on graphics processing units: hgpu.org

Benchmarking optimization algorithms for auto-tuning GPU kernels

Designing a high-performance boundary element library with OpenCL and Numba

linus: Conveniently explore, share, and present large-scale biological trajectory data from a web browser

PyMatting: A Python Library for Alpha Matting

APL on GPUs: A TAIL from the Past, Scribbled in Futhark

OpenCL-accelerated object classification in video streams using Spatial Pooler of Hierarchical Temporal Memory

GPU computing with OpenCL to model 2D elastic wave propagation: exploring memory usage

A Parallel Implementation of the Galerkin Method for Solving Partial Differential Equations on a Triangular Mesh

PyFAI: a Python library for high performance azimuthal integration on GPU

Loo.py: transformation-based code generation for GPUs and CPUs

Nengo: a Python tool for building large-scale functional brain models

Computing finite models using free Boolean generators

Recent source codes

QArray

Celerity: High-level C++ for Accelerator Clusters

CIFAR-10 Airbench: 94% on CIFAR-10 in 3.29 second