high performance computing on graphics processing units: hgpu.org

hgpu.org » nVidia ION

A Low-Power Hybrid CPU-GPU Sort

Lawrence Tan

View

Download (PDF)

Tags: Algorithms, Computer science, CUDA, nVidia, nVidia ION, Sorting, Thesis

April 6, 2014 by hgpu

Accelerating Dynamic Binary Translation with GPUs

Chung Hwan Kim, Srikanth Manikarnike, Vaibhav Sharma, Eric Eide, Robert Ricci

View

Download (PDF)

Tags: Computer science, CUDA, nVidia, nVidia ION, Programming techniques

February 28, 2013 by hgpu

An Energy-Efficient Heterogeneous System for Embedded Learning and Classification

Abhinandan Majumdar, Srihari Cadambi, Srimat T. Chakradhar

View

Download (PDF)

Tags: Computer science, Energy-efficient computing, FPGA, Heterogeneous systems, nVidia, nVidia ION, Tesla C1070

January 20, 2012 by hgpu

Design space exploration towards a realtime and energy-aware GPGPU-based analysis of biosensor data

Constantin Timm, Frank Weichert, Peter Marwedel, Heinrich Muller

View

Download (PDF)

Tags: Computer science, Design space exploration, Energy-efficient computing, Image processing, nVidia, nVidia GeForce 9600 GT, nVidia GeForce GTS 250, nVidia ION, OpenCL, Sensing

September 28, 2011 by hgpu

Parallel graduated assignment algorithm for multiple graph matching based on a common labelling

David Rodenas, Francesc Serratosa, Albert Sole-Ribalta

View

Download (PDF)

Tags: Algorithms, Computer science, CUDA, Graph theory, nVidia, nVidia ION, Presentation

September 21, 2011 by hgpu

Evolution of thread-level parallelism in desktop applications

Geoffrey Blake, Ronald G. Dreslinski, Trevor Mudge, Krisztian Flautner

View

Download (PDF)

Tags: Benchmarking, Computer science, Measurement techniques, nVidia, nVidia GeForce GT 120, nVidia GeForce GTX 285, nVidia ION

November 7, 2010 by hgpu

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

gpu_tracker: Python package for tracking and profiling GPU utilization in both desktop and high-performance computing environments

LOOPer: a polyhedral compiler for expressing fast and portable data parallel algorithms

LOOPer: A Learned Automatic Code Optimizer For Polyhedral Compilers

OpenMC Monte Carlo Code

Performance Portable Monte Carlo Particle Transport on Intel, NVIDIA, and AMD GPUs

Polygeist: C/C++ frontend for MLIR

Retargeting and Respecializing GPU Workloads for Performance Portability

Parallel Gaussian process with kernel approximation in CUDA

Optical flow algorithms for SYCL

SYCL in the edge: performance and energy evaluation for heterogeneous acceleration

OpenMP5-Offload-OpenMC-Intel-PVC

Distributed OpenMP Offloading of OpenMC on Intel GPU MAX Accelerators

See all packages

* * *

high performance computing on graphics processing units: hgpu.org

A Low-Power Hybrid CPU-GPU Sort

Accelerating Dynamic Binary Translation with GPUs

An Energy-Efficient Heterogeneous System for Embedded Learning and Classification

Design space exploration towards a realtime and energy-aware GPGPU-based analysis of biosensor data

Parallel graduated assignment algorithm for multiple graph matching based on a common labelling

Evolution of thread-level parallelism in desktop applications

Recent source codes

QArray

Celerity: High-level C++ for Accelerator Clusters

CIFAR-10 Airbench: 94% on CIFAR-10 in 3.29 second

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

LOOPer: a polyhedral compiler for expressing fast and portable data parallel algorithms

OpenMC Monte Carlo Code

Polygeist: C/C++ frontend for MLIR

Parallel Gaussian process with kernel approximation in CUDA

Optical flow algorithms for SYCL

OpenMP5-Offload-OpenMC-Intel-PVC

Most viewed papers (last 30 days)