high performance computing on graphics processing units: hgpu.org

hgpu.org » RapidMind

Generating Device-specific GPU code for Local Operators in Medical Imaging

Richard Membarth, Frank Hannig, Jurgen Teich, Mario Korner, Wieland Eckert

Tags: ATI, ATI Radeon HD 5870, ATI Radeon HD 6970, Code generation, CUDA, Image processing, Medicine, nVidia, nVidia Quadro FX 5800, OpenCL, RapidMind, Tesla C2050

June 1, 2012 by hgpu

Frameworks for multi-core architectures: a comprehensive evaluation using 2D/3D image registration

Richard Membarth, Frank Hannig, Jurgen Teich, Mario Korner, Wieland Eckert

Tags: ATI, Image registration, Medicine, OpenCL, OpenMP, RapidMind

August 19, 2011 by hgpu

Frameworks for GPU Accelerators: A comprehensive evaluation using 2D/3D image registration

Richard Membarth, Frank Hannig, Jurgen Teich, Mario Korner, Wieland Eckert

Tags: CUDA, Image processing, Image registration, nVidia, nVidia Quadro FX 5800, OpenCL, Performance, RapidMind, Tesla C2050

July 9, 2011 by hgpu

A Light-weight API for Portable Multicore Programming

Christopher G. Baker, Michael A. Heroux, H. Carter Edwards, Alan B. Williams

Tags: Computer science, CUDA, Linear Algebra, nVidia, nVidia GeForce GTX 280, OpenCL, RapidMind

April 3, 2011 by hgpu

Iterative GPGPU Linear Solvers for Sparse Matrices

Filip Vesely

Tags: Brook, Computer science, nVidia, nVidia GeForce 8600 M GS, RapidMind, Sparse linear iterative solvers, Thesis

February 24, 2011 by hgpu

PRNG Random Numbers on GPU

W. B. Langdon

Source codes

Tags: Computer science, CUDA, nVidia, nVidia GeForce 8800 GTX, Package, Pseudo-random number generators, RapidMind

February 15, 2011 by hgpu

A Fast High Quality Pseudo Random Number Generator for Graphics Processing Units

W. B. Langdon

Source codes

Tags: Computer science, CUDA, nVidia, nVidia GeForce 8800 GTX, Package, Pseudo-random number generators, RapidMind

February 15, 2011 by hgpu

Data parallel acceleration of decision support queries using Cell/BE and GPUs

Pedro Trancoso, Despo Othonos, Artemakis Artemiou

Tags: Algorithms, Cell processor, Computer science, DirectX, nVidia, nVidia GeForce 8500 GT, nVidia GeForce 8800 GTS, Performance, RapidMind

February 7, 2011 by hgpu

Financial modeling on the cell broadband engine

Virat Agarwal, Lurng-Kuo Liu, David A. Bader

Tags: Cell processor, CUDA, Finance, Monte Carlo simulation, nVidia, nVidia GeForce 8800, RapidMind

February 6, 2011 by hgpu

Supercomputing with toys: harnessing the power of NVIDIA 8800GTX and playstation 3 for bioinformatics problem

Justin Wilson, Manhong Dai, Elvis Jakupovic, Stanley Watson, Fan Meng

Tags: Bioinformatics, Cell processor, CUDA, nVidia, nVidia GeForce 8800 GTX, Performance, Playstation, RapidMind

January 26, 2011 by hgpu

GP on SPMD parallel graphics hardware for mega Bioinformatics data mining

William B. Langdon, Andrew P. Harrison

Source codes

Tags: Computer science, Genetic programming, nVidia, nVidia GeForce 8800 GTX, OpenGL, Package, RapidMind

January 21, 2011 by hgpu

Evolving GeneChip correlation predictors on parallel graphics hardware

William B. Langdon

Source codes

Tags: Bioinformatics, Computer science, Genetic programming, nVidia, nVidia GeForce 8800 GTX, OpenGL, Package, RapidMind

January 21, 2011 by hgpu
