high performance computing on graphics processing units: hgpu.org

hgpu.org » nVidia GeForce 8800

Multifrontal Sparse Matrix Factorization on Graphics Processing Units

Robert F. Lucas, Gene Wagenbreth, John J. Tran, Dan M. Davis

View

Download (PDF)

Tags: Computer science, CUDA, Factorization, nVidia, nVidia GeForce 8800, Sparse matrix

January 25, 2012 by hgpu

Computation of electron quantum transport in graphene nanoribbons using GPU

S. Ihnatsenka

View

Download (PDF)

Tags: Computational Physics, CUDA, Mesoscale and Nanoscale Physics, nVidia, nVidia GeForce 8800, Physics, Tesla C1060

July 27, 2011 by hgpu

Fast, parallel, GPU-based construction of space filling curves and octrees

Prekshu Ajmera, Rhushabh Goradia, Sharat Chandran, Srinivas Aluru

View

Download (PDF)

Tags: Algorithms, Computer science, CUDA, nVidia, nVidia GeForce 8800, Visualization

June 21, 2011 by hgpu

Digital beamforming using a GPU

Carl-Inge Colombo Nilsen, Ines Hafizovic

View

Download (PDF)

Tags: CUDA, nVidia, nVidia GeForce 8800, Signal processing

March 24, 2011 by hgpu

STOCHSIMGPU: Parallel stochastic simulation for the Systems Biology Toolbox 2 for MATLAB

Guido Klingbeil, Radek Erban, Mike Giles, Philip K. Maini

View

Download (PDF)

Source codes

Tags: Bioinformatics, Biology, CUDA, nVidia, nVidia GeForce 8800, Package, Stochastic simulation

February 28, 2011 by hgpu

Financial modeling on the cell broadband engine

Virat Agarwal, Lurng-Kuo Liu, David A. Bader

View

Download (PDF)

Tags: Cell processor, CUDA, Finance, Monte Carlo simulation, nVidia, nVidia GeForce 8800, RapidMind

February 6, 2011 by hgpu

Adaptive enhancement and noise reduction in very low light-level video

Henrik Malm, Magnus Oskarsson, Eric Warrant, Petrik Clarberg, Jon Hasselgren, Calle Lejdfors

View

Download (PDF)

Tags: Algorithms, Filtering, Image processing, nVidia, nVidia GeForce 8800

February 2, 2011 by hgpu

Efficient gather and scatter operations on graphics processors

Bingsheng He, Naga K. Govindaraju, Qiong Luo, Burton Smith

View

Download (PDF)

Tags: Algorithms, Computer science, CUDA, nVidia, nVidia GeForce 8800, Search, Sorting, Sparse matrix

January 6, 2011 by hgpu

Parallel Position Weight Matrices Algorithms

Mathieu Giraud, Jean-Stephane Varre

View

Download (PDF)

Source codes

Tags: Biology, Computer science, CUDA, nVidia, nVidia GeForce 8800, nVidia GeForce GTX 280, Package

November 22, 2010 by hgpu

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

gpu_tracker: Python package for tracking and profiling GPU utilization in both desktop and high-performance computing environments

LOOPer: a polyhedral compiler for expressing fast and portable data parallel algorithms

LOOPer: A Learned Automatic Code Optimizer For Polyhedral Compilers

OpenMC Monte Carlo Code

Performance Portable Monte Carlo Particle Transport on Intel, NVIDIA, and AMD GPUs

Polygeist: C/C++ frontend for MLIR

Retargeting and Respecializing GPU Workloads for Performance Portability

Parallel Gaussian process with kernel approximation in CUDA

Optical flow algorithms for SYCL

SYCL in the edge: performance and energy evaluation for heterogeneous acceleration

OpenMP5-Offload-OpenMC-Intel-PVC

Distributed OpenMP Offloading of OpenMC on Intel GPU MAX Accelerators

See all packages

* * *

high performance computing on graphics processing units: hgpu.org

Multifrontal Sparse Matrix Factorization on Graphics Processing Units

Computation of electron quantum transport in graphene nanoribbons using GPU

Fast, parallel, GPU-based construction of space filling curves and octrees

Digital beamforming using a GPU

STOCHSIMGPU: Parallel stochastic simulation for the Systems Biology Toolbox 2 for MATLAB

Financial modeling on the cell broadband engine

Adaptive enhancement and noise reduction in very low light-level video

Efficient gather and scatter operations on graphics processors

Parallel Position Weight Matrices Algorithms

Recent source codes

QArray

Celerity: High-level C++ for Accelerator Clusters

CIFAR-10 Airbench: 94% on CIFAR-10 in 3.29 second

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

LOOPer: a polyhedral compiler for expressing fast and portable data parallel algorithms

OpenMC Monte Carlo Code

Polygeist: C/C++ frontend for MLIR

Parallel Gaussian process with kernel approximation in CUDA

Optical flow algorithms for SYCL

OpenMP5-Offload-OpenMC-Intel-PVC

Most viewed papers (last 30 days)