high performance computing on graphics processing units: hgpu.org

hgpu.org » nVidia Quadro FX 570

Exploring Heterogeneous Scheduling using the Task-Centric Programming Model

Artur Podobas, Mats Brorsson, Vladimir Vlassov

View

Download (PDF)

Tags: Computer science, CUDA, Heterogeneous systems, nVidia, nVidia Quadro FX 570, Task scheduling

September 13, 2012 by hgpu

GPU-Based Research of Highly Efficient Ray Tracing

He Jin, Fang Zhiyi, Ji Liang, Cai Ruicheng, Chen Lin

View

Download (PDF)

Tags: Computer science, DirectX, GLSL, HLSL, nVidia, nVidia Quadro FX 570, Raytracing, Rendering

December 31, 2011 by hgpu

Online video synthesis for removing occluding objects using multiple uncalibrated cameras via plane sweep algorithm

Takahide Hosokawa, Songkran Jarusirisawad, Hideo Saito

View

Download (PDF)

Tags: Algorithms, Computer science, nVidia, nVidia Quadro FX 570, OpenGL, Rendering

July 30, 2011 by hgpu

MITHRA: Multiple data independent tasks on a heterogeneous resource architecture

Reza Farivar, Abhishek Verma, Ellick M. Chan, Roy H. Campbell

View

Download (PDF)

Tags: Computer science, CUDA, GPU cluster, Heterogeneous systems, MapReduce, nVidia, nVidia GeForce 9800 GX2, nVidia Quadro FX 570

April 22, 2011 by hgpu

Tuned and asynchronous stencil kernels for CPU/GPU systems (thesis)

Sundaresan Venkatasubramanian

View

Download (PDF)

Tags: Computer science, CUDA, nVidia, nVidia Quadro FX 570, Performance, Tesla C1060, Tesla C870, Thesis

March 6, 2011 by hgpu

Tuned and wildly asynchronous stencil kernels for hybrid CPU/GPU systems

Sundaresan Venkatasubramanian, Richard W. Vuduc, None None

View

Download (PDF)

Tags: Computer science, CUDA, nVidia, nVidia Quadro FX 570, Performance, Tesla C1060, Tesla C870

November 22, 2010 by hgpu

SimSYCL: Synchronous, single-threaded, library-only SYCL implementation for debugging and verification

SimSYCL: A SYCL Implementation Targeting Development, Debugging, Simulation and Conformance

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

gpu_tracker: Python package for tracking and profiling GPU utilization in both desktop and high-performance computing environments

CIFAR-10 Airbench: 94% on CIFAR-10 in 3.29 second

94% on CIFAR-10 in 3.29 Seconds on a Single GPU

LOOPer: a polyhedral compiler for expressing fast and portable data parallel algorithms

LOOPer: A Learned Automatic Code Optimizer For Polyhedral Compilers

OpenMC Monte Carlo Code

Performance Portable Monte Carlo Particle Transport on Intel, NVIDIA, and AMD GPUs

Polygeist: C/C++ frontend for MLIR

Retargeting and Respecializing GPU Workloads for Performance Portability

Parallel Gaussian process with kernel approximation in CUDA

See all packages

* * *

high performance computing on graphics processing units: hgpu.org

Exploring Heterogeneous Scheduling using the Task-Centric Programming Model

GPU-Based Research of Highly Efficient Ray Tracing

Online video synthesis for removing occluding objects using multiple uncalibrated cameras via plane sweep algorithm

MITHRA: Multiple data independent tasks on a heterogeneous resource architecture

Tuned and asynchronous stencil kernels for CPU/GPU systems (thesis)

Tuned and wildly asynchronous stencil kernels for hybrid CPU/GPU systems

Recent source codes

SimSYCL: Synchronous, single-threaded, library-only SYCL implementation for debugging and verification

GPU plugin for PySCF

QArray

Celerity: High-level C++ for Accelerator Clusters

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

CIFAR-10 Airbench: 94% on CIFAR-10 in 3.29 second

LOOPer: a polyhedral compiler for expressing fast and portable data parallel algorithms

OpenMC Monte Carlo Code

Polygeist: C/C++ frontend for MLIR

Parallel Gaussian process with kernel approximation in CUDA

Most viewed papers (last 30 days)