high performance computing on graphics processing units: hgpu.org

hgpu.org » nVidia Quadro FX 2700 M

Use of CUDA for the Continuous Space Language Model

Elizabeth A. Thompson, Timothy Anderson

View

Download (PDF)

Tags: CUBLAS, CUDA, nVidia, nVidia Quadro FX 2700 M, Signal processing

November 16, 2012 by hgpu

Building Source-to-Source Compilers for Heterogeneous Targets

Serge Guelton

View

Download (PDF)

Tags: Code generation, Computer science, CUDA, FPGA, Heterogeneous systems, nVidia, nVidia Quadro FX 2700 M, Thesis

January 4, 2012 by hgpu

Compilation for Heterogeneous Computing: Automating Analyses, Transformations and Decisions

Serge Guelton, Francois Irigoin, Ronan Keryell

View

Download (PDF)

Source codes

Tags: Code generation, Computer science, CUDA, FPGA, Heterogeneous systems, nVidia, nVidia Quadro FX 2700 M, Package, Signal processing

November 17, 2011 by hgpu

SimSYCL: Synchronous, single-threaded, library-only SYCL implementation for debugging and verification

SimSYCL: A SYCL Implementation Targeting Development, Debugging, Simulation and Conformance

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

gpu_tracker: Python package for tracking and profiling GPU utilization in both desktop and high-performance computing environments

CIFAR-10 Airbench: 94% on CIFAR-10 in 3.29 second

94% on CIFAR-10 in 3.29 Seconds on a Single GPU

LOOPer: a polyhedral compiler for expressing fast and portable data parallel algorithms

LOOPer: A Learned Automatic Code Optimizer For Polyhedral Compilers

OpenMC Monte Carlo Code

Performance Portable Monte Carlo Particle Transport on Intel, NVIDIA, and AMD GPUs

Polygeist: C/C++ frontend for MLIR

Retargeting and Respecializing GPU Workloads for Performance Portability

Parallel Gaussian process with kernel approximation in CUDA

See all packages

* * *

high performance computing on graphics processing units: hgpu.org

Use of CUDA for the Continuous Space Language Model

Building Source-to-Source Compilers for Heterogeneous Targets

Compilation for Heterogeneous Computing: Automating Analyses, Transformations and Decisions

Recent source codes

SimSYCL: Synchronous, single-threaded, library-only SYCL implementation for debugging and verification

GPU plugin for PySCF

QArray

Celerity: High-level C++ for Accelerator Clusters

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

CIFAR-10 Airbench: 94% on CIFAR-10 in 3.29 second

LOOPer: a polyhedral compiler for expressing fast and portable data parallel algorithms

OpenMC Monte Carlo Code

Polygeist: C/C++ frontend for MLIR

Parallel Gaussian process with kernel approximation in CUDA

Most viewed papers (last 30 days)