high performance computing on graphics processing units: hgpu.org

hgpu.org » Operating systems

Glider: A GPU Library Driver for Improved System Security

Ardalan Amiri Sani, Lin Zhong, Dan S. Wallach

View

Download (PDF)

Tags: ATI, ATI Radeon HD 6450, Computer science, OpenCL, OpenGL, Operating systems, Security

November 18, 2014 by hgpu

GPUvm: Why Not Virtualizing GPUs at the Hypervisor?

Yusuke Suzuki, Shinpei Kato, Hiroshi Yamada, Kenji Kono

View

Download (PDF)

Tags: Cloud, Computer science, CUDA, nVidia, nVidia Quadro 6000, Operating systems, Virtualization

July 4, 2014 by hgpu

Preemptive Thread Block Scheduling with Online Structural Runtime Prediction for Concurrent GPGPU Kernels

Sreepathi Pai, R. Govindarajan, Matthew J. Thazhuthaveetil

View

Download (PDF)

Tags: Computer science, CUDA, nVidia, Operating systems, Performance, Tesla K20

June 25, 2014 by hgpu

Heterogeneity-aware Fault Tolerance using a Self-Organizing Runtime System

Mario Kicherer, Wolfgang Karl

View

Download (PDF)

Tags: Computer science, CUDA, Heterogeneous systems, nVidia, nVidia GeForce GTX 275, nVidia GeForce GTX 560 Ti, Operating systems

May 17, 2014 by hgpu

Transparent Checkpoint-Restart for Hardware-Accelerated 3D Graphics

Samaneh Kazemi, Rohan Garg, Gene Cooperman

View

Download (PDF)

Tags: Computer science, nVidia, nVidia GeForce GT 650 M, OpenGL, Operating systems

December 24, 2013 by hgpu

Task scheduling in hybrid CPU-GPU systems

Martin Krulis, Zbynek Falt, David Bednarek, Jakub Yaghob

View

Download (PDF)

Tags: Computer science, Heterogeneous systems, nVidia, nVidia GeForce GTX 580, OpenCL, Operating systems, Task scheduling, Tesla M2090

June 2, 2013 by hgpu

GPUfs: Integrating a File System with GPUs

Mark Silberstein, Bryan Ford, Idit Keidar, Emmett Witchel

View

Download (PDF)

Tags: Computer science, CUDA, nVidia, Operating systems, Tesla C2075

January 26, 2013 by hgpu

Implementing Open-Source CUDA Runtime

Shinpei Kato

View

Download (PDF)

Source codes

Tags: Computer science, CUDA, nVidia, Operating systems, Package

January 23, 2013 by hgpu

Intrusion Detection Architecture Utilizing Graphics Processors

Liberios Vokorokos, Anton Balaz, Branislav Mados

View

Download (PDF)

Tags: Computer science, CUDA, nVidia, nVidia GeForce GTX 260, Operating systems, Security

January 12, 2013 by hgpu

CPUless PCs inside networked control systems

Peter Fodrek, Tomas Murgas, Michal Blaho

View

Download (PDF)

Tags: Algorithms, Computer science, nVidia, OpenCL, Operating systems

December 1, 2012 by hgpu

A Simulation Framework for Scheduling Performance Evaluation on CPU-GPU Heterogeneous System

Flavio Vella, Igor Neri, Osvaldo Gervasi, Sergio Tasso

View

Download (PDF)

Tags: Computer science, Heterogeneous systems, nVidia, OpenCL, Operating systems

September 17, 2012 by hgpu

SnuCL: an OpenCL framework for heterogeneous CPU/GPU clusters

Jungwon Kim, Sangmin Seo, Jun Lee, Jeongho Nah, Gangwon Jo, Jaejin Lee

View

Download (PDF)

Source codes

Tags: Code generation, Computer science, GPU cluster, Heterogeneous systems, MPI, nVidia, nVidia GeForce GTX 480, OpenCL, Operating systems, Package, Programming Languages, Programming techniques

July 26, 2012 by hgpu

CUDAnalyst (CUDA + Analyst)

Towards Feedback-to-Plan Decisions for Self-Evolving LLM Agents in CUDA Kernel Generation

CodegenBench

CodegenBench: Can LLMs Write Efficient Code Across Architectures?

KernelBenchX: A Comprehensive Benchmark for Evaluating LLM-Generated GPU Kernels

CuTile Benchmark Suite: Performance and Productivity Tradeoffs for GPU Kernel Programming on Blackwell Architecture

Evaluating CUDA Tile for AI Workloads on Hopper and Blackwell GPUs

Agentic Code Optimization via Compiler-LLM Cooperation

Device Virtual Machine (DVM)

DVM: Real-Time Kernel Generation for Dynamic AI Models

MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU

See all packages

* * *

high performance computing on graphics processing units: hgpu.org

Glider: A GPU Library Driver for Improved System Security

Preemptive Thread Block Scheduling with Online Structural Runtime Prediction for Concurrent GPGPU Kernels

Heterogeneity-aware Fault Tolerance using a Self-Organizing Runtime System

Transparent Checkpoint-Restart for Hardware-Accelerated 3D Graphics

Task scheduling in hybrid CPU-GPU systems

GPUfs: Integrating a File System with GPUs

Implementing Open-Source CUDA Runtime

Intrusion Detection Architecture Utilizing Graphics Processors

CPUless PCs inside networked control systems

A Simulation Framework for Scheduling Performance Evaluation on CPU-GPU Heterogeneous System

SnuCL: an OpenCL framework for heterogeneous CPU/GPU clusters

Recent source codes

CUDAnalyst (CUDA + Analyst)

CodegenBench

KernelBenchX: A Comprehensive Benchmark for Evaluating LLM-Generated GPU Kernels

CUDA Kernel Fusion Benchmarks

IntelliKit: Agent-first tooling for AMD hardware

DITRON: Distributed Compiler based on Triton for Parallel Systems

CuTile Benchmark Suite: Performance and Productivity Tradeoffs for GPU Kernel Programming on Blackwell Architecture

Agentic Code Optimization via Compiler-LLM Cooperation

Device Virtual Machine (DVM)

MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU

Most viewed papers (last 30 days)