high performance computing on graphics processing units: hgpu.org

hgpu.org » Operating systems

A shared file system abstraction for heterogeneous architectures

Mark Silberstein, Idit Keidar

View

Download (PDF)

Tags: Computer science, Heterogeneous systems, Operating systems

November 2, 2011 by hgpu

The MOSIX Virtual OpenCL (VCL) Cluster Platform

Amnon Barak, Amnon Shiloh

View

Download (PDF)

Source codes

Tags: APU, Computer science, GPU cluster, Heterogeneous systems, MPI, nVidia, nVidia GeForce GTX 480, OpenCL, Operating systems, Package

October 24, 2011 by hgpu

Efficient Synchronization Primitives for GPUs

Jeff A. Stuart, John D. Owens

View

Download (PDF)

Tags: Algorithms, Benchmarking, Computer science, CUDA, Data Structures and Algorithms, nVidia, nVidia GeForce GTX 295, nVidia GeForce GTX 580, Operating systems

October 21, 2011 by hgpu

Operating Systems Challenges for GPU Resource Management

Shinpei Kato, Scott Brandt, Yutaka Ishikawa, Ragunathan (Raj) Rajkumar

View

Download (PDF)

Tags: Computer science, CUDA, GPU cluster, nVidia, OpenCL, Operating systems, Virtualization

October 15, 2011 by hgpu

PTask: Operating System Abstractions To Manage GPUs as Compute Devices

Christopher J. Rossbach, Jon Currey, Mark Silberstein, Baishakhi Ray, Emmett Witchel

View

Download (PDF)

Tags: Computer science, CUDA, HLSL, nVidia, nVidia GeForce GT 230, nVidia GeForce GTX 470, nVidia GeForce GTX 580, OpenCL, Operating systems, Performance, Programming techniques

October 2, 2011 by hgpu

Real-Time Handling of GPU Interrupts in LITMUSRT

Glenn A. Elliott, Chih-Hao Sun, and James H. Anderson

View

Download (PDF)

Tags: Algorithms, Computer science, CUDA, nVidia, nVidia GeForce GTX 470, Operating systems

September 30, 2011 by hgpu

Supporting GPU sharing in cloud environments with a transparent runtime consolidation framework

Vignesh T. Ravi, Michela Becchi, Gagan Agrawal, Srimat Chakradhar

View

Download (PDF)

Tags: Algorithms, Cloud, Computer science, CUDA, nVidia, Operating systems, Performance, Tesla C2050, Virtualization

September 20, 2011 by hgpu

Performing with CUDA

William B. Langdon

View

Download (PDF)

Tags: Computer science, CUDA, nVidia, Operating systems, Performance, Review, Software Engineering, Tutorial

September 15, 2011 by hgpu

The case for VOS: the vector operating system

Vijay Vasudevan, David G. Andersen, Michael Kaminsky

View

Download (PDF)

Tags: Computer science, Operating systems

September 14, 2011 by hgpu

Optimizing a shared virtual memory system for a heterogeneous CPU-accelerator platform

Shoumeng Yan, Xiaocheng Zhou, Ying Gao, Hu Chen, Gansha Wu, Sai Luo, Bratin Saha

Tags: Computer science, Heterogeneous systems, Memory, Operating systems, Performance, Programming Languages

September 12, 2011 by hgpu

TimeGraph: GPU scheduling for real-time multi-tasking environments

Shinpei Kato, Karthik Lakshmanan, Ragunathan Rajkumar, Yutaka Ishikawa

View

Download (PDF)

Source codes

Tags: Benchmarking, Computer science, nVidia, nVidia GeForce 9800 GT, nVidia GeForce GTX 285, nVidia GeForce GTX 480, OpenGL, Operating systems, Package, Real-time graphics, Task scheduling

September 11, 2011 by hgpu

KAdvice: infering synchronization patterns from an existing codebase

Alexander Schmidt, Andreas Polze

Tags: Computer science, Operating systems

September 9, 2011 by hgpu

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

gpu_tracker: Python package for tracking and profiling GPU utilization in both desktop and high-performance computing environments

LOOPer: a polyhedral compiler for expressing fast and portable data parallel algorithms

LOOPer: A Learned Automatic Code Optimizer For Polyhedral Compilers

OpenMC Monte Carlo Code

Performance Portable Monte Carlo Particle Transport on Intel, NVIDIA, and AMD GPUs

Polygeist: C/C++ frontend for MLIR

Retargeting and Respecializing GPU Workloads for Performance Portability

Parallel Gaussian process with kernel approximation in CUDA

Optical flow algorithms for SYCL

SYCL in the edge: performance and energy evaluation for heterogeneous acceleration

OpenMP5-Offload-OpenMC-Intel-PVC

Distributed OpenMP Offloading of OpenMC on Intel GPU MAX Accelerators

See all packages

* * *

high performance computing on graphics processing units: hgpu.org

A shared file system abstraction for heterogeneous architectures

The MOSIX Virtual OpenCL (VCL) Cluster Platform

Efficient Synchronization Primitives for GPUs

Operating Systems Challenges for GPU Resource Management

PTask: Operating System Abstractions To Manage GPUs as Compute Devices

Real-Time Handling of GPU Interrupts in LITMUSRT

Performing with CUDA

The case for VOS: the vector operating system

Optimizing a shared virtual memory system for a heterogeneous CPU-accelerator platform

TimeGraph: GPU scheduling for real-time multi-tasking environments

KAdvice: infering synchronization patterns from an existing codebase

Recent source codes

QArray

Celerity: High-level C++ for Accelerator Clusters

CIFAR-10 Airbench: 94% on CIFAR-10 in 3.29 second

gpu_tracker: Context manager and CLI that tracks the computational-resource-usage of a code block or shell command, particularly the GPU usage

LOOPer: a polyhedral compiler for expressing fast and portable data parallel algorithms

OpenMC Monte Carlo Code

Polygeist: C/C++ frontend for MLIR

Parallel Gaussian process with kernel approximation in CUDA

Optical flow algorithms for SYCL

OpenMP5-Offload-OpenMC-Intel-PVC

Most viewed papers (last 30 days)