hgpu.org » Operating systems
Shinpei Kato, Karthik Lakshmanan, Ragunathan Rajkumar, Yutaka Ishikawa
Tags: Benchmarking, Computer science, nVidia, nVidia GeForce 9800 GT, nVidia GeForce GTX 285, nVidia GeForce GTX 480, OpenGL, Operating systems, Package, Real-time graphics, Task scheduling
September 11, 2011 by hgpu
Alexander Schmidt, Andreas Polze
September 9, 2011 by hgpu
Vishakha Gupta, Rob Knauerhase, Karsten Schwan
Tags: Cloud, Computer science, Heterogeneous systems, Operating systems, Performance, Virtualization
September 9, 2011 by hgpu
Vishakha Gupta, Karsten Schwan, Niraj Tolia, Vanish Talwar, Parthasarathy Ranganathan
Tags: Computer science, CUDA, Heterogeneous systems, nVidia, nVidia GeForce 9800 GTX, Operating systems, Task scheduling, Virtualization
September 7, 2011 by hgpu
Christopher J. Rossbach, Jon Currey, Emmett Witchel
September 7, 2011 by hgpu
Flavio Vella, Riccardo M. Cefal, Alessandro Costantini, Osvaldo Gervasi, Claudio Tanci
Tags: Cloud, Computer science, Grid, Operating systems, Virtualization
August 8, 2011 by hgpu
Shinpei Kato, Karthik Lakshmanan, Yutaka Ishikawa, Ragunathan (Raj) Rajkumar
June 22, 2011 by hgpu
Dongkyun Jeong, Kamalneet Singh, Namin Kim, Soochan Lim
May 16, 2011 by hgpu
Andrew Baumann, Paul Barham, Pierre E. Dagand, Tim Harris, Rebecca Isaacs, Simon Peter, Timothy Roscoe, Adrian Schüpbach, Akhilesh Singhania
November 27, 2010 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- Performance Portable Gradient Computations Using Source Transformation
- ConTraPh: Contrastive Learning for Parallelization and Performance Optimization
- Block: Balancing Load in LLM Serving with Context, Knowledge and Predictive Scheduling
- Understanding the Landscape of Ampere GPU Memory Errors
- Geak: Introducing Triton Kernel AI Agent & Evaluation Benchmarks
- SIGMo: High-Throughput Batched Subgraph Isomorphism on GPUs for Molecular Matching
- GBOTuner: Autotuning of OpenMP Parallel Codes with Bayesian Optimization and Code Representation Transfer Learning
- DGEMM without FP64 Arithmetic - using FP64 Emulation and FP8 Tensor Cores with Ozaki Scheme
- Luthier: Bridging Auto-Tuning and Vendor Libraries for Efficient Deep Learning Inference
- OpenDwarfs 2025: Modernizing the OpenDwarfs Benchmark Suite for Heterogeneous Computing
* * *