hgpu.org » nVidia V100
René Caspart, Sebastian Ziegler, Arvid Weyrauch, Holger Obermaier, Simon Raffeiner, Leon Pascal Schuhmacher, Jan Scholtyssek, Darya Trofimova, Marco Nolden, Ines Reinartz, Fabian Isensee, Markus Götz, Charlotte Debus
Tags: Artificial intelligence, Computer science, Deep learning, Energy-efficient computing, Heterogeneous systems, nVidia, nVidia A100, nVidia V100, Package
December 11, 2022 by hgpu
Connor Kenyon, Collin Capano
Tags: Benchmarking, Computer science, Heterogeneous systems, nVidia, nVidia A100, nVidia V100, OpenCL, Performance
November 6, 2022 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- Performance Portable Gradient Computations Using Source Transformation
- ConTraPh: Contrastive Learning for Parallelization and Performance Optimization
- Block: Balancing Load in LLM Serving with Context, Knowledge and Predictive Scheduling
- Understanding the Landscape of Ampere GPU Memory Errors
- Geak: Introducing Triton Kernel AI Agent & Evaluation Benchmarks
- SIGMo: High-Throughput Batched Subgraph Isomorphism on GPUs for Molecular Matching
- GBOTuner: Autotuning of OpenMP Parallel Codes with Bayesian Optimization and Code Representation Transfer Learning
- DGEMM without FP64 Arithmetic - using FP64 Emulation and FP8 Tensor Cores with Ozaki Scheme
- Luthier: Bridging Auto-Tuning and Vendor Libraries for Efficient Deep Learning Inference
- OpenDwarfs 2025: Modernizing the OpenDwarfs Benchmark Suite for Heterogeneous Computing
* * *