hgpu.org » nVidia H800
Weile Luo, Ruibo Fan, Zeyu Li, Dayou Du, Qiang Wang, Xiaowen Chu
Tags: Artificial intelligence, Benchmarking, Computer science, CUDA, Deep learning, nVidia, nVidia A100, nVidia GeForce RTX 4090, nVidia H800, Performance, PTX
February 25, 2024 by hgpu