hgpu.org » nVidia H800
Weile Luo, Ruibo Fan, Zeyu Li, Dayou Du, Qiang Wang, Xiaowen Chu
Tags: Artificial intelligence, Benchmarking, Computer science, CUDA, Deep learning, nVidia, nVidia A100, nVidia GeForce RTX 4090, nVidia H800, Performance, PTX
February 25, 2024 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- Jailbreaking LLM-Controlled Robots
- Over-synchronization in GPU Programs
- Testing GPU Numerics: Finding Numerical Differences Between NVIDIA and AMD GPUs
- Accelerating Drug Discovery in AutoDock-GPU with Tensor Cores
- Mixed-precision finite element kernels and assembly: Rounding error analysis and hardware acceleration
- Using modern C++ to improve CUDA programs
- General-Purpose Computing on Tensor Processors
- Superpipeline: A Universal Approach for Reducing GPU Memory Usage in Large Models
- LLload: An Easy-to-Use HPC Utilization Tool
- Online Energy Optimization in GPUs: A Multi-Armed Bandit Approach
* * *