Kaixuan Zhang, Yunfan Cui, Shuhao Zhang, Chutong Ding, Shiyou Qian, Luping Wang, Jian Cao, Guangtao Xue, Cheng Huang, Guodong Yang, Liping Zhang
Tags: Computer science, CUDA, Heterogeneous systems, Machine learning, nVidia, nVidia A100, nVidia A40, nVidia H100, nVidia H20, nVidia H200, nVidia H800, nVidia L20, nVidia L40, nVidia RTX 6000 Ada, Performance, Triton
Le Chen1, Nuo Xu, Winson Chen, Bin Lei, Pei-Hung Lin, Dunzhi Zhou, Rajeev Thakur, Caiwen Ding, Ali Jannesari, Chunhua Liao
December 21, 2025 by
hgpuAaron Jarmusch, Sunita Chandrasekaran
Nandor Licker, Kevin Hu, Vladimir Zaytsev, Lequn Chen
Siddharth Samsi, Dan Campbell, Emanuel Scoullos, Oded Green
September 14, 2025 by
hgpuDavid Jin, Alexis Montoison, Sungho Shin
Tags: AMD Radeon Instinct MI300X, ATI, Benchmarking, BLAS, Computer science, CUDA, Factorization, Julia, nVidia, nVidia H200, Package, ROCm
September 7, 2025 by
hgpuCarlo Baronio, Pietro Marsella, Ben Pan, Simon Guo, Silas Alberti
Mohammad Firas Sada, John J. Graham, Elham E Khoda, Mahidhar Tatineni, Dmitry Mishin, Rajesh K. Gupta, Rick Wagner, Larry Smarr, Thomas A. DeFanti, Frank Würthwein