Ruifan Chu, Anbang Wang, Xiuxiu Bai, Shuai Liu, and Xiaoshe Dong
Genghan Zhang, Shaowei Zhu, Anjiang Wei, Zhenyu Song, Allen Nie, Zhen Jia, Nandita Vijaykumar, Yida Wang, Kunle Olukotun
December 29, 2025 by
hgpuRyan Swann, Muhammad Osama, Xiaohu Guo, Bryant Nelson, Lixun Zhang, Alex Brown, Yen Ong, Ali Yazdani, Sean Siddens, Ganesh Dasika, Alex Underwood
Tags: AMD, AMD Radeon Instinct MI300X, AMD Radeon Instinct MI350X, ATI, BLAS, Computer science, HPC, Package, Performance, ROCm, Triton
Burkhard Ringlein, Jan van Lunteren, Radu Stoica, Thomas Parnell
Tags: AMD Radeon Instinct MI250, AMD Radeon Instinct MI300X, ATI, Computer science, CUDA, DSL, HIP, LLM, nVidia, nVidia H100, Performance, Programming Languages, Triton
November 23, 2025 by
hgpuKelun Lei, Hailong Yang, Huaitao Zhang, Xin You, Kaige Zhang, Zhongzhi Luan, Yi Liu, Depei Qian
November 16, 2025 by
hgpuWilliam Hu, Drew Wadsworth, Sean Siddens, Stanley Winata, Daniel Y. Fu, Ryann Swann, Muhammad Osama, Christopher Ré, Simran Arora
November 16, 2025 by
hgpuZijian Zhang, Rong Wang, Shiyang Li, Yuebo Luo, Mingyi Hong, Caiwen Ding
Nandor Licker, Kevin Hu, Vladimir Zaytsev, Lequn Chen
Chandrish Ambati, Trung Diep
Min Si, Pavan Balaji, Yongzhou Chen, Ching-Hsiang Chu, Adi Gangidi, Saif Hasan, Subodh Iyengar, Dan Johnson, Bingzhe Liu, Jingliang Ren, Ashmitha Jeevaraj Shetty, Greg Steinbrecher, Xinfeng Xie, Yulun Wang, Bruce Wu, Jingyi Yang, Mingran Yang, Minlan Yu, Cen Zhao, Wes Bland, Denis Boyda, Suman Gumudavelli, Cristian Lumezanu, Rui Miao, Zhe Qu, Venkat Ramesh, Maxim Samoylov, Jan Seidel, Feng Tian, Qiye Tan, Shuqiang Zhang, Yimeng Zhao, Shengbao Zheng, Art Zhu, Hongyi Zeng
Yifan Zhao, Egan Johnson, Prasanth Chatarasi, Vikram Adve, Sasa Misailovic
Tags: AMD Radeon Instinct MI300X, ATI, Computer science, CUDA, Deep learning, nVidia, nVidia A100, nVidia RTX A5000, nVidia RTX A6000, Package, Performance, ROCm