Gang Liao, Hongsen Qin, Ying Wang, Alicia Golden, Michael Kuchnik, Yavuz Yetim, Jia Jiunn Ang, Chunli Fu, Yihan He, Samuel Hsia, Zewei Jiang, Dianshi Li, Uladzimir Pashkevich, Varna Puvvada, Feng Shi, Matt Steiner, Ruichao Xiao, Nathan Yan, Xiayu Yu, Zhou Fang, Abdul Zainul-Abedin, Ketan Singh, Hongtao Yu, Wenyuan Chi, Barney Huang, Sean Zhang, Noah Weller, Zach Marine, Wyatt Cook, Carole-Jean Wu, Gaoxiang Liu
Tags: AI, AMD Radeon Instinct MI300X, AMD Radeon Instinct MI350X, ATI, Computer science, CUDA, Deep learning, Heterogeneous systems, LLM, nVidia, nVidia A100, nVidia H100, PTX, ROCm, Triton
Marco Kurzynski, Shaizeen Aga, Di Wu
December 15, 2025 by
hgpuRyan Swann, Muhammad Osama, Xiaohu Guo, Bryant Nelson, Lixun Zhang, Alex Brown, Yen Ong, Ali Yazdani, Sean Siddens, Ganesh Dasika, Alex Underwood
Tags: AMD, AMD Radeon Instinct MI300X, AMD Radeon Instinct MI350X, ATI, BLAS, Computer science, HPC, Package, Performance, ROCm, Triton
Pedro Antunes, Ana Rita Ortigoso, Gabriel Vieira, Daniel Fuentes, Luís Frazão, Nuno Costa, António Pereira
November 23, 2025 by
hgpuYifan Zhao, Egan Johnson, Prasanth Chatarasi, Vikram Adve, Sasa Misailovic
Tags: AMD Radeon Instinct MI300X, ATI, Computer science, CUDA, Deep learning, nVidia, nVidia A100, nVidia RTX A5000, nVidia RTX A6000, Package, Performance, ROCm
Mohammad Zaeed, Tanzima Z. Islam, Vladimir Inđić
William F. Godoy, Tatiana Melnichenko, Pedro Valero-Lara, Wael Elwasif, Philip Fackler, Rafael Ferreira Da Silva, Keita Teranishi, Jeffrey S. Vetter
Tags: AI, AMD Radeon Instinct MI300A, ATI, Compilers, Computer science, CUDA, HIP, HPC, nVidia, nVidia H100, Package, Python, ROCm
September 28, 2025 by
hgpuAndreas Herten, Olga Pearce, Filipe S. M. Guimarães
Tags: Benchmarking, Computer science, CUDA, Fortran, HIP, HPC, MPI, OpenACC, OpenCL, OpenMP, Package, Performance, ROCm, SYCL
September 14, 2025 by
hgpuDavid Jin, Alexis Montoison, Sungho Shin
Tags: AMD Radeon Instinct MI300X, ATI, Benchmarking, BLAS, Computer science, CUDA, Factorization, Julia, nVidia, nVidia H200, Package, ROCm
September 7, 2025 by
hgpuXiyan Hu, Titus Parker, Connor Phillips, Yifa Yu
Tags: AMD Radeon Instinct MI210, AMD Radeon Instinct MI325X, ATI, Computer science, HIP, nVidia, nVidia A100, Package, Performance, PyTorch, ROCm, Thesis
Jianghui Wang, Vinay Joshi, Saptarshi Majumder, Xu Chao, Bin Ding, Ziqiong Liu, Pratik Prabhanjan Brahma, Dong Li, Zicheng Liu, Emad Barsoum
Zixian Wang, Cole Ramos, Muhammad A. Awad, Keith Lowery