Jon Hu, Thomas Jia, Jing Zhu, Zhendong Yu
Aaron Jarmusch, Connor Vitz, Sunita Chandrasekaran
February 16, 2026 by hgpu
Yang Yu, Peiyu Zang, Chi Hsu Tsai, Haiming Wu, Yixin Shen, Jialing Zhang, Haoyu Wang, Zhiyou Xiao, Jingze Shi, Yuyu Luo, Wentao Zhang, Chunlei Men, Guang Liu, Yonghua Lin
Gang Liao, Hongsen Qin, Ying Wang, Alicia Golden, Michael Kuchnik, Yavuz Yetim, Jia Jiunn Ang, Chunli Fu, Yihan He, Samuel Hsia, Zewei Jiang, Dianshi Li, Uladzimir Pashkevich, Varna Puvvada, Feng Shi, Matt Steiner, Ruichao Xiao, Nathan Yan, Xiayu Yu, Zhou Fang, Abdul Zainul-Abedin, Ketan Singh, Hongtao Yu, Wenyuan Chi, Barney Huang, Sean Zhang, Noah Weller, Zach Marine, Wyatt Cook, Carole-Jean Wu, Gaoxiang Liu
Tags: AI, AMD Radeon Instinct MI300X, AMD Radeon Instinct MI350X, ATI, Computer science, CUDA, Deep learning, Heterogeneous systems, LLM, nVidia, nVidia A100, nVidia H100, PTX, ROCm, Triton
Marco Kurzynski, Shaizeen Aga, Di Wu
December 15, 2025 by hgpu
Ryan Swann, Muhammad Osama, Xiaohu Guo, Bryant Nelson, Lixun Zhang, Alex Brown, Yen Ong, Ali Yazdani, Sean Siddens, Ganesh Dasika, Alex Underwood
Tags: AMD, AMD Radeon Instinct MI300X, AMD Radeon Instinct MI350X, ATI, BLAS, Computer science, HPC, Package, Performance, ROCm, Triton
Pedro Antunes, Ana Rita Ortigoso, Gabriel Vieira, Daniel Fuentes, Luís Frazão, Nuno Costa, António Pereira
November 23, 2025 by hgpu
Yifan Zhao, Egan Johnson, Prasanth Chatarasi, Vikram Adve, Sasa Misailovic
Tags: AMD Radeon Instinct MI300X, ATI, Computer science, CUDA, Deep learning, nVidia, nVidia A100, nVidia RTX A5000, nVidia RTX A6000, Package, Performance, ROCm
Mohammad Zaeed, Tanzima Z. Islam, Vladimir Inđić
William F. Godoy, Tatiana Melnichenko, Pedro Valero-Lara, Wael Elwasif, Philip Fackler, Rafael Ferreira Da Silva, Keita Teranishi, Jeffrey S. Vetter
Tags: AI, AMD Radeon Instinct MI300A, ATI, Compilers, Computer science, CUDA, HIP, HPC, nVidia, nVidia H100, Package, Python, ROCm
September 28, 2025 by hgpu
Andreas Herten, Olga Pearce, Filipe S. M. Guimarães
Tags: Benchmarking, Computer science, CUDA, Fortran, HIP, HPC, MPI, OpenACC, OpenCL, OpenMP, Package, Performance, ROCm, SYCL
September 14, 2025 by hgpu
David Jin, Alexis Montoison, Sungho Shin
Tags: AMD Radeon Instinct MI300X, ATI, Benchmarking, BLAS, Computer science, CUDA, Factorization, Julia, nVidia, nVidia H200, Package, ROCm
September 7, 2025 by hgpu