Gang Liao, Hongsen Qin, Ying Wang, Alicia Golden, Michael Kuchnik, Yavuz Yetim, Jia Jiunn Ang, Chunli Fu, Yihan He, Samuel Hsia, Zewei Jiang, Dianshi Li, Uladzimir Pashkevich, Varna Puvvada, Feng Shi, Matt Steiner, Ruichao Xiao, Nathan Yan, Xiayu Yu, Zhou Fang, Abdul Zainul-Abedin, Ketan Singh, Hongtao Yu, Wenyuan Chi, Barney Huang, Sean Zhang, Noah Weller, Zach Marine, Wyatt Cook, Carole-Jean Wu, Gaoxiang Liu
Tags: AI, AMD Radeon Instinct MI300X, AMD Radeon Instinct MI350X, ATI, Computer science, CUDA, Deep learning, Heterogeneous systems, LLM, nVidia, nVidia A100, nVidia H100, PTX, ROCm, Triton
Muhammad Usman Tariq, Abhinav Jangda, Angelica Moreira, Madan Musuvathi, Tyler Sorensen
Tags: AI, AMD, AMD Radeon Instinct MI200, ATI, Computer science, CUDA, HIP, HLSL, LLM, NLP, nVidia, nVidia RTX A6000
December 29, 2025 by
hgpuMarco Kurzynski, Shaizeen Aga, Di Wu
December 15, 2025 by
hgpuRyan Swann, Muhammad Osama, Xiaohu Guo, Bryant Nelson, Lixun Zhang, Alex Brown, Yen Ong, Ali Yazdani, Sean Siddens, Ganesh Dasika, Alex Underwood
Tags: AMD, AMD Radeon Instinct MI300X, AMD Radeon Instinct MI350X, ATI, BLAS, Computer science, HPC, Package, Performance, ROCm, Triton
Pedro Antunes, Ana Rita Ortigoso, Gabriel Vieira, Daniel Fuentes, Luís Frazão, Nuno Costa, António Pereira
November 23, 2025 by
hgpuMuhammad Awad, Muhammad Osama, Brandon Potter
November 23, 2025 by
hgpuBurkhard Ringlein, Jan van Lunteren, Radu Stoica, Thomas Parnell
Tags: AMD Radeon Instinct MI250, AMD Radeon Instinct MI300X, ATI, Computer science, CUDA, DSL, HIP, LLM, nVidia, nVidia H100, Performance, Programming Languages, Triton
November 23, 2025 by
hgpuWilliam Hu, Drew Wadsworth, Sean Siddens, Stanley Winata, Daniel Y. Fu, Ryann Swann, Muhammad Osama, Christopher Ré, Simran Arora
November 16, 2025 by
hgpuStepan Vanecek, Manuel Walter Mussbacher, Dominik Groessler, Urvij Saroliya, Martin Schulz
Tags: AMD Radeon Instinct MI100, AMD Radeon Instinct MI210, AMD Radeon Instinct MI300X, ATI, Benchmarking, Computer science, CUDA, HIP, nVidia, nVidia A100, nVidia GeForce RTX 2080, nVidia H100, nVidia Quadro P 6000, nVidia V100, Package, PTX
November 16, 2025 by
hgpuChandrish Ambati, Trung Diep
Yifan Zhao, Egan Johnson, Prasanth Chatarasi, Vikram Adve, Sasa Misailovic
Tags: AMD Radeon Instinct MI300X, ATI, Computer science, CUDA, Deep learning, nVidia, nVidia A100, nVidia RTX A5000, nVidia RTX A6000, Package, Performance, ROCm