Yiran Lei, Dongjoo Lee, Liangyu Zhao, Daniar Kurniawan, Chanmyeong Kim, Heetaek Jeong, Changsu Kim, Hyeonseong Choi, Liangcheng Yu, Arvind Krishnamurthy, Justine Sherry, Eriko Nurvitadhi
Aymeric Millan, Thomas Padioleau, Julien Bigot
Tags: AMD Radeon Instinct MI250X, ATI, Computer science, CUDA, FFT, Neural networks, nVidia, nVidia A100, Package, performance portability, SYCL
Ahmed Heakl, Sarim Hashmi, Gustavo Bertolo Stahl, Seung Hun Eddie Han, Salman Khan, Abdulrahman Mahmoud
Tags: AI, AMD Radeon RX 7900 XT, ATI, Computer science, CUDA, HIP, Machine learning, nVidia, nVidia A100, OpenCL, Package, Programming Languages, PTX
Zhonggen Li, Xiangyu Ke, Yifan Zhu, Yunjun Gao, Feifei Li
Burkhard Ringlein, Thomas Parnell, Radu Stoica
Tags: AMD Radeon Instinct MI250, ATI, Auto-Tuning, Computer science, CUDA, DSL, HIP, LLM, nVidia, nVidia A100, Performance, performance portability
Aashaka Shah, Abhinav Jangda, Binyang Li, Caio Rocha, Changho Hwang, Jithin Jose, Madan Musuvathi, Olli Saarikivi, Peng Cheng, Qinghua Zhou, Roshan Dathathri, Saeed Maleki, Ziyue Yang
Tags: AI, AMD Radeon Instinct MI300X, ATI, Computer science, CUDA, Heterogeneous systems, HIP, nVidia, nVidia A100, nVidia H100, Package
Weijie Lv, Xuan Xia, Sheng-Jun Huang
Patrick H. Coppock, Brian Zhang, Eliot H. Solomon, Vasilis Kypriotis, Leon Yang, Bikash Sharma, Dan Schatzberg, Todd C. Mowry, Dimitrios Skarlatos
Dimitar Mileski, Nikola Petrovski, Marjan Gusev
Timothée David--Cléris, Guillaume Laibe, Yona Lapeyre
Tags: AMD, AMD Radeon Instinct MI250X, Astrophysics, CUDA, MPI, nVidia, nVidia A100, OpenMP, Package, Physics, PTX, ROCm, SYCL
Fabian Knorr, Philip Salzmann, Peter Thoman, Thomas Fahringer
Mohammad Atif, Tianle Wang, Zhihua Dong, Charles Leggett, Meifeng Lin