Zixian Wang, Cole Ramos, Muhammad A. Awad, Keith Lowery
Mugeng Liu, Siqi Zhong, Weichen Bi, Yixuan Zhang, Zhiyang Chen, Zhenpeng Chen, Xuanzhe Liu, Yun Ma
Wentao Chen, Jiace Zhu, Qi Fan, Yehan Ma, An Zou
Jiaqi Lv, Xufeng He, Yanchen Liu, Xu Dai, Yang Hu, Shouyi Yin
Tags: AI, Benchmarking, Compilers, Computer science, CUDA, Deep learning, LLM, nVidia, nVidia A100, Package, performance portability
Yong-Cheng Liaw, Shuo-Han Chen
Antonio Martínez Ibarra, Julian James Stephen, Aurora González Vidal, K. R. Jayaram, Antonio Fernando Skarmeta Gómez
Gregory Bolet, Giorgis Georgakoudis, Harshitha Menon, Konstantinos Parasyris, Niranjan Hasabnis, Hayden Estes, Kirk W. Cameron, Gal Oren
Burkhard Ringlein, Thomas Parnell, Radu Stoica
Tags: AMD Radeon Instinct MI250, ATI, Auto-Tuning, Computer science, CUDA, DSL, HIP, LLM, nVidia, nVidia A100, Performance, performance portability
Neha Prakriya, Zijian Ding, Yizhou Sun, Jason Cong
Jiuqiang Tang, Raman Sarokin, Ekaterina Ignasheva, Grant Jensen, Lin Chen, Juhyun Lee, Andrei Kulik, Matthias Grundmann
Weijie Lv, Xuan Xia, Sheng-Jun Huang