Carlo Baronio, Pietro Marsella, Ben Pan, Simon Guo, Silas Alberti
Joel Schlotthauer, Christian Kroos, Chris Hinze, Viktor Hangya, Luzian Hahn, Fabian Küch
Changxin Ke, Rui Zhang, Shuo Wang, Li Ding, Guangli Li, Yuanbo Wen, Shuoming Zhang, Ruiyuan Xu, Jin Qin, Jiaming Guo, Chenxi Wang, Ling Li, Qi Guo, Yunji Chen
Wali Mohammad Abdullah, Azmain Kabir
Joshua H. Davis, Daniel Nichols, Ishan Khillan, Abhinav Bhatele
Boyi Liu, Yongguang Lu, Jianguo Zhao, Qiang Yang, Wen Wu, Lin Chen, Jagmohan Chauhan, Jun Zhang
Zixian Wang, Cole Ramos, Muhammad A. Awad, Keith Lowery
Mugeng Liu, Siqi Zhong, Weichen Bi, Yixuan Zhang, Zhiyang Chen, Zhenpeng Chen, Xuanzhe Liu, Yun Ma
Wentao Chen, Jiace Zhu, Qi Fan, Yehan Ma, An Zou
Jiaqi Lv, Xufeng He, Yanchen Liu, Xu Dai, Yang Hu, Shouyi Yin
Tags: AI, Benchmarking, Compilers, Computer science, CUDA, Deep learning, LLM, nVidia, nVidia A100, Package, performance portability
Yong-Cheng Liaw, Shuo-Han Chen