Martin Langhammer, George Constantinides
Mingyu Liang, Wenyin Fu, Louis Feng, Zhongyi Lin, Pavani Panakanti, Shengbao Zheng, Srinivas Sridharan, Christina Delimitrou
Tags: AI, Benchmarking, Code generation, Computer science, CUDA, nVidia, Package, Performance, PyTorch, Tesla A100, Tesla V100
Zhihe Zhao, Neiwen Ling, Nan Guan, Guoliang Xing
Anil Shanbhag, Bobbi W. Yogatama, Xiangyao Yu, Samuel Madden
Jacob Faibussowitsch, Mark F. Adams, Richard Tran Mills, Stefano Zampini, Junchao Zhang
Daniel Nichols, Aniruddha Marathe, Harshitha Menon, Todd Gamblin, Abhinav Bhatele
Pieter Hijma, Stijn Heldens, Alessio Sclocco, Ben van Werkhoven, Henri E. Bal
Daniel Cussen, Jeffrey D. Ullman
Yujie Wang, Youhe Jiang, Xupeng Miao, Fangcheng Fu, Xiaonan Nie, Bin Cui
William F. Godoy, Pedro Valero-Lara, Keita Teranishi, Prasanna Balaprakash, Jeffrey S. Vetter
Tags: AI, Artificial intelligence, Benchmarking, Code generation, Computer science, CUDA, Fortran, HPC, Julia, nVidia, OpenACC, OpenMP, Package, Python