Hao Zhu, Baojian Hua, Xinrong Lin, Yufei Wu
Chun-Hee Lee, Dong-oh Kang, Hwa Jeon Song
Pietro Incardona, Aryaman Gupta, Serhii Yaskovets, Ivo F. Sbalzarini
Tags: AMD RX Vega 64, ATI, Computer science, CUDA, nVidia, nVidia A100, nVidia GeForce RTX 3090, OpenACC, OpenCL, OpenMP, Package, Performance, performance portability, SYCL
Bin Lei, Caiwen Ding, Le Chen, Pei-Hung Lin, Chunhua Liao
Shilei Tian, Barbara Chapman, Johannes Doerfert
Mingyu Liang, Wenyin Fu, Louis Feng, Zhongyi Lin, Pavani Panakanti, Shengbao Zheng, Srinivas Sridharan, Christina Delimitrou
Tags: AI, Benchmarking, Code generation, Computer science, CUDA, nVidia, Package, Performance, PyTorch, Tesla A100, Tesla V100
Anil Shanbhag, Bobbi W. Yogatama, Xiangyao Yu, Samuel Madden
Zhihe Zhao, Neiwen Ling, Nan Guan, Guoliang Xing
Jacob Faibussowitsch, Mark F. Adams, Richard Tran Mills, Stefano Zampini, Junchao Zhang
Pieter Hijma, Stijn Heldens, Alessio Sclocco, Ben van Werkhoven, Henri E. Bal