Hyun Park, Parth Patel, Roland Haas, E. A. Huerta
Tal Kadosh, Niranjan Hasabnis, Timothy Mattson, Yuval Pinter, Gal Oren
Tags: Computer science, CUDA, Databases, Fortran, HPC, MPI, nVidia, OpenACC, OpenCL, OpenMP, Package, SYCL
Xinchi Han, Weihao Jiang, Peirui Cao, Qinwei Yang, Yunzhuo Liu, Shuyao Qi, Shengkai Lin, Shizhen Zhao
Ryan R. Curtin, Marcus Edel, Conrad Sanderson
Bo Yang, Zhihao Zhang Kirisame Marisa, Kai Shi
Pietro Incardona, Aryaman Gupta, Serhii Yaskovets, Ivo F. Sbalzarini
Tags: AMD RX Vega 64, ATI, Computer science, CUDA, nVidia, nVidia A100, nVidia GeForce RTX 3090, OpenACC, OpenCL, OpenMP, Package, Performance, performance portability, SYCL
Bin Lei, Caiwen Ding, Le Chen, Pei-Hung Lin, Chunhua Liao
Hanyan Cao, Feng Pan, Yijia Wang, Pan Zhang
Mingyu Liang, Wenyin Fu, Louis Feng, Zhongyi Lin, Pavani Panakanti, Shengbao Zheng, Srinivas Sridharan, Christina Delimitrou
Tags: AI, Benchmarking, Code generation, Computer science, CUDA, nVidia, Package, Performance, PyTorch, Tesla A100, Tesla V100
Anil Shanbhag, Bobbi W. Yogatama, Xiangyao Yu, Samuel Madden
Jacob Faibussowitsch, Mark F. Adams, Richard Tran Mills, Stefano Zampini, Junchao Zhang
William F. Godoy, Pedro Valero-Lara, Keita Teranishi, Prasanna Balaprakash, Jeffrey S. Vetter
Tags: AI, Artificial intelligence, Benchmarking, Code generation, Computer science, CUDA, Fortran, HPC, Julia, nVidia, OpenACC, OpenMP, Package, Python