Kazuaki Matsumura, Simon Garcia De Gonzalo, Antonio J. Peña
Pratik Fegade, Tianqi Chen, Phillip B. Gibbons, Todd C. Mowry
Juan Fumero, György Rethy, Athanasios Stratikopoulos, Nikos Foutris, Christos Kotselidis
Shixun Wu, Yujia Zhai, Jinyang Liu, Jiajun Huang, Zizhe Jian, Bryan M. Wong, Zizhong Chen
Tags: Code generation, Computer science, CUDA, GEMM, Linear Algebra, Matrix multiplication, nVidia, nVidia A100, Package, Performance, Reliability, Tesla T4
Vsevolod Livinskii, Dmitry Babokin, John Regehr
Giorgis Georgakoudis, Konstantinos Parasyris, Chunhua Liao, David Beckingsale, Todd Gamblin, Bronis de Supinski
Tags: AMD Radeon Instinct Mi50, ATI, Benchmarking, Code generation, Compilers, Computer science, CUDA, Heterogeneous systems, Machine learning, nVidia, OpenMP, performance portability, Tesla P100, Tesla V100
Tal Ben-Nun, Berke Ates, Alexandru Calotoiu, Torsten Hoefler
Kazuaki Matsumura, Simon Garcia De Gonzalo, Antonio J. Peña
Tags: Benchmarking, Code generation, Computer science, CUDA, nVidia, nVidia GeForce GTX Titan X, OpenACC, Package, PTX, Tesla K40, Tesla K80, Tesla V100
Kun Wu, Mert Hidayetoğlu, Xiang Song, Sitao Huang, Da Zheng, Israt Nisa, Wen-mei Hwu
Jianhui Li, Zhennan Qin, Yijie Mei, Jingze Cui, Yunfei Song, Ciyong Chen, Yifei Zhang, Longsheng Du, Xianhang Cheng, Baihui Jin, Jason Ye, Eric Lin, Dan Lavery