Hiroyuki Ootomo, Katsuhisa Ozaki, Rio Yokota
Tags: Computer science, CUBLAS, CUDA, Deep learning, Linear Algebra, Machine learning, Matrix multiplication, nVidia, nVidia A100, nVidia Jetson AGX Orin, nVidia RTX 6000 Ada, nVidia Titan RTX, Package
Shilei Tian, Tom Scogland, Barbara Chapman, Johannes Doerfert
Lukas Mazur, Dennis Bollweg, David A. Clarke, Luis Altenkort, Olaf Kaczmarek, Rasmus Larsen, Hai-Tao Shu, Jishnu Goswami, Philipp Scior, Hauke Sandmeyer, Marius Neumann, Henrik Dick, Sajid Ali, Jangho Kim, Christian Schmidt, Peter Petreczky, Swagato Mukherjee
Tags: Algorithms, AMD Radeon Instinct MI250X, ATI, CUDA, High Energy Physics - Lattice, HIP, MPI, nVidia, nVidia A100, Package, Physics, QCD
Tobias Groth, Sven Groppe, Thilo Pionteck, Franz Valdiek, Martin Koppehel
Leonardo Solis-Vasquez, Edward Mascarenhas, Andreas Koch
Siddharth Singh, Zack Sating, Abhinav Bhatele
Igor Sfiligoi, Emily A. Belli, Jeff Candy, Reuben D. Budiardja
Yueming Hao, Xu Zhao, Bin Bao, David Berard, Will Constable, Adnan Aziz, Xu Liu
Shixun Wu, Yujia Zhai, Jinyang Liu, Jiajun Huang, Zizhe Jian, Bryan M. Wong, Zizhong Chen
Tags: Code generation, Computer science, CUDA, GEMM, Linear Algebra, Matrix multiplication, nVidia, nVidia A100, Package, Performance, Reliability, Tesla T4
Boyuan Zhang, Jiannan Tian, Sheng Di, Xiaodong Yu, Yunhe Feng, Xin Liang, Dingwen Tao, Franck Cappello
Erik A. Träff, Anton Rydahl, Sven Karlsson, Ole Sigmund, Niels Aage