Hiroyuki Ootomo, Katsuhisa Ozaki, Rio Yokota
Tags: Computer science, CUBLAS, CUDA, Deep learning, Linear Algebra, Machine learning, Matrix multiplication, nVidia, nVidia A100, nVidia Jetson AGX Orin, nVidia RTX 6000 Ada, nVidia Titan RTX, Package
Shilei Tian, Tom Scogland, Barbara Chapman, Johannes Doerfert
Kazuaki Matsumura, Simon Garcia De Gonzalo, Antonio J. Peña
Taghreed Bagies, Wei Le, Jeremy Sheafer, Ali Jannesari
Mathis Bouverot-Dupuis, Mary Sheeran
Mohamed Tarek Ibn Ziad, Sana Damani, Aamer Jaleel, Stephen W. Keckler, Mark Stephenson
Yu Zhou, Justin Sonneck, Sweta Banerjee, Stefanie Dörr, Anika Grüneboom, Kristina Lorenz, Jianxu Chen
Reese Levine, Mingun Cho, Devon McKee, Andrew Quinn, Tyler Sorensen
Gargi Alavani, Jineet Desai, Snehanshu Saha, Santonu Sarkar
Lukas Mazur, Dennis Bollweg, David A. Clarke, Luis Altenkort, Olaf Kaczmarek, Rasmus Larsen, Hai-Tao Shu, Jishnu Goswami, Philipp Scior, Hauke Sandmeyer, Marius Neumann, Henrik Dick, Sajid Ali, Jangho Kim, Christian Schmidt, Peter Petreczky, Swagato Mukherjee
Tags: Algorithms, AMD Radeon Instinct MI250X, ATI, CUDA, High Energy Physics - Lattice, HIP, MPI, nVidia, nVidia A100, Package, Physics, QCD