Hiroyuki Ootomo, Katsuhisa Ozaki, Rio Yokota
Tags: Computer science, CUBLAS, CUDA, Deep learning, Linear Algebra, Machine learning, Matrix multiplication, nVidia, nVidia A100, nVidia Jetson AGX Orin, nVidia RTX 6000 Ada, nVidia Titan RTX, Package
Shilei Tian, Tom Scogland, Barbara Chapman, Johannes Doerfert
Kazuaki Matsumura, Simon Garcia De Gonzalo, Antonio J. Peña
Taghreed Bagies, Wei Le, Jeremy Sheafer, Ali Jannesari
Mathis Bouverot-Dupuis, Mary Sheeran
Mohamed Tarek Ibn Ziad, Sana Damani, Aamer Jaleel, Stephen W. Keckler, Mark Stephenson
Yu Zhou, Justin Sonneck, Sweta Banerjee, Stefanie Dörr, Anika Grüneboom, Kristina Lorenz, Jianxu Chen
Reese Levine, Mingun Cho, Devon McKee, Andrew Quinn, Tyler Sorensen
Gargi Alavani, Jineet Desai, Snehanshu Saha, Santonu Sarkar
Fumiya Kono, Naohito Nakasato, Maho Nakata
Tobias Groth, Sven Groppe, Thilo Pionteck, Franz Valdiek, Martin Koppehel