Patrick G. Bridges, Anthony Skjellum, Evan D. Suggs, Derek Schafer, Purushotham V. Bangalore
L.A. Torres, Carlos J. Barrios H, Yves Denneulin
Tags: Computer science, CUBLAS, CUDA, Linear Algebra, Matrix multiplication, Neural networks, nVidia, nVidia A100, Package, Performance, SYCL
Cosmin E. Oancea, Stephen M. Watt
Ruixin Wang, Minghai Lu, Cody Hao Yu, Yi-Hsiang Lai, Tianyi Zhang
Eishi Arima, Minjoon Kang, Issa Saba, Josef Weidendorfer, Carsten Trinitis, Martin Schulz
Andrey Alekseenko, Szilárd Páll, Erik Lindahl
Junjie Li, Yinzhi Wang, Xiao Liang, Hang Liu
Peter Thoman, Fabian Knorr, Luigi Crisci
Zachary Cooper-Baldock, Brenda Vara Almirall, Kiao Inthavong