L.A. Torres, Carlos J. Barrios H, Yves Denneulin
Tags: Computer science, CUBLAS, CUDA, Linear Algebra, Matrix multiplication, Neural networks, nVidia, nVidia A100, Package, Performance, SYCL
Jianhua Gao, Bingjie Liu, Weixing Ji, Hua Huang
Ryan Swann, Muhammad Osama, Karthik Sangaiah, Jalal Mahmud
Andres E. Tomas, Enrique S. Quintana-Orti, Hartwig Anzt
Ryan R. Curtin, Marcus Edel, Conrad Sanderson
Hiroyuki Ootomo, Katsuhisa Ozaki, Rio Yokota
Tags: Computer science, CUBLAS, CUDA, Deep learning, Linear Algebra, Machine learning, Matrix multiplication, nVidia, nVidia A100, nVidia Jetson AGX Orin, nVidia RTX 6000 Ada, nVidia Titan RTX, Package
Shixun Wu, Yujia Zhai, Jinyang Liu, Jiajun Huang, Zizhe Jian, Bryan M. Wong, Zizhong Chen
Tags: Code generation, Computer science, CUDA, GEMM, Linear Algebra, Matrix multiplication, nVidia, nVidia A100, Package, Performance, Reliability, Tesla T4
Noel Chalmers, Jakub Kurzak, Damon McDougall, Paul T. Bauman