Beau Johnston, Jeffrey S. Vetter, Josh Milthorpe
Tags: AMD Radeon VII, ATI, Benchmarking, Computer science, CUDA, Heterogeneous systems, HIP, Matrix multiplication, nVidia, OpenCL, Package, Performance, Tesla P100
November 29, 2020 by
hgpuSteven Harris, Roger D. Chamberlain, Christopher Gill
Thomas Faingnaert, Tim Besard, Bjorn De Sutter
Tags: Computer science, CUBLAS, CUDA, Julia, Machine learning, Mathematical Software, Matrix multiplication, Mixed precision, nVidia, nVidia GeForce RTX 2080 Ti, Package, Performance
Orestis Zachariadis, Nitin Satpute, Juan Gómez-Luna, Joaquín Olivares
Tags: Algorithms, Computer science, CUDA, Matrix multiplication, Mixed precision, nVidia, nVidia GeForce RTX 2070, nVidia Titan RTX, Package, Performance, Sparse matrix
Guyue Huang, Guohao Dai, Yu Wang, Huazhong Yang
Trevor Gale, Matei Zaharia, Cliff Young, Erich Elsen
Somashekaracharya G. Bhaskaracharya, Julien Demouth, Vinod Grover
Tags: Compilers, Computer science, CUBLAS, CUDA, Deep learning, Matrix multiplication, nVidia, nVidia Quadro GV100, Performance, Programming Languages, PTX
Mohammadreza Soltaniyeh, Richard P. Martin, Santosh Nagarakatte