Xiaoyan Liu, Yi Liu, Ming Dun, Bohong Yin, Hailong Yang, Zhongzhi Luan, Depei Qian

Beau Johnston, Jeffrey S. Vetter, Josh Milthorpe

Tags: AMD Radeon VII, ATI, Benchmarking, Computer science, CUDA, Heterogeneous systems, HIP, Matrix multiplication, nVidia, OpenCL, Package, Performance, Tesla P100

November 29, 2020 by

hgpuSteven Harris, Roger D. Chamberlain, Christopher Gill

Thomas Faingnaert, Tim Besard, Bjorn De Sutter

Tags: Computer science, CUBLAS, CUDA, Julia, Machine learning, Mathematical Software, Matrix multiplication, Mixed precision, nVidia, nVidia GeForce RTX 2080 Ti, Package, Performance

Orestis Zachariadis, Nitin Satpute, Juan Gómez-Luna, Joaquín Olivares

Tags: Algorithms, Computer science, CUDA, Matrix multiplication, Mixed precision, nVidia, nVidia GeForce RTX 2070, nVidia Titan RTX, Package, Performance, Sparse matrix

Guyue Huang, Guohao Dai, Yu Wang, Huazhong Yang

Trevor Gale, Matei Zaharia, Cliff Young, Erich Elsen

Somashekaracharya G. Bhaskaracharya, Julien Demouth, Vinod Grover

Tags: Compilers, Computer science, CUBLAS, CUDA, Deep learning, Matrix multiplication, nVidia, nVidia Quadro GV100, Performance, Programming Languages, PTX