Yuanhang Yu, Dong Wen, Ying Zhang, Xiaoyang Wang, Wenjie Zhang, Xuemin Lin
Min Li, Yulong Ao, Chao Yang
Hartwig Anzt, Terry Cojean, Goran Flegar, Fritz Göbel, Thomas Grützmacher, Pratik Nayak, Tobias Ribizel, Yuhsiang Mike Tsai, Enrique S. Quintana-Ortí
Tags: Algorithms, AMD Radeon VII, ATI, Computer science, CUDA, HIP, Linear Algebra, Mathematical Software, nVidia, Package, Sparse, Sparse matrix, Tesla V100
Chao-Tung Yang, Jung-Chun Liu, Yu-Wei Chan, Endah Kristiani, Chan-Fu Kuo
Utpal Kiran, Sachin Singh Gautam, Deepak Sharma
Trevor Gale, Matei Zaharia, Cliff Young, Erich Elsen
Yuhsiang M. Tsai, Terry Cojean, Tobias Ribizel, Hartwig Anzt
Somashekaracharya G. Bhaskaracharya, Julien Demouth, Vinod Grover
Tags: Compilers, Computer science, CUBLAS, CUDA, Deep learning, Matrix multiplication, nVidia, nVidia Quadro GV100, Performance, Programming Languages, PTX
Rishi Bharadwaj Subramanian
Lianmin Zheng, Chengfan Jia, Minmin Sun, Zhao Wu, Cody Hao Yu, Ameer Haj-Ali, Yida Wang, Jun Yang, Danyang Zhuo, Koushik Sen, Joseph E. Gonzalez, Ion Stoica