Somashekaracharya G. Bhaskaracharya, Julien Demouth, Vinod Grover
Tags: Compilers, Computer science, CUBLAS, CUDA, Deep learning, Matrix multiplication, nVidia, nVidia Quadro GV100, Performance, Programming Languages, PTX
Rishi Bharadwaj Subramanian
Lianmin Zheng, Chengfan Jia, Minmin Sun, Zhao Wu, Cody Hao Yu, Ameer Haj-Ali, Yida Wang, Jun Yang, Danyang Zhuo, Koushik Sen, Joseph E. Gonzalez, Ion Stoica
Johannes Blühdorn, Nicolas R. Gauger, Matthias Kabel
Joseph Mellor, Jack Turner, Amos Storkey, Elliot J. Crowley
Jiajian Xiao, Philipp Andelfinger, Wentong Cai, Paul Richmond, Alois Knoll, David Eckhoff
Gangwon Jo, Heehoon Kim, Jeesoo Lee, Jaejin Lee