Utpal Kiran, Sachin Singh Gautam, Deepak Sharma
Trevor Gale, Matei Zaharia, Cliff Young, Erich Elsen
Yuhsiang M. Tsai, Terry Cojean, Tobias Ribizel, Hartwig Anzt
Somashekaracharya G. Bhaskaracharya, Julien Demouth, Vinod Grover
Tags: Compilers, Computer science, CUBLAS, CUDA, Deep learning, Matrix multiplication, nVidia, nVidia Quadro GV100, Performance, Programming Languages, PTX
Rishi Bharadwaj Subramanian
Lianmin Zheng, Chengfan Jia, Minmin Sun, Zhao Wu, Cody Hao Yu, Ameer Haj-Ali, Yida Wang, Jun Yang, Danyang Zhuo, Koushik Sen, Joseph E. Gonzalez, Ion Stoica
Johannes Blühdorn, Nicolas R. Gauger, Matthias Kabel
Joseph Mellor, Jack Turner, Amos Storkey, Elliot J. Crowley