Junqing Lin, Jingwei Sun, Xiaolong Shi, Honghe Zhang, Xianzhi Yu, Xinzhi Wang, Jun Yao, Guangzhong Sun
Tags: Compilers, Computer science, CUDA, Deep learning, Linear Algebra, Matrix multiplication, Neural networks, nVidia, nVidia GeForce RTX 2080 Ti, Performance, Sparse matrix, Tesla V100
Seonho Lee, Amar Phanishayee, Divya Mahajan
Tags: Computer science, CUDA, Deep learning, nVidia, nVidia A100, nVidia H100, nVidia P100, nVidia V100, Performance, PyTorch, Tesla T4
Yizhou Luo, Qiang Wang, Shaohuai Shi, Jiaxin Lai, Shuhan Qi, Jiajia Zhang, Xuan Wang
Wei Sun, Ang Li, Sander Stuijk, Henk Corporaal
Numaan Huq, Philippe Lin, Roel Reyes, Charles Perine
Tags: AMD Radeon Pro V520, Artificial intelligence, ATI, Cloud, Computer science, CUDA, Deep learning, nVidia, OpenCL, Security, Tesla T4
Guillaume Couairon, Christian Lessig, Anastase Charantonis, Claire Monteleoni
Roberto L. Castro, Diego Andrade, Basilio B. Fraguela
Yiluan Xing, Chao Yan, Cathy Chang Xie
Ruixin Wang, Minghai Lu, Cody Hao Yu, Yi-Hsiang Lai, Tianyi Zhang