Philippe Tillet, H. T. Kung, David Cox
Tags: Compilers, Computer science, CUDA, Deep learning, High-level Languages, LLVM, Matrix multiplication, Neural networks, nVidia, nVidia GeForce GTX 1070, Package
Bodun Hu, Christopher J. Rossbach
Evangelos Georganas, Kunal Banerjee, Dhiraj Kalamkar, Sasikanth Avancha, Anand Venkat, Michael Anderson, Greg Henry, Hans Pabst, Alexander Heinecke
Francois Belletti, Davis King, Kun Yang, Roland Nelet, Yusef Shafi, Yi-Fan Chen, John Anderson
Andre Viebke, Sabri Pllana, Suejb Memeti, Joanna Kolodziej
Long-Gang Pang, Kai Zhou, Nan Su, Hannah Petersen, Horst Stoecker, Xin-Nian Wang
Yunfei Teng, Wenbo Gao, Francois Chalus, Anna Choromanska, Donald Goldfarb, Adrian Weller
Zhenheng Tang, Yuxin Wang, Qiang Wang, Xiaowen Chu
C. Jiang, D. Ojika, T. Kurth, Prabhat, S. Vallecorsa, B. Patel, H. Lam