Philippe Tillet, H. T. Kung, David Cox
Tags: Compilers, Computer science, CUDA, Deep learning, High-level Languages, LLVM, Matrix multiplication, Neural networks, nVidia, nVidia GeForce GTX 1070, Package
Isabelly Rocha, Christian Göttel, Pascal Felber, Marcelo Pasin, Romain Rouvoy, Valerio Schiavoni
Tobias Stauber, Peter Sommerlad
Bodun Hu, Christopher J. Rossbach
Newsha Ardalani, Urmish Thakker, Aws Albarghouthi, Karu Sankaralingam
Martin Elsman, Troels Henriksen, Niels Gustav Westphal Serup
Zhen Xie, Guangming Tan, Weifeng Liu, Ninghui Sun
Yifan Sun, Trinayan Baruah, Saiful A. Mojumder, Shi Dong, Xiang Gong, Shane Treadway, Yuhui Bao, Spencer Hance, Carter McCardwell, Vincent Zhao, Harrison Barclay, Amir Kavyan Ziabari, Zhongliang Chen, Rafael Ubal, Jose L. Abellan, John Kim, Ajay Joshi
Stavros Efthymiou, Jack Hidary, Stefan Leichenauer
Yanhao Chen, Fei Hua, Chaozhang Huang, Jeremy Bierema, Chi Zhang, Eddy Z. Zhang
Evangelos Georganas, Kunal Banerjee, Dhiraj Kalamkar, Sasikanth Avancha, Anand Venkat, Michael Anderson, Greg Henry, Hans Pabst, Alexander Heinecke