Philippe Tillet, H. T. Kung, David Cox
Tags: Compilers, Computer science, CUDA, Deep learning, High-level Languages, LLVM, Matrix multiplication, Neural networks, nVidia, nVidia GeForce GTX 1070, Package
Isabelly Rocha, Christian Göttel, Pascal Felber, Marcelo Pasin, Romain Rouvoy, Valerio Schiavoni
Bodun Hu, Christopher J. Rossbach
Martin Elsman, Troels Henriksen, Niels Gustav Westphal Serup
Yifan Sun, Trinayan Baruah, Saiful A. Mojumder, Shi Dong, Xiang Gong, Shane Treadway, Yuhui Bao, Spencer Hance, Carter McCardwell, Vincent Zhao, Harrison Barclay, Amir Kavyan Ziabari, Zhongliang Chen, Rafael Ubal, Jose L. Abellan, John Kim, Ajay Joshi
Stavros Efthymiou, Jack Hidary, Stefan Leichenauer
Stefan Groth, Christian Schmitt, Jürgen Teich, and Frank Hannig
Xiawu Zheng, Rongrong Ji, Lang Tang, Yan Wan, Baochang Zhang, Yongjian Wu, Yunsheng Wu, Ling Shao