Philippe Tillet, H. T. Kung, David Cox
Tags: Compilers, Computer science, CUDA, Deep learning, High-level Languages, LLVM, Matrix multiplication, Neural networks, nVidia, nVidia GeForce GTX 1070, Package
Tobias Stauber, Peter Sommerlad
Bodun Hu, Christopher J. Rossbach
Martin Elsman, Troels Henriksen, Niels Gustav Westphal Serup
Zhen Xie, Guangming Tan, Weifeng Liu, Ninghui Sun
Yifan Sun, Trinayan Baruah, Saiful A. Mojumder, Shi Dong, Xiang Gong, Shane Treadway, Yuhui Bao, Spencer Hance, Carter McCardwell, Vincent Zhao, Harrison Barclay, Amir Kavyan Ziabari, Zhongliang Chen, Rafael Ubal, Jose L. Abellan, John Kim, Ajay Joshi
Yanhao Chen, Fei Hua, Chaozhang Huang, Jeremy Bierema, Chi Zhang, Eddy Z. Zhang
Evangelos Georganas, Kunal Banerjee, Dhiraj Kalamkar, Sasikanth Avancha, Anand Venkat, Michael Anderson, Greg Henry, Hans Pabst, Alexander Heinecke
Feng Zhang, Weifeng Liu, Ningxuan Feng, Jidong Zhai, Xiaoyong Du
Ingo Wald, Will Usher, Nate Morrical, Laura Lediaev, Valerio Pascucci
Stefan Groth, Christian Schmitt, Jürgen Teich, and Frank Hannig
Viktor Rosenfeld, Sebastian Bress, Steffen Zeuch, Tilmann Rabl, Volker Markl
Tags: Algorithms, AMD Radeon R9 Fury, ATI, Computer science, Hashing, nVidia, nVidia GeForce GXT 1080, nVidia GeForce GXT 980, OpenCL, Performance, Tesla K40, Tesla V100