Andrew Gozillon, Ronan Keryell, Lin-Ya Yu, Gauthier Harnisch, Paul Keir
Geoffrey X. Yu, Yubo Gao, Pavel Golikov, Gennady Pekhimenko
Tags: Computer science, CUDA, Deep learning, Machine learning, Neural networks, nVidia, nVidia GeForce RTX 2070, Performance, Python, Tesla P100, Tesla V100
Yuhan Liu, Saurabh Agarwal, Shivaram Venkataraman
Walther Carballo-Hernández, Maxime Pelcat, François Berry
Guei-Yuan Lueh, Kaiyu Chen, Gang Chen, Joel Fuentes, Wei-Yu Chen, Fangwen Fu, Hong Jiang, Hongzheng Li, Daniel Rhee
Aamir Shafi, Jahanzeb Maqbool Hashmi, Hari Subramoni, Dhabaleswar K. (DK) Panda
Uttaran Bhattacharya, Nicholas Rewkowski, Abhishek Banerjee, Pooja Guhan, Aniket Bera, Dinesh Manocha
Biagio Cosenza, Nikita Popov, Ben Juurlink, Paul Richmond, Mozhgan Kabiri Chimeh, Carmine Spagnuolo, Gennaro Cordasco, Vittorio Scarano
Lin Mingbao, Ji Rongrong, Li Shaojie, Wang Yan, Wu Yongjian, Huang Feiyue, Ye Qixiang
Jason Mohoney, Roger Waleffe, Yiheng Xu, Theodoros Rekatsinas, Shivaram Venkataraman
Johannes de Fine Licht, Andreas Kuster, Tiziano De Matteis, Tal Ben-Nun, Dominic Hofer, Torsten Hoefler
Tags: Code generation, Computer science, CUDA, Distributed computing, FPGA, Heterogeneous systems, nVidia, OpenCL, Package, Tesla P100, Tesla V100