Yehia Arafa, Ammar ElWazir, Abdelrahman ElKanishy, Youssef Aly, Ayatelrahman Elsayed, Abdel-Hameed Badawy, Gopinath Chennupati, Stephan Eidenbenz, Nandakishore Santhi
February 23, 2020 by
hgpuYehia Arafa, Abdel-Hameed Badawy, Gopinath Chennupati, Nandakishore Santhi, Stephan Eidenbenz
Tags: Benchmarking, Computer science, CUDA, nVidia, nVidia GeForce GTX Titan X, nVidia Titan RTX, Performance, PTX, Tesla K40, Tesla P100, Tesla V100
Benjamin Ferrell, Jun Duan, Kevin W. Hamlen
Zhe Jia, Marco Maggioni, Jeffrey Smith, Daniele Paolo Scarpazza
Ricardo Nobre, Luis Reis, Joao M. P. Cardoso
Zhe Jia, Marco Maggioni, Benjamin Staiger, Daniele P. Scarpazza
Riyadh Baghdadi, Jessica Ray, Malek Ben Romdhane, Emanuele Del Sozzo, Patricia Suriana, Shoaib Kamil, Saman Amarasinghe
Tags: Compilers, Computer science, DSL, FPGA, Matrix multiplication, nVidia, OpenMPI, Performance, Programming Languages, PTX, Tesla K40
Philippe Tillet, David Cox
Tags: Auto-Tuning, Computer science, CUDA, Deep learning, nVidia, nVidia GeForce GTX 980 Ti, OpenCL, Package, Performance, PTX, Tesla P100
February 17, 2018 by
hgpuScott Cyphers, Arjun K. Bansal, Anahita Bhiwandiwalla, Jayaram Bobba, Matthew Brookhart, Avijit Chakraborty, Will Constable, Christian Convey, Leona Cook, Omar Kanawi, Robert Kimball, Jason Knight, Nikolay Korovaiko, Varun Kumar, Yixing Lao, Christopher R. Lishka, Jaikrishnan Menon, Jennifer Myers, Sandeep Aswath Narayana, Adam Procter, Tristan J. Webb
Xiaohui Chen, Marc Moreno-Maza, Jeeva Paudel, Ning Xie
Gheorghe-Teodor Bercea, Carlo Bertolli, Arpith C. Jacob, Alexandre Eichenberger, Alexey Bataev, Georgios Rokos, Hyojin Sung, Tong Chen, Kevin O'Brien
November 30, 2017 by
hgpu