Philippe Tillet, David Cox
Tags: Auto-Tuning, Computer science, CUDA, Deep learning, nVidia, nVidia GeForce GTX 980 Ti, OpenCL, Package, Performance, PTX, Tesla P100
February 17, 2018 by
hgpuAndras Attila Sulyok, Gabor Daniel Balogh, Istvan Zoltan Reguly, Gihan R. Mudalige
February 15, 2018 by
hgpuXin Chen, Hua Zhou, Yuxiang Gao, Yu Zhu, Dongyan Wang
Simon Garcia De Gonzalo, Simon D. Hammond, Christian R. Trott, Wen-Mei Hwu
Ali Karakus, Noel Chalmers, Kasia Swirydowicz, Timothy Warburton
Zeyi Wen, Jiashuai Shi, Bingsheng He, Qinbin Li, Jian Chen
Ammar Ahmad Awan, Hari Subramoni, Dhabaleswar K. Panda
Tags: Benchmarking, Caffe, Computer science, CUBLAS, CUDA, Deep learning, Intel Xeon Phi, Machine learning, nVidia, Tela K40, Tesla K80, Tesla P100
December 24, 2017 by
hgpuHsi-Yu Schive, John A. ZuHone, Nathan J. Goldbaum, Matthew J. Turk, Massimo Gaspari, Chin-Yu Cheng
Tags: ARM, Astrophysics, Chemistry, CUDA, Instrumentation and Methods for Astrophysics, Magnetohydrodynamics, MPI, nVidia, OpenMP, Package, Tesla K20, Tesla P100
December 24, 2017 by
hgpuAzzam Haidar, Panruo Wu, Stanimire Tomov, Jack Dongarra
December 10, 2017 by
hgpuRuizi Li, Carleton DeTar, Steven Gottlieb, Doug Toussaint
Gheorghe-Teodor Bercea, Carlo Bertolli, Arpith C. Jacob, Alexandre Eichenberger, Alexey Bataev, Georgios Rokos, Hyojin Sung, Tong Chen, Kevin O'Brien
November 30, 2017 by
hgpuG. D. Balogh, I. Z. Reguly, G. R. Mudalige