Steven Eliuk, Cameron Upright, Hars Vardhan, Stephen Walsh, Trevor Gale
November 25, 2016 by
hgpuPedro Bruel, Marcos Amaris, Alfredo Goldman
Tags: Benchmarking, Computer science, CUDA, Heterogeneous systems, Matrix multiplication, nVidia, nVidia GeForce GTX 750, nVidia GeForce GTX 980, Package, Performance, Tesla K40
November 16, 2016 by
hgpuRyotaro Sakai, Fumihiko Ino, Kenichi Hagihara
Farhad Merchant, Tarun Vatwani, Anupam Chattopadhyay, Soumyendu Raha, S K Nandy, Ranjani Narayan
Michel Steuwer, Toomas Remmelg, Christophe Dubach
Tags: ARM, ATI, ATI Radeon HD 7970, BLAS, Code generation, Computer science, Linear Algebra, Matrix multiplication, nVidia, nVidia GeForce GTX Titan Black, OpenCL, performance portability
Gregory Diamos, Shubho Sengupta, Bryan Catanzaro, Mike Chrzanowski, Adam Coates, Erich Elsen, Jesse Engel, Awni Hannun, Sanjeev Satheesh
Tatsumi Aoyama, Ken-Ichi Ishikawa, Yasuyuki Kimura, Hideo Matsufuru, Atsushi Sato, Tomohiro Suzuki, Sunao Torii
Steven Eliuk, Cameron Upright, Anthony Skjellum
Tags: Computer science, CUDA, Deep learning, Heterogeneous systems, Linear Algebra, Matrix multiplication, Neural and Evolutionary Computing, Neural networks, nVidia, OpenMPI, Tesla K80