Uldis Locans, Andreas Adelmann, Andreas Suter, Jannis Fischer, Werner Lustermann, Gunther Dissertori, Qiulin Wang

Jeremy Appleyard, Tomas Kocisky, Phil Blunsom

Steven Eliuk, Cameron Upright, Anthony Skjellum

Tags: Computer science, CUDA, Deep learning, Heterogeneous systems, Linear Algebra, Matrix multiplication, Neural and Evolutionary Computing, Neural networks, nVidia, OpenMPI, Tesla K80

X. Bellekens, C. Tachtatzis, R. C. Atkinson, C. Renfrew, T. Kirkham

X. J. A. Bellekens, C. Tachtatzis, R. C. Atkinson, C. Renfrew, T. Kirkham

Jingyue Wu, Artem Belevich, Eli Bendersky, Mark Heffernan, Chris Leary, Jacques Pienaar, Bjarke Roune, Rob Springer, Xuetian Weng, Robert Hundt

S. Rallapalli, H. Qiu, A. J. Bency, S. Karthikeyan, R. Govindan, B.S.Manjunath, R. Urgaonkar

Ajay K. Sampathirao, Pantelis Sopasakis, Alberto Bemporad, Panagiotis Patrinos

Kenichi W. Okamoto, Priyanga Amarasekare

James King, Thomas Gilray, Robert M. Kirby, Matthew Might

Jingbo Zhou, Qi Guo, H. V. Jagadish, Wenhao Luan, Anthony K. H. Tung, Yueji Yang, Yuxin Zheng