Wei Niu, Zhengang Li, Xiaolong Ma, Peiyan Dong, Gang Zhou, Xuehai Qian, Xue Lin, Yanzhi Wang, Bin Ren
Aditya Rajagopal, Christos-Savvas Bouganis
Lazaros Papadopoulos, Dimitris John Soudris, Christoph Kessler, August Ernstsson, Johan Ahlqvist, Nikos Vasilas, Athanasios Papadopoulos, Panos Seferlis, Charles Prouveur, Matthieu Haefele, Samuel Paul Thibault, Athanasios Salamanis, Theodoros Ioakimidis, Dionisis D. Kehagias
Tags: Computer science, CUDA, FPGA, Heterogeneous systems, MPI, nVidia, nVidia Quadro P 620, OpenCL, OpenMP, Tesla P100, Tesla V100
Shandong Lao, Aaron Holt, Deepthi Vaidhynathan, Hariswaran Sitaraman, Christine M. Hrenya, Thomas Hauser
Muhammad A. Awad, Saman Ashkiani, Serban D. Porumbescu, Martín Farach-Colton, John D. Owens
Fabian Knorr, Peter Thoman, Thomas Fahringer
Guillermo Oyarzun, Daniel Mira, Guillaume Houzeaux
Hanchen Ye, Cong Hao, Jianyi Cheng, Hyunmin Jeong, Jack Huang, Stephen Neuendorffer, Deming Chen
Jan Solanti, Michal Babej, Julius Ikkala, Vinod Kumar Malamal Vadakital, Pekka Jääskeläinen
Tags: Computer science, GPU cluster, Heterogeneous systems, Matrix multiplication, nVidia, nVidia GeForce GTX 1060, nVidia GeForce GTX 2080 Ti, OpenCL, Package, Rendering, Tesla P100, Tesla V100