Lazaros Papadopoulos, Dimitris John Soudris, Christoph Kessler, August Ernstsson, Johan Ahlqvist, Nikos Vasilas, Athanasios Papadopoulos, Panos Seferlis, Charles Prouveur, Matthieu Haefele, Samuel Paul Thibault, Athanasios Salamanis, Theodoros Ioakimidis, Dionisis D. Kehagias
Tags: Computer science, CUDA, FPGA, Heterogeneous systems, MPI, nVidia, nVidia Quadro P 620, OpenCL, OpenMP, Tesla P100, Tesla V100
Jan Solanti, Michal Babej, Julius Ikkala, Vinod Kumar Malamal Vadakital, Pekka Jääskeläinen
Tags: Computer science, GPU cluster, Heterogeneous systems, Matrix multiplication, nVidia, nVidia GeForce GTX 1060, nVidia GeForce GTX 2080 Ti, OpenCL, Package, Rendering, Tesla P100, Tesla V100
Chao Chen, Chris Porter, Santosh Pande
Martin Svedin, Steven W. D. Chien, Gibson Chikafa, Niclas Jansson, Artur Podobas
Shenggui Li, Fuzhao Xue, Yongbin Li, Yang You
Purushotam Kumar, Surya Pratap Vanka
Davide Vanzo, Samuel Peter, Lukas Vonwiller, Matthias Buergler, Manuel Weberndorfer, Annunziato Siviglia, Daniel Conde, David F. Vetsch
February 28, 2021 by
hgpuGeoffrey X. Yu, Yubo Gao, Pavel Golikov, Gennady Pekhimenko
Tags: Computer science, CUDA, Deep learning, Machine learning, Neural networks, nVidia, nVidia GeForce RTX 2070, Performance, Python, Tesla P100, Tesla V100
Yuhan Liu, Saurabh Agarwal, Shivaram Venkataraman
Johannes de Fine Licht, Andreas Kuster, Tiziano De Matteis, Tal Ben-Nun, Dominic Hofer, Torsten Hoefler
Tags: Code generation, Computer science, CUDA, Distributed computing, FPGA, Heterogeneous systems, nVidia, OpenCL, Package, Tesla P100, Tesla V100