Alexander Brandt, Davood Mohajerani, Marc Moreno Maza, Jeeva Paudel, Linxiao Wang
November 10, 2019 by
hgpuSteven W. D. Chien, Ivy B. Peng, Stefano Markidis
Mohammed Alser, Taha Shahroodi, Juan Gomez-Luna, Can Alkan, Onur Mutlu
Filip Petrovič, David Střelák, Jana Hozzová, Jaroslav Oľha, Richard Trembecký, Siegfried Benkner, Jiří Filipovič
Tags: AMD Radeon RX Vega 56, Auto-Tuning, Benchmarking, Computer science, CUDA, Electron microscopy, Intel Xeon Phi, Microscopy, nVidia, nVidia GeForce GTX 1070, nVidia GeForce GTX 750, nVidia GeForce RTX 2080 Ti, OpenCL, Package, Performance, performance portability, Tesla K20
Andrey Ignatov, Radu Timofte, Andrei Kulik, Seungsoo Yang, Ke Wang, Felix Baum, Max Wu, Lirong Xu, Luc Van Gool
Guanhua Wang, Shivaram Venkataraman, Amar Phanishayee, Jorgen Thelin, Nikhil Devanur, Ion Stoica
Ilia Sivkov, Alfio Lazzaro, Jurg Hutter
Tags: Computer science, CUDA, Data mining, Linear Algebra, Machine learning, Matrix multiplication, nVidia, Package, Signal processing, Sparse matrix, Tesla P100
Valentin Radu, Kuba Kaszyk, Yuan Wen, Jack Turner, Jose Cano, Elliot J. Crowley, Bjorn Franke, Amos Storkey, Michael O'Boyle
Oded Green, James Fox, Jeffrey Young, Jun Shirako, David Bader
Yuanming Hu, Tzu-Mao Li, Luke Anderson, Jonathan Ragan-Kelley, Frédo Durand
Martin Bauer, Sebastian Eibl, Christian Godenschwager, Nils Kohl, Michael Kuron, Christoph Rettinger, Florian Schornbaum, Christoph Schwarzmeier, Dominik Thönnes, Harald Köstler, Ulrich Rüde
Tags: Code generation, CUDA, Heterogeneous systems, Lattice Boltzmann model, MPI, nVidia, Package, Particle simulation, performance portability, Physics, Tesla P100