Shuo Zhang, Yanxia Wu, Chaoguang Men, Hongtao He, Kai Liang
Steven W. D. Chien, Ivy B. Peng, Stefano Markidis
Mohammed Alser, Taha Shahroodi, Juan Gomez-Luna, Can Alkan, Onur Mutlu
Filip Petrovič, David Střelák, Jana Hozzová, Jaroslav Oľha, Richard Trembecký, Siegfried Benkner, Jiří Filipovič
Tags: AMD Radeon RX Vega 56, Auto-Tuning, Benchmarking, Computer science, CUDA, Electron microscopy, Intel Xeon Phi, Microscopy, nVidia, nVidia GeForce GTX 1070, nVidia GeForce GTX 750, nVidia GeForce RTX 2080 Ti, OpenCL, Package, Performance, performance portability, Tesla K20
Ilia Sivkov, Alfio Lazzaro, Jurg Hutter
Tags: Computer science, CUDA, Data mining, Linear Algebra, Machine learning, Matrix multiplication, nVidia, Package, Signal processing, Sparse matrix, Tesla P100
Hamid Reza Zohouri, Satoshi Matsuoka
Ameer M.S. Abdelhadi, Christos-Savvas Bouganis, George A. Constantinides
Johannes de Fine Licht, Torsten Hoefler
Yuanming Hu, Tzu-Mao Li, Luke Anderson, Jonathan Ragan-Kelley, Frédo Durand
Martin Bauer, Sebastian Eibl, Christian Godenschwager, Nils Kohl, Michael Kuron, Christoph Rettinger, Florian Schornbaum, Christoph Schwarzmeier, Dominik Thönnes, Harald Köstler, Ulrich Rüde
Tags: Code generation, CUDA, Heterogeneous systems, Lattice Boltzmann model, MPI, nVidia, Package, Particle simulation, performance portability, Physics, Tesla P100