Filip Petrovič, David Střelák, Jana Hozzová, Jaroslav Oľha, Richard Trembecký, Siegfried Benkner, Jiří Filipovič
Tags: AMD Radeon RX Vega 56, Auto-Tuning, Benchmarking, Computer science, CUDA, Electron microscopy, Intel Xeon Phi, Microscopy, nVidia, nVidia GeForce GTX 1070, nVidia GeForce GTX 750, nVidia GeForce RTX 2080 Ti, OpenCL, Package, Performance, performance portability, Tesla K20
Ilia Sivkov, Alfio Lazzaro, Jurg Hutter
Tags: Computer science, CUDA, Data mining, Linear Algebra, Machine learning, Matrix multiplication, nVidia, Package, Signal processing, Sparse matrix, Tesla P100
Hamid Reza Zohouri, Satoshi Matsuoka
Ameer M.S. Abdelhadi, Christos-Savvas Bouganis, George A. Constantinides
Johannes de Fine Licht, Torsten Hoefler
Yuanming Hu, Tzu-Mao Li, Luke Anderson, Jonathan Ragan-Kelley, Frédo Durand
Martin Bauer, Sebastian Eibl, Christian Godenschwager, Nils Kohl, Michael Kuron, Christoph Rettinger, Florian Schornbaum, Christoph Schwarzmeier, Dominik Thönnes, Harald Köstler, Ulrich Rüde
Tags: Code generation, CUDA, Heterogeneous systems, Lattice Boltzmann model, MPI, nVidia, Package, Particle simulation, performance portability, Physics, Tesla P100
Jehandad Khan, Paul Fultz, Artem Tamazov, Daniel Lowell, Chao Liu, Michael Melesse, Murali Nandhimandalam, Kamil Nasyrov, Ilya Perminov, Tejash Shah, Vasilii Filippov, Jing Zhang, Jing Zhou, Bragadeesh Natarajan, Mayank Daga
Nouamane Laanait, Joshua Romero, Junqi Yin, M. Todd Young, Sean Treichler, Vitalii Starchenko, Albina Borisevich, Alex Sergeev, Michael Matheson
September 29, 2019 by
hgpuSteffen Holst Larsen
September 29, 2019 by
hgpuMohammad Shoeybi, Mostofa Patwary, Raul Puri, Patrick LeGresley, Jared Casper, Bryan Catanzaro
September 22, 2019 by
hgpu