Yehonatan Fridman, Guy Tamir, Gal Oren
Stijn Heldens, Ben van Werkhoven
Jacob O. Tørring, Ben van Werkhoven, Filip Petrovic, Floris-Jan Willemsen, Jirí Filipovic, Anne C. Elster
Tags: Auto-Tuning, Benchmarking, Computer science, CUDA, nVidia, nVidia GeForce RTX 2080 Ti, nVidia GeForce RTX 3060, nVidia GeForce RTX 3090, nVidia Titan RTX, Package, performance portability
Giorgis Georgakoudis, Konstantinos Parasyris, Chunhua Liao, David Beckingsale, Todd Gamblin, Bronis de Supinski
Tags: AMD Radeon Instinct Mi50, ATI, Benchmarking, Code generation, Compilers, Computer science, CUDA, Heterogeneous systems, Machine learning, nVidia, OpenMP, performance portability, Tesla P100, Tesla V100
Polykarpos Thomadakis, Nikos Chrisochoides
Anna Fortenberry, Stanimire Tomov
December 25, 2022 by
hgpuAugust Ernstsson, Dalvan Griebler, Christoph Kessler
December 11, 2022 by
hgpuYu-Hsiang M. Tsai, Terry Cojean, Hartwig Anzt
Tags: AMD Radeon Instinct MI100, ATI, Computer science, CUDA, Linear Algebra, nVidia, nVidia A100, OpenCL, Package, performance portability, Sparse, SYCL
Gregor Daiß, Patrick Diehl, Dominic Marcello, Alireza Kheirkhahan, Hartmut Kaiser, Dirk Pflüger
Polykarpos Thomadakis, Nikos Chrisochoides
Zheming Jin, Jeffrey S. Vetter