Pietro Incardona, Aryaman Gupta, Serhii Yaskovets, Ivo F. Sbalzarini
Tags: AMD RX Vega 64, ATI, Computer science, CUDA, nVidia, nVidia A100, nVidia GeForce RTX 3090, OpenACC, OpenCL, OpenMP, Package, Performance, performance portability, SYCL
Joachim Meyer, Aksel Alpay, Sebastian Hack, Holger Fröning, Vincent Heuveline
Yehonatan Fridman, Guy Tamir, Gal Oren
Stijn Heldens, Ben van Werkhoven
Giorgis Georgakoudis, Konstantinos Parasyris, Chunhua Liao, David Beckingsale, Todd Gamblin, Bronis de Supinski
Tags: AMD Radeon Instinct Mi50, ATI, Benchmarking, Code generation, Compilers, Computer science, CUDA, Heterogeneous systems, Machine learning, nVidia, OpenMP, performance portability, Tesla P100, Tesla V100
Jacob O. Tørring, Ben van Werkhoven, Filip Petrovič, Floris-Jan Willemsen, Jiří Filipovič, Anne C. Elster
Tags: Auto-Tuning, Benchmarking, Computer science, CUDA, nVidia, nVidia GeForce RTX 2080 Ti, nVidia GeForce RTX 3060, nVidia GeForce RTX 3090, nVidia Titan RTX, Package, performance portability
Polykarpos Thomadakis, Nikos Chrisochoides
Anna Fortenberry, Stanimire Tomov
December 25, 2022 by hgpu
August Ernstsson, Dalvan Griebler, Christoph Kessler
December 11, 2022 by hgpu
Yu-Hsiang M. Tsai, Terry Cojean, Hartwig Anzt
Tags: AMD Radeon Instinct MI100, ATI, Computer science, CUDA, Linear Algebra, nVidia, nVidia A100, OpenCL, Package, performance portability, Sparse, SYCL