Boyuan Zhang, Jiannan Tian, Sheng Di, Xiaodong Yu, Martin Swany, Dingwen Tao, Franck Cappello
Filip Petrovič, Jiří Filipovič
Tags: Computer science, CUDA, nVidia, nVidia GeForce GTX 1070, nVidia GeForce GTX 680, nVidia GeForce GTX 750, nVidia GeForce RTX 2080 Ti, OpenCL, Package, Performance, Python, Vulkan
S.N. Swatman, A. Krasznahorkay, P. Gessinger
Zhiyi Li, Douglas Orr, Valeriu Ohan, Godfrey Da costa, Tom Murray, Adam Sanders, Deniz Beker, Dominic Masters
Stijn Heldens, Ben van Werkhoven
YuPeng Huang, Hong Zhang, Siyuan Jiang, Dajiong Yue, Xiaohan Lin, Jun Zhang, Yi Qin Gao
João Bispo, Nuno Paulino, Luís Miguel Sousa
Gregor Daiß, Patrick Diehl, Hartmut Kaiser, Dirk Pflüger
Jacob O. Tørring, Ben van Werkhoven, Filip Petrovič, Floris-Jan Willemsen, Jiří Filipovič, Anne C. Elster
Tags: Auto-Tuning, Benchmarking, Computer science, CUDA, nVidia, nVidia GeForce RTX 2080 Ti, nVidia GeForce RTX 3060, nVidia GeForce RTX 3090, nVidia Titan RTX, Package, performance portability
Giorgis Georgakoudis, Konstantinos Parasyris, Chunhua Liao, David Beckingsale, Todd Gamblin, Bronis de Supinski
Tags: AMD Radeon Instinct Mi50, ATI, Benchmarking, Code generation, Compilers, Computer science, CUDA, Heterogeneous systems, Machine learning, nVidia, OpenMP, performance portability, Tesla P100, Tesla V100
Polykarpos Thomadakis, Nikos Chrisochoides