Philip Salzmann, Fabian Knorr, Peter Thoman, Philipp Gschwandtner, Biagio Cosenza, Thomas Fahringer
Juan Fumero, György Rethy, Athanasios Stratikopoulos, Nikos Foutris, Christos Kotselidis
Simon John Pennycook, Ben Ashbaugh, James Brodman, Michael Kinsner, Steffen Larsen, Greg Lueck, Roland Schulz, Michael Voss
Akash Dutta, Jordi Alcaraz, Ali TehraniJamsaz, Eduardo Cesar, Anna Sikora, Ali Jannesari
Tags: AMD Radeon HD 7970, ATI, Benchmarking, Computer science, Deep learning, Heterogeneous systems, Neural networks, nVidia, nVidia GeForce GTX 970, OpenCL, OpenMP
Gargi Alavani, Santonu Sarkar
Tags: Computer science, CUDA, Energy-efficient computing, Java, Machine learning, nVidia, Package, Performance, PTX, Tesla K20, Thesis
Bastian Köpcke, Sergei Gorlatch, Michel Steuwer
Yanwen Xu, Ang Li, Tyler Sorensen
Tags: Benchmarking, Computer science, CUDA, FPGA, Heterogeneous systems, HLS, Intel UHD 630, nVidia, nVidia Jetson AGX Xavier, nVidia Jetson Nano, Package, Performance, SYCL
Shixun Wu, Yujia Zhai, Jinyang Liu, Jiajun Huang, Zizhe Jian, Bryan M. Wong, Zizhong Chen
Tags: Code generation, Computer science, CUDA, GEMM, Linear Algebra, Matrix multiplication, nVidia, nVidia A100, Package, Performance, Reliability, Tesla T4
Boyuan Zhang, Jiannan Tian, Sheng Di, Xiaodong Yu, Yunhe Feng, Xin Liang, Dingwen Tao, Franck Cappello