Simon John Pennycook, Ben Ashbaugh, James Brodman, Michael Kinsner, Steffen Larsen, Greg Lueck, Roland Schulz, Michael Voss
Yueming Hao, Xu Zhao, Bin Bao, David Berard, Will Constable, Adnan Aziz, Xu Liu
Akash Dutta, Jordi Alcaraz, Ali TehraniJamsaz, Eduardo Cesar, Anna Sikora, Ali Jannesari
Tags: AMD Radeon HD 7970, ATI, Benchmarking, Computer science, Deep learning, Heterogeneous systems, Neural networks, nVidia, nVidia GeForce GTX 970, OpenCL, OpenMP
Bastian Köpcke, Sergei Gorlatch, Michel Steuwer
Gargi Alavani, Santonu Sarkar
Tags: Computer science, CUDA, Energy-efficient computing, Java, Machine learning, nVidia, Package, Performance, PTX, Tesla K20, Thesis
Yanwen Xu, Ang Li, Tyler Sorensen
Tags: Benchmarking, Computer science, CUDA, FPGA, Heterogeneous systems, HLS, Intel UHD 630, nVidia, nVidia Jetson AGX Xavier, nVidia Jetson Nano, Package, Performance, SYCL
Shixun Wu, Yujia Zhai, Jinyang Liu, Jiajun Huang, Zizhe Jian, Bryan M. Wong, Zizhong Chen
Tags: Code generation, Computer science, CUDA, GEMM, Linear Algebra, Matrix multiplication, nVidia, nVidia A100, Package, Performance, Reliability, Tesla T4
Salem Ameen, Kangaranmulle Siriwardana, Theo Theodoridis
Boyuan Zhang, Jiannan Tian, Sheng Di, Xiaodong Yu, Yunhe Feng, Xin Liang, Dingwen Tao, Franck Cappello
Erik A. Träff, Anton Rydahl, Sven Karlsson, Ole Sigmund, Niels Aage
Vsevolod Livinskii, Dmitry Babokin, John Regehr