Zhuren Liu, Shouzhe Zhang, Justin Garrigus, Hui Zhao
Leonardo Solis-Vasquez, Edward Mascarenhas, Andreas Koch
Siddharth Singh, Zack Sating, Abhinav Bhatele
Pratik Fegade, Tianqi Chen, Phillip B. Gibbons, Todd C. Mowry
Philip Salzmann, Fabian Knorr, Peter Thoman, Philipp Gschwandtner, Biagio Cosenza, Thomas Fahringer
Bastian Köpcke, Sergei Gorlatch, Michel Steuwer
Gargi Alavani, Santonu Sarkar
Tags: Computer science, CUDA, Energy-efficient computing, Java, Machine learning, nVidia, Package, Performance, PTX, Tesla K20, Thesis
Yanwen Xu, Ang Li, Tyler Sorensen
Tags: Benchmarking, Computer science, CUDA, FPGA, Heterogeneous systems, HLS, Intel UHD 630, nVidia, nVidia Jetson AGX Xavier, nVidia Jetson Nano, Package, Performance, SYCL
Shixun Wu, Yujia Zhai, Jinyang Liu, Jiajun Huang, Zizhe Jian, Bryan M. Wong, Zizhong Chen
Tags: Code generation, Computer science, CUDA, GEMM, Linear Algebra, Matrix multiplication, nVidia, nVidia A100, Package, Performance, Reliability, Tesla T4