Rodrigo de Oliveira Lourenço Lopes
Alexander Matz, Johannes Doerfert, Holger Fröning
Guyue Huang, Guohao Dai, Yu Wang, Huazhong Yang
Sohan Lal, Aksel Alpay, Philip Salzmann, Biagio Cosenza, Alexander Hirsch, Nicolai Stawinoga, Peter Thoman, Thomas Fahringer, Vincent Heuveline
Tags: Benchmarking, Computer science, FPGA, Heterogeneous systems, nVidia, nVidia GeForce GTX Titan X, OpenCL, Package, Performance, PTX, SYCL
Yuanhang Yu, Dong Wen, Ying Zhang, Xiaoyang Wang, Wenjie Zhang, Xuemin Lin
Min Li, Yulong Ao, Chao Yang
Hartwig Anzt, Terry Cojean, Goran Flegar, Fritz Göbel, Thomas Grützmacher, Pratik Nayak, Tobias Ribizel, Yuhsiang Mike Tsai, Enrique S. Quintana-Ortí
Tags: Algorithms, AMD Radeon VII, ATI, Computer science, CUDA, HIP, Linear Algebra, Mathematical Software, nVidia, Package, Sparse, Sparse matrix, Tesla V100
Utpal Kiran, Sachin Singh Gautam, Deepak Sharma
Trevor Gale, Matei Zaharia, Cliff Young, Erich Elsen
Yuhsiang M. Tsai, Terry Cojean, Tobias Ribizel, Hartwig Anzt
Somashekaracharya G. Bhaskaracharya, Julien Demouth, Vinod Grover
Tags: Compilers, Computer science, CUBLAS, CUDA, Deep learning, Matrix multiplication, nVidia, nVidia Quadro GV100, Performance, Programming Languages, PTX