Guohao Dai, Guyue Huang, Shang Yang, Zhongming Yu, Hengrui Zhang, Yufei Ding, Yuan Xie, Huazhong Yang, Yu Wang
February 20, 2022 by
hgpuErfan Bank Tavakoli, Michael Riera, Masudul Hassan Quraishi, Fengbo Ren
December 26, 2021 by
hgpuYu-Ching Hu, Yuliang Li, Hung-Wei Tseng
December 19, 2021 by
hgpuNavdeep Katel, Vivek Khandelwal, Uday Bondhugula
September 5, 2021 by
hgpuJan Solanti, Michal Babej, Julius Ikkala, Vinod Kumar Malamal Vadakital, Pekka Jääskeläinen
Tags: Computer science, GPU cluster, Heterogeneous systems, Matrix multiplication, nVidia, nVidia GeForce GTX 1060, nVidia GeForce GTX 2080 Ti, OpenCL, Package, Rendering, Tesla P100, Tesla V100
Xiaoyan Liu, Yi Liu, Ming Dun, Bohong Yin, Hailong Yang, Zhongzhi Luan, Depei Qian
Beau Johnston, Jeffrey S. Vetter, Josh Milthorpe
Tags: AMD Radeon VII, ATI, Benchmarking, Computer science, CUDA, Heterogeneous systems, HIP, Matrix multiplication, nVidia, OpenCL, Package, Performance, Tesla P100
November 29, 2020 by
hgpuSteven Harris, Roger D. Chamberlain, Christopher Gill
Thomas Faingnaert, Tim Besard, Bjorn De Sutter
Tags: Computer science, CUBLAS, CUDA, Julia, Machine learning, Mathematical Software, Matrix multiplication, Mixed precision, nVidia, nVidia GeForce RTX 2080 Ti, Package, Performance
Orestis Zachariadis, Nitin Satpute, Juan Gómez-Luna, Joaquín Olivares
Tags: Algorithms, Computer science, CUDA, Matrix multiplication, Mixed precision, nVidia, nVidia GeForce RTX 2070, nVidia Titan RTX, Package, Performance, Sparse matrix