Afzal Ahmad, Linfeng Du, Wei Zhang
Junjie Li, Yinzhi Wang, Xiao Liang, Hang Liu
Jou-An Chen, Hsin-Hsuan Sung, Nathan Tallent, Kevin Barker, Xipeng Shen, Ang Li
Tiziano De Matteis, Johannes de Fine Licht, Torsten Hoefler
Vadim Demchik, Miroslav Bačák, Stefan Bordag
John Lawson, Mehdi Goli, Duncan McBain, Daniel Soutar, Louis Sugy
Tags: AMD R9 Nano, ATI, BLAS, Computer science, Deep learning, Linear Algebra, Machine learning, Mathematical Software, OpenCL, Package, Performance, performance portability, SYCL
Carl Yang, Aydin Buluc, John D. Owens
Azzam Haidar, Panruo Wu, Stanimire Tomov, Jack Dongarra
December 10, 2017 by hgpu
Hamidreza Khaleghzadeh, Ziming Zhong, Ravi Reddy, Alexey Lastovetsky
Tags: BLAS, Cloud, Computer science, CUBLAS, CUDA, FPGA, Heterogeneous systems, Intel Xeon Phi, Matrix multiplication, nVidia, OpenCL, Package, Virtualization
September 16, 2017 by hgpu
Shaohuai Shi, Pengfei Xu, Xiaowen Chu
Tags: Algorithms, BLAS, Caffe, Computer science, CUBLAS, CUDA, Deep learning, Linear Algebra, Matrix multiplication, Neural networks, nVidia, nVidia GeForce GTX 1080, Package, Performance
February 14, 2017 by hgpu