Tiziano De Matteis, Johannes de Fine Licht, Torsten Hoefler

Vadim Demchik, Miroslav Bačák, Stefan Bordag

John Lawson, Mehdi Goli, Duncan McBain, Daniel Soutar, Louis Sugy

Tags: AMD R9 Nano, ATI, BLAS, Computer science, Deep learning, Linear Algebra, Machine learning, Mathematical Software, OpenCL, Package, Performance, performance portability, SYCL

Carl Yang, Aydin Buluc, John D. Owens

Azzam Haidar, Panruo Wu, Stanimire Tomov, Jack Dongarra

December 10, 2017 by hgpu

Hamidreza Khaleghzadeh, Ziming Zhong, Ravi Reddy, Alexey Lastovetsky

Tags: BLAS, Cloud, Computer science, CUBLAS, CUDA, FPGA, Heterogeneous systems, Intel Xeon Phi, Matrix multiplication, nVidia, OpenCL, Package, Virtualization

September 16, 2017 by hgpu

Shaohuai Shi, Pengfei Xu, Xiaowen Chu

Tags: Algorithms, BLAS, Caffe, Computer science, CUBLAS, CUDA, Deep learning, Linear Algebra, Matrix multiplication, Neural networks, nVidia, nVidia GeForce GTX 1080, Package, Performance

February 14, 2017 by hgpu

Ali Charara, David Keyes, Hatem Ltaief

Farhad Merchant, Tarun Vatwani, Anupam Chattopadhyay, Soumyendu Raha, S K Nandy, Ranjani Narayan

Tingxing Dong, Azzam Haidar, Piotr Luszczek, Stanimire Tomov, Ahmad Abdelfattah, Jack Dongarra

Samuel D. Relton, Pedro Valero-Lara, Mawussi Zounon