Azzam Haidar, Panruo Wu, Stanimire Tomov, Jack Dongarra

December 10, 2017 by

hgpuHamidreza Khaleghzadeh, Ziming Zhong, Ravi Reddy, Alexey Lastovetsky

Tags: BLAS, Cloud, Computer science, CUBLAS, CUDA, FPGA, Heterogeneous systems, Intel Xeon Phi, Matrix multiplication, nVidia, OpenCL, Package, Virtualization

September 16, 2017 by

hgpuShaohuai Shi, Pengfei Xu, Xiaowen Chu

Tags: Algorithms, BLAS, Caffe, Computer science, CUBLAS, CUDA, Deep learning, Linear Algebra, Matrix multiplication, Neural networks, nVidia, nVidia GeForce GTX 1080, Package, Performance

February 14, 2017 by

hgpuAli Charara, David Keyes, Hatem Ltaief

Farhad Merchant, Tarun Vatwani, Anupam Chattopadhyay, Soumyendu Raha, S K Nandy, Ranjani Narayan

Tingxing Dong, Azzam Haidar, Piotr Luszczek, Stanimire Tomov, Ahmad Abdelfattah, Jack Dongarra

Samuel D. Relton, Pedro Valero-Lara, Mawussi Zounon

Chemseddine Chohra, Philippe Langlois, David Parello

Ichitaro Yamazaki, Stanimire Tomov, Jack Dongarra

Paul Springer, Aravind Sankaran, Paolo Bientinesi

Tags: BLAS, Compilers, Computer science, CUDA, Intel Xeon Phi, Linear Algebra, Mathematical Software, nVidia, nVidia GeForce 840 M, Package, Performance, Tesla K40

Michel Steuwer, Toomas Remmelg, Christophe Dubach

Tags: ARM, ATI, ATI Radeon HD 7970, BLAS, Code generation, Computer science, Linear Algebra, Matrix multiplication, nVidia, nVidia GeForce GTX Titan Black, OpenCL, performance portability