Shaohuai Shi, Pengfei Xu, Xiaowen Chu

Tags: Algorithms, BLAS, Caffe, Computer science, CUBLAS, CUDA, Deep learning, Linear Algebra, Matrix multiplication, Neural networks, nVidia, nVidia GeForce GTX 1080, Package, Performance

February 14, 2017 by hgpu

Joao Paulo Tarasconi Ruschel

Tags: Algorithms, Benchmarking, Computer science, CUDA, Linear Algebra, Matrix decomposition, nVidia, OpenCL, OpenMP, Package, Performance, Tesla K80, Thesis

Ali Charara, David Keyes, Hatem Ltaief

Farhad Merchant, Tarun Vatwani, Anupam Chattopadhyay, Soumyendu Raha, S K Nandy, Ranjani Narayan

Tags: Algorithms, Computer science, CUDA, Factorization, FPGA, Linear Algebra, Mathematical Software, Matrix multiplication, nVidia, Performance, Tesla C2050

December 17, 2016 by hgpu

Richard Michael Veras, Tze Meng Low, Tyler Michael Smith, Robert van de Geijn, Franz Franchetti

December 14, 2016 by hgpu

Steven Eliuk, Cameron Upright, Hars Vardhan, Stephen Walsh, Trevor Gale

November 25, 2016 by hgpu

Andrea Picciau, Gordon E. Inggs, John Wickerson, Eric C. Kerrigan, George A. Constantinides

Farhad Merchant, Tarun Vatwani, Anupam Chattopadhyay, Soumyendu Raha, S K Nandy, Ranjani Narayan

Gregoire Pichon, Mathieu Faverge, Pierre Ramet, Jean Roman

Jiaquan Gao, Panpan Qi, Guixia He