Yan Shen, Yuxing Dai, Zhiliang Zhu

Siddharth Samsi, Brian Helfer, Jeremy Kepner, Albert Reuther, Darrell O. Ricke

Yuechao Lu, Fumihiko Ino, Yasuyuki Matsushita

Bangtian Liu, Chengyao Wen, Anand D. Sarwate, Maryam Mehri Dehnavi

Shaohuai Shi, Pengfei Xu, Xiaowen Chu

Tags: Algorithms, BLAS, Caffe, Computer science, CUBLAS, CUDA, Deep learning, Linear Algebra, Matrix multiplication, Neural networks, nVidia, nVidia GeForce GTX 1080, Package, Performance

February 14, 2017 by hgpu

Joao Paulo Tarasconi Ruschel

Tags: Algorithms, Benchmarking, Computer science, CUDA, Linear Algebra, Matrix decomposition, nVidia, OpenCL, OpenMP, Package, Performance, Tesla K80, Thesis

Ali Charara, David Keyes, Hatem Ltaief

Farhad Merchant, Tarun Vatwani, Anupam Chattopadhyay, Soumyendu Raha, S K Nandy, Ranjani Narayan

Tags: Algorithms, Computer science, CUDA, Factorization, FPGA, Linear Algebra, Mathematical Software, Matrix multiplication, nVidia, Performance, Tesla C2050

December 17, 2016 by hgpu

Richard Michael Veras, Tze Meng Low, Tyler Michael Smith, Robert van de Geijn, Franz Franchetti

December 14, 2016 by hgpu

Steven Eliuk, Cameron Upright, Hars Vardhan, Stephen Walsh, Trevor Gale

November 25, 2016 by hgpu