Hamidreza Khaleghzadeh, Ziming Zhong, Ravi Reddy, Alexey Lastovetsky
Tags: BLAS, Cloud, Computer science, CUBLAS, CUDA, FPGA, Heterogeneous systems, Intel Xeon Phi, Matrix multiplication, nVidia, OpenCL, Package, Virtualization
September 16, 2017 by hgpu
Siddharth Samsi, Brian Helfer, Jeremy Kepner, Albert Reuther, Darrell O. Ricke
Aravind Vasudevan, Andrew Anderson, David Gregg
Tags: Algorithms, ARM, Computer science, CUDA, Deep learning, Machine learning, Matrix multiplication, Neural networks, nVidia, nVidia Tegra TX1, Performance
Shaohuai Shi, Pengfei Xu, Xiaowen Chu
Tags: Algorithms, BLAS, Caffe, Computer science, CUBLAS, CUDA, Deep learning, Linear Algebra, Matrix multiplication, Neural networks, nVidia, nVidia GeForce GTX 1080, Package, Performance
February 14, 2017 by hgpu
Yi-Yan Nan, Quan-Zhe Li, Jin-Chun Piao, Shin-Dug Kim
February 10, 2017 by hgpu
Ali Charara, David Keyes, Hatem Ltaief
Seth D. Pendergrass, J. Nathan Kutz, Steven L. Brunton
December 26, 2016 by hgpu
Farhad Merchant, Tarun Vatwani, Anupam Chattopadhyay, Soumyendu Raha, S K Nandy, Ranjani Narayan
Tags: Algorithms, Computer science, CUDA, Factorization, FPGA, Linear Algebra, Mathematical Software, Matrix multiplication, nVidia, Performance, Tesla C2050
December 17, 2016 by hgpu
Syed Tahir Hussain Rizvi, Gianpiero Cabodi, Denis Patti, Gianluca Francini
December 10, 2016 by hgpu