Yusuke Nagasaka, Satoshi Matsuoka, Ariful Azad, Aydin Buluc
Mehmet Deveci, Simon D. Hammond, Michael M. Wolf, Sivasankaran Rajamanickam
Carl Yang, Aydin Buluc, John D. Owens
Riyadh Baghdadi, Jessica Ray, Malek Ben Romdhane, Emanuele Del Sozzo, Patricia Suriana, Shoaib Kamil, Saman Amarasinghe
Tags: Compilers, Computer science, DSL, FPGA, Matrix multiplication, nVidia, OpenMPI, Performance, Programming Languages, PTX, Tesla K40
Muhammad Adnan, Faisal Aslam, Zubair Nawaz, Syed Mansoor Sarwar
Tags: Aparapi, Benchmarking, Computer science, CUDA, Java, Matrix multiplication, nVidia, nVidia GeForce GT 630 M, OpenCL, OpenGL, Package
Hamidreza Khaleghzadeh, Ziming Zhong, Ravi Reddy, Alexey Lastovetsky
Tags: BLAS, Cloud, Computer science, CUBLAS, CUDA, FPGA, Heterogeneous systems, Intel Xeon Phi, Matrix multiplication, nVidia, OpenCL, Package, Virtualization
September 16, 2017 by
hgpuSiddharth Samsi, Brian Helfer, Jeremy Kepner, Albert Reuther, Darrell O. Ricke
Aravind Vasudevan, Andrew Anderson, David Gregg
Tags: Algorithms, ARM, Computer science, CUDA, Deep learning, Machine learning, Matrix multiplication, Neural networks, nVidia, nVidia Tegra TX1, Performance
Shaohuai Shi, Pengfei Xu, Xiaowen Chu
Tags: Algorithms, BLAS, Caffe, Computer science, CUBLAS, CUDA, Deep learning, Linear Algebra, Matrix multiplication, Neural networks, nVidia, nVidia GeForce GTX 1080, Package, Performance
February 14, 2017 by
hgpu