Mehmet Deveci, Simon D. Hammond, Michael M. Wolf, Sivasankaran Rajamanickam

Carl Yang, Aydin Buluc, John D. Owens

Riyadh Baghdadi, Jessica Ray, Malek Ben Romdhane, Emanuele Del Sozzo, Patricia Suriana, Shoaib Kamil, Saman Amarasinghe

Tags: Compilers, Computer science, DSL, FPGA, Matrix multiplication, nVidia, OpenMPI, Performance, Programming Languages, PTX, Tesla K40

Muhammad Adnan, Faisal Aslam, Zubair Nawaz, Syed Mansoor Sarwar

Tags: Aparapi, Benchmarking, Computer science, CUDA, Java, Matrix multiplication, nVidia, nVidia GeForce GT 630 M, OpenCL, OpenGL, Package

Hamidreza Khaleghzadeh, Ziming Zhong, Ravi Reddy, Alexey Lastovetsky

Tags: BLAS, Cloud, Computer science, CUBLAS, CUDA, FPGA, Heterogeneous systems, Intel Xeon Phi, Matrix multiplication, nVidia, OpenCL, Package, Virtualization

September 16, 2017 by

hgpuSiddharth Samsi, Brian Helfer, Jeremy Kepner, Albert Reuther, Darrell O. Ricke

Aravind Vasudevan, Andrew Anderson, David Gregg

Tags: Algorithms, ARM, Computer science, CUDA, Deep learning, Machine learning, Matrix multiplication, Neural networks, nVidia, nVidia Tegra TX1, Performance

Shaohuai Shi, Pengfei Xu, Xiaowen Chu

Tags: Algorithms, BLAS, Caffe, Computer science, CUBLAS, CUDA, Deep learning, Linear Algebra, Matrix multiplication, Neural networks, nVidia, nVidia GeForce GTX 1080, Package, Performance

February 14, 2017 by

hgpuYi-Yan Nan, Quan-Zhe Li, Jin-Chun Piao, Shin-Dug Kim

February 10, 2017 by

hgpu