Jianyu Huang, Chenhan D. Yu, Robert A. van de Geijn

September 2, 2018 by hgpu

Ammar Ahmad Awan, Hari Subramoni, Dhabaleswar K. Panda

Tags: Benchmarking, Caffe, Computer science, CUBLAS, CUDA, Deep learning, Intel Xeon Phi, Machine learning, nVidia, Tesla K40, Tesla K80, Tesla P100

December 24, 2017 by hgpu

Hamidreza Khaleghzadeh, Ziming Zhong, Ravi Reddy, Alexey Lastovetsky

Tags: BLAS, Cloud, Computer science, CUBLAS, CUDA, FPGA, Heterogeneous systems, Intel Xeon Phi, Matrix multiplication, nVidia, OpenCL, Package, Virtualization

September 16, 2017 by hgpu

Ruxi Qi, Wesley M. Botello-Smith, Ray Luo

Tags: Biomolecules, Biophysics, Boltzmann equation, Computational Physics, CUBLAS, CUDA, Electrostatics, Molecular simulation, nVidia, nVidia GeForce GTX 980 Ti, Physics, Poisson Boltzmann

Shaohuai Shi, Pengfei Xu, Xiaowen Chu

Tags: Algorithms, BLAS, Caffe, Computer science, CUBLAS, CUDA, Deep learning, Linear Algebra, Matrix multiplication, Neural networks, nVidia, nVidia GeForce GTX 1080, Package, Performance

February 14, 2017 by hgpu

Axel Modave, Amik St-Cyr, Tim Warburton

Tags: BLAS, Computational Physics, CUBLAS, CUDA, FEM, Finite element method, Geoscience, Linear Algebra, nVidia, nVidia GeForce GTX 980, OCCA, Physics, Profiling, Seismic modeling, Seismology

Linnan Wang, Wei Wu, Jianxiong Xiao, Yi Yang

Tags: Benchmarking, Computer science, CUBLAS, CUDA, Heterogeneous systems, Linear Algebra, nVidia, nVidia GeForce GTX Titan X, Package, Performance, Tesla K40

Reza Bosagh Zadeh, Xiangrui Meng, Burak Yavuz, Aaron Staple, Li Pu, Shivaram Venkataraman, Evan Sparks, Alexander Ulanov, Matei Zaharia

Tags: Algorithms, Benchmarking, Computer science, CUBLAS, CUDA, Linear Algebra, Machine learning, Matrix multiplication, nVidia, Package, Scala, Tesla M2050

September 15, 2015 by hgpu

Ayaz ul Hasan Khan, Mayez Al-Mouhamed, Allam Fatayer

Azzam Haidar, Tingxing "Tim" Dong, Stanimire Tomov, Piotr Luszczek, Jack Dongarra

Azzam Haidar, Tingxing Dong, Piotr Luszczek, Stanimire Tomov, Jack Dongarra