Riyadh Baghdadi, Jessica Ray, Malek Ben Romdhane, Emanuele Del Sozzo, Patricia Suriana, Shoaib Kamil, Saman Amarasinghe
Tags: Algorithms, Code generation, Computer science, CUDA, Distributed computing, FPGA, Heterogeneous systems, Linear Algebra, LLVM, MPI, nVidia, OpenMPI, Tesla K40
Riyadh Baghdadi, Jessica Ray, Malek Ben Romdhane, Emanuele Del Sozzo, Patricia Suriana, Shoaib Kamil, Saman Amarasinghe
Tags: Compilers, Computer science, DSL, FPGA, Matrix multiplication, nVidia, OpenMPI, Performance, Programming Languages, PTX, Tesla K40
Jingbo Zhou, Qi Guo, H. V. Jagadish, Lubos Krcal, Siyuan Liu, Wenhao Luan, Anthony K. H. Tung, Yueji Yang, Yuxin Zheng
December 28, 2017 by
hgpuAmmar Ahmad Awan, Ching-Hsiang Chu, Hari Subramoni, Dhabaleswar K. Panda
Linnan Wang, Wei Wu, George Bosilca, Richard Vuduc, Zenglin Xu
November 16, 2016 by
hgpuSteven Eliuk, Cameron Upright, Anthony Skjellum
Tags: Computer science, CUDA, Deep learning, Heterogeneous systems, Linear Algebra, Matrix multiplication, Neural and Evolutionary Computing, Neural networks, nVidia, OpenMPI, Tesla K80
Flavio Vella, Giancarlo Carbone, Massimo Bernaschi
Lev E. Givon, Aurel A. Lazar
Yukio Iwaya, Makoto Otani, Takao Tsuchiya, Yasushi Inoguchi