Chuanhao Zhuge, Xinheng Liu, Xiaofan Zhang, Sudeep Gummadi, Jinjun Xiong, Deming Chen
H Daisaka, N Nakasato, T Ishikawa, F Yuasa, K Nitadori
Kamel Abdelouahab, Maxime Pelcat, Jocelyn Serot, Francois Berry
Stylianos I. Venieris, Alexandros Kouris, Christos-Savvas Bouganis
Maria Kotsifakou, Prakalp Srivastava, Matthew D. Sinclair, Rakesh Komuravelli, Vikram Adve, Sarita Adve
Ryohei Kobayashi, Yuma Oobata, Norihisa Fujita, Yoshiki Yamaguchi, Taisuke Boku
Riyadh Baghdadi, Jessica Ray, Malek Ben Romdhane, Emanuele Del Sozzo, Patricia Suriana, Shoaib Kamil, Saman Amarasinghe
Tags: Compilers, Computer science, DSL, FPGA, Matrix multiplication, nVidia, OpenMPI, Performance, Programming Languages, PTX, Tesla K40
Alfonso Rodriguez, Cesar Castanares, Teresa Riesgo, Eduardo de la Torre
February 17, 2018 by hgpu
Tianqi Chen, Thierry Moreau, Ziheng Jiang, Haichen Shen, Eddie Yan, Leyuan Wang, Yuwei Hu, Luis Ceze, Carlos Guestrin, Arvind Krishnamurthy
Tags: Artificial intelligence, Computer science, CUDA, Deep learning, FPGA, Machine learning, nVidia, nVidia GeForce GTX 1080, OpenCL, Package, performance portability, TensorFlow, Tesla K80
February 15, 2018 by hgpu
Javier Alejandro Varela, Norbert Wehn
February 10, 2018 by hgpu
Hamid Reza Zohouri, Artur Podobas, Satoshi Matsuoka