Nadav Rotem, Jordan Fix, Saleem Abdulrasool, Summer Deng, Roman Dzhabarov, James Hegeman, Roman Levenstein, Bert Maher, Satish Nadathur, Jakob Olesen, Jongsoo Park, Artem Rakhov, Misha Smelyanskiy
Tags: Code generation, Compilers, Computer science, CUDA, Deep learning, Heterogeneous systems, Linear Algebra, Machine learning, Neural networks, nVidia, Package
Ken Nakanishi, Shin-ichi Maeda, Takeru Miyato, Daisuke Okanohara
Mattia Antonino Di Gangi, Marcello Federico
Riyadh Baghdadi, Jessica Ray, Malek Ben Romdhane, Emanuele Del Sozzo, Patricia Suriana, Shoaib Kamil, Saman Amarasinghe
Tags: Algorithms, Code generation, Computer science, CUDA, Distributed computing, FPGA, Heterogeneous systems, Linear Algebra, LLVM, MPI, nVidia, OpenMPI, Tesla K40
Leyuan Wang, Yangzihao Wang, Carl Yang, John D. Owens
Shin Morishima, Hiroki Matsutani
Nicolas Weber, Florian Schmidt, Mathias Niepert, Felipe Huici
Tags: AI, Artificial intelligence, cache, CNN, Computer science, cpu, CUDA, Deep learning, GPU, Machine learning, Neural and Evolutionary Computing, nVidia, nVidia GeForce GTX 1080 Ti
Yosuke Oyama, Tal Ben-Nun, Torsten Hoefler, Satoshi Matsuoka
Zhe Jia, Marco Maggioni, Benjamin Staiger, Daniele P. Scarpazza