Tal Ben-Nun, Alice Shoshana Jakobovits, Torsten Hoefler
Tags: ATI, ATI Radeon HD 7970, Code generation, Computer science, CUDA, Deep learning, LLVM, LSTM, Machine learning, NLP, nVidia, nVidia GeForce GTX 970, OpenCL, Programming Languages, RNN
Riyadh Baghdadi, Jessica Ray, Malek Ben Romdhane, Emanuele Del Sozzo, Patricia Suriana, Shoaib Kamil, Saman Amarasinghe
Tags: Compilers, Computer science, DSL, FPGA, Matrix multiplication, nVidia, OpenMPI, Performance, Programming Languages, PTX, Tesla K40
Tim Besard, Christophe Foket, Bjorn De Sutter
December 15, 2017 by hgpu
Gheorghe-Teodor Bercea, Carlo Bertolli, Arpith C. Jacob, Alexandre Eichenberger, Alexey Bataev, Georgios Rokos, Hyojin Sung, Tong Chen, Kevin O'Brien
November 30, 2017 by hgpu
Ashkan Tousimojarad, Wim Vanderbauwhede, W Paul Cockshott
November 30, 2017 by hgpu
Tyler Sorensen, Hugues Evrard, Alastair F. Donaldson
Tian Zhao, Xiaobing Huang, Yu Cao
Tags: Computer science, CUDA, Deep learning, DSL, Java, Machine learning, nVidia, Package, Programming Languages, Scala, Tesla K40
Blake A. Hechtman, Andrew D. Hilton, Daniel J. Sorin
Sreepathi Pai, Keshav Pingali
Andreas Klockner, Lucas C. Wilcox, T. Warburton
Tags: Algorithms, AMD Radeon R9 Fury, ATI, Code generation, Computer science, Fortran, Numerical Analysis, nVidia, nVidia GeForce GTX Titan X, OpenCL, Package, Performance, Prefetch, Programming Languages, Python, Tesla K40