Zane Fink, Simeng Liu, Jaemin Choi, Matthias Diener, Laxmikant V. Kale
Tags: Benchmarking, Computer science, CUDA, Distributed computing, HPC, Machine learning, MPI, nVidia, Package, Performance, Python, Tesla V100
November 14, 2021 by
hgpuJaemin Choi, Zane Fink, Sam White, Nitin Bhat, David F. Richards, Laxmikant V. Kale
February 28, 2021 by
hgpuAamir Shafi, Jahanzeb Maqbool Hashmi, Hari Subramoni, Dhabaleswar K. (DK) Panda
Johannes de Fine Licht, Andreas Kuster, Tiziano De Matteis, Tal Ben-Nun, Dominic Hofer, Torsten Hoefler
Tags: Code generation, Computer science, CUDA, Distributed computing, FPGA, Heterogeneous systems, nVidia, OpenCL, Package, Tesla P100, Tesla V100
José Á. Morell, Andrés Camero, Enrique Alba
Guanhua Wang, Shivaram Venkataraman, Amar Phanishayee, Jorgen Thelin, Nikhil Devanur, Ion Stoica
Yidi Wu, Kaihao Ma, Xiao Yan, Zhi Liu, James Cheng
September 29, 2019 by
hgpuWei Zhang, Xiaodong Cui, Ulrich Finkler, George Saon, Abdullah Kayi, Alper Buyuktosunoglu, Brian Kingsbury, David Kung, Michael Picheny
Adrian Jackson, Andrew Turner, Michele Weiland, Nick Johnson, Olly Perks, Mark Parsons
David Pfander, Gregor Daiss, Dirk Pfluger
Tags: Clustering, Computer science, Data mining, Distributed computing, Heterogeneous systems, Machine learning, MPI, nVidia, OpenCL, Package, performance portability, Tesla P100
February 10, 2019 by
hgpuXi Chen, Gregory S. Gutmann, Joe Bungo
December 23, 2018 by
hgpu