Changho Hwang, KyoungSoo Park, Ran Shu, Xinyuan Qu, Peng Cheng, Yongqiang Xiong
Jinfan Chen, Shigang Li, Ran Guo, Jinhui Yuan, Torsten Hoefler
Nathan Pemberton, Anton Zabreyko, Zhoujie Ding, Randy Katz, Joseph Gonzalez
December 25, 2022 by hgpu
Stijn Heldens, Pieter Hijma, Ben van Werkhoven, Jason Maassen, Rob V. van Nieuwpoort
February 20, 2022 by hgpu
Ji Liu, Zhihua Wu, Dianhai Yu, Yanjun Ma, Danlei Feng, Minxu Zhang, Xinxuan Wu, Xuefeng Yao, Dejing Dou
November 28, 2021 by hgpu
Zane Fink, Simeng Liu, Jaemin Choi, Matthias Diener, Laxmikant V. Kale
Tags: Benchmarking, Computer science, CUDA, Distributed computing, HPC, Machine learning, MPI, nVidia, Package, Performance, Python, Tesla V100
November 14, 2021 by hgpu
Jaemin Choi, Zane Fink, Sam White, Nitin Bhat, David F. Richards, Laxmikant V. Kale
February 28, 2021 by hgpu
Aamir Shafi, Jahanzeb Maqbool Hashmi, Hari Subramoni, Dhabaleswar K. (DK) Panda
Johannes de Fine Licht, Andreas Kuster, Tiziano De Matteis, Tal Ben-Nun, Dominic Hofer, Torsten Hoefler
Tags: Code generation, Computer science, CUDA, Distributed computing, FPGA, Heterogeneous systems, nVidia, OpenCL, Package, Tesla P100, Tesla V100
José Á. Morell, Andrés Camero, Enrique Alba