Siddharth Singh, Zack Sating, Abhinav Bhatele
Philip Salzmann, Fabian Knorr, Peter Thoman, Philipp Gschwandtner, Biagio Cosenza, Thomas Fahringer
Diandian Gu, Xintong Xie, Gang Huang, Xin Jin, Xuanzhe Liu
Menglu Yu, Bo Ji, Hridesh Rajan, Jia Liu
Zhancai Yan, Yaqiu Liu, Hongrun Shao
Yutaro Akahoshi, Sinya Aoki, Tatsumi Aoyama, Issaku Kanamori, Kazuyuki Kanaya, Hideo Matsufuru, Yusuke Namekawa, Hidekatsu Nemura, Yusuke Taniguchi
November 14, 2021 by
hgpuJan Solanti, Michal Babej, Julius Ikkala, Vinod Kumar Malamal Vadakital, Pekka Jääskeläinen
Tags: Computer science, GPU cluster, Heterogeneous systems, Matrix multiplication, nVidia, nVidia GeForce GTX 1060, nVidia GeForce GTX 2080 Ti, OpenCL, Package, Rendering, Tesla P100, Tesla V100
Deepak Narayanan, Mohammad Shoeybi, Jared Casper, Patrick LeGresley, Mostofa Patwary, Vijay Korthikanti, Dmitri Vainbrand, Prethvi Kashinkunti, Julie Bernauer, Bryan Catanzaro, Amar Phanishayee, Matei Zaharia
Dominik Strassel, Philipp Reusch, Janis Keuper
Jianqi Lai, Hang Yu, Zhengyu Tian, Hua Li
September 27, 2020 by
hgpuDeepak Narayanan, Keshav Santhanam, Fiodar Kazhamiaka, Amar Phanishayee, Matei Zaharia
Tags: Computer science, CUDA, Deep learning, FPGA, GPU cluster, Heterogeneous systems, nVidia, Optimization, Task scheduling, Tesla K80, Tesla P100, Tesla V100
Amir Hossein Sojoodi, Majid Salimi Beni, Farshad Khunjush