Zhancai Yan, Yaqiu Liu, Hongrun Shao
Yutaro Akahoshi, Sinya Aoki, Tatsumi Aoyama, Issaku Kanamori, Kazuyuki Kanaya, Hideo Matsufuru, Yusuke Namekawa, Hidekatsu Nemura, Yusuke Taniguchi
November 14, 2021 by
hgpuJan Solanti, Michal Babej, Julius Ikkala, Vinod Kumar Malamal Vadakital, Pekka Jääskeläinen
Tags: Computer science, GPU cluster, Heterogeneous systems, Matrix multiplication, nVidia, nVidia GeForce GTX 1060, nVidia GeForce GTX 2080 Ti, OpenCL, Package, Rendering, Tesla P100, Tesla V100
Deepak Narayanan, Mohammad Shoeybi, Jared Casper, Patrick LeGresley, Mostofa Patwary, Vijay Korthikanti, Dmitri Vainbrand, Prethvi Kashinkunti, Julie Bernauer, Bryan Catanzaro, Amar Phanishayee, Matei Zaharia
Dominik Strassel, Philipp Reusch, Janis Keuper
Jianqi Lai, Hang Yu, Zhengyu Tian, Hua Li
September 27, 2020 by
hgpuDeepak Narayanan, Keshav Santhanam, Fiodar Kazhamiaka, Amar Phanishayee, Matei Zaharia
Tags: Computer science, CUDA, Deep learning, FPGA, GPU cluster, Heterogeneous systems, nVidia, Optimization, Task scheduling, Tesla K80, Tesla P100, Tesla V100
Amir Hossein Sojoodi, Majid Salimi Beni, Farshad Khunjush
David J. Lusher, Satya P. Jammy, Neil D. Sandham
Tags: cfd, Code generation, CUDA, Fluid dynamics, GPU cluster, Heterogeneous systems, Numerical simulation, nVidia, OpenCL, OpenMP, OpenMPI, Package, Python, Tesla P100
Jay H. Park, Gyeongchan Yun, Chang M. Yi, Nguyen T. Nguyen, Seungmin Lee, Jaesik Choi, Sam H. Noh, Young-ri Choi
Tags: Computer science, CUDA, Data parallelism, Deep learning, GPU cluster, Heterogeneous systems, Neural networks, nVidia, nVidia GeForce GTX Titan V, nVidia GeForce RTX 2060, nVidia Quadro P 4000, nVidia Titan RTX
Mengdi Wang, Chen Meng, Guoping Long, Chuan Wu, Jun Yang, Wei Lin, Yangqing Jia