Ingunn Sund, Knut A. Kirkhorn, Jacob O. Tørring, Anne C. Elster
Tags: Auto-Tuning, Benchmarking, Computer science, CUDA, Heterogeneous systems, nVidia, nVidia GeForce GTX 980, Package, Performance, Tesla T4, Tesla V100
November 21, 2021 by
hgpuYutaro Akahoshi, Sinya Aoki, Tatsumi Aoyama, Issaku Kanamori, Kazuyuki Kanaya, Hideo Matsufuru, Yusuke Namekawa, Hidekatsu Nemura, Yusuke Taniguchi
November 14, 2021 by
hgpuYuhang Li, Mingzhu Shen, Jian Ma, Yan Ren, Mingxin Zhao, Qi Zhang, Ruihao Gong, Fengwei Yu, Junjie Yan
November 14, 2021 by
hgpuZane Fink, Simeng Liu, Jaemin Choi, Matthias Diener, Laxmikant V. Kale
Tags: Benchmarking, Computer science, CUDA, Distributed computing, HPC, Machine learning, MPI, nVidia, Package, Performance, Python, Tesla V100
November 14, 2021 by
hgpuAditya K Kamath, Arkaprava Basu
Jan Hückelheim, Laurent Hascoët
Zhengda Bian, Hongxin Liu, Boxiang Wang, Haichen Huang, Yongbin Li, Chuanrui Wang, Fan Cui, Yang You
Yao-Yuan Yang, Moto Hira, Zhaoheng Ni, Anjali Chourdia, Artyom Astafurov, Caroline Chen, Ching-Feng Yeh, Christian Puhrsch, David Pollack, Dmitriy Genzel, Donny Greenberg, Edward Z. Yang, Jason Lian, Jay Mahadeokar, Jeff Hwang, Ji Chen, Peter Goldsborough, Prabhat Roy, Sean Narenthiran, Shinji Watanabe, Soumith Chintala, Vincent Quenneville-Bélair, Yangyang Shi
Muhammet Abdullah Soyturk, Palwisha Akhtar, Erhan Tezcan, Didem Unat