Carl Andersson, Jonathan Nilsson
December 24, 2023 by
hgpuShixun Wu, Yujia Zhai, Jinyang Liu, Jiajun Huang, Zizhe Jian, Bryan M. Wong, Zizhong Chen
Tags: Code generation, Computer science, CUDA, GEMM, Linear Algebra, Matrix multiplication, nVidia, nVidia A100, Package, Performance, Reliability, Tesla T4
Yi Zhai, Yu Zhang, Shuo Liu, Xiaomeng Chu, Jie Peng, Jianmin Ji, Yanyong Zhang
February 12, 2023 by
hgpuIngunn Sund, Knut A. Kirkhorn, Jacob O. Tørring, Anne C. Elster
Tags: Auto-Tuning, Benchmarking, Computer science, CUDA, Heterogeneous systems, nVidia, nVidia GeForce GTX 980, Package, Performance, Tesla T4, Tesla V100
November 21, 2021 by
hgpuJiarong Xing, Leyuan Wang, Shang Zhang, Jack Chen, Ang Chen, Yibo Zhu
Lars Bjertnes, Jacob O. Tørring, Anne C. Elster
Kai Zhu, Wenyi Zhao, Zhen Zheng, Tianyou Guo, Pengzhan Zhao, Junjie Bai, Jun Yang, Xiaoyong Liu, Lansong Diao, Wei Lin
Mike Turner, Jamil Appa, Neil Ashton
Yatin Chaudhary, Pankaj Gupta, Khushbu Saxena, Vivek Kulkarni, Thomas Runkler, Hinrich Schütze