Daniel Cussen, Jeffrey D. Ullman

Hiroyuki Ootomo, Katsuhisa Ozaki, Rio Yokota

Tags: Computer science, CUBLAS, CUDA, Deep learning, Linear Algebra, Machine learning, Matrix multiplication, nVidia, nVidia A100, nVidia Jetson AGX Orin, nVidia RTX 6000 Ada, nVidia Titan RTX, Package

Fumiya Kono, Naohito Nakasato, Maho Nakata

Shixun Wu, Yujia Zhai, Jinyang Liu, Jiajun Huang, Zizhe Jian, Bryan M. Wong, Zizhong Chen

Tags: Code generation, Computer science, CUDA, GEMM, Linear Algebra, Matrix multiplication, nVidia, nVidia A100, Package, Performance, Reliability, Tesla T4

Zhiyi Li, Douglas Orr, Valeriu Ohan, Godfrey Da costa, Tom Murray, Adam Sanders, Deniz Beker, Dominic Masters

Jonathan Wapman, Sean Treichler, Serban D. Porumbescu, John D. Owens

Anna Fortenberry, Stanimire Tomov

December 25, 2022 by

hgpuMuhammad Osama

Tags: Algorithms, Computer science, CUDA, Linear Algebra, load balancing, Matrix multiplication, nVidia, nVidia A100, Package, Sparse, Thesis

December 25, 2022 by

hgpuGenghan Zhang, Yuetong Zhao, Yanting Tao, Zhongming Yu, Guohao Dai, Sitao Huang, Yuan Wen, Pavlos Petoumenos, Yu Wang

September 11, 2022 by

hgpuTim Dettmers, Mike Lewis, Younes Belkada, Luke Zettlemoyer