Taesu Kim, Jongho Lee, Daehyun Ahn, Sarang Kim, Jiwoong Choi, Minkyu Kim, Hyungjun Kim
Tags: Computer science, CUDA, Deep learning, Machine learning, Matrix multiplication, Mixed precision, nVidia, nVidia A100, nVidia GeForce RTX 4090, nVidia RTX A6000, Package
February 18, 2024 by hgpu
Bin Lei, Caiwen Ding, Le Chen, Pei-Hung Lin, Chunhua Liao
November 19, 2023 by hgpu
Jan Solanti, Michal Babej, Julius Ikkala, Pekka Jääskeläinen
September 6, 2023 by hgpu
Justus Henneberg, Felix Schuhknecht
Neha Jawalkar, Kanav Gupta, Arkaprava Basu, Nishanth Chandran, Divya Gupta, Rahul Sharma
February 26, 2023 by hgpu
Hanqiu Chen, Yahya Alhinai, Yihan Jiang, Eunjee Na, Cong Hao
Wenzel Jakob, Sébastien Speierer, Nicolas Roussel, Delio Vicini
Stefan Abi-Karam, Yuqi He, Rishov Sarkar, Lakshmi Sathidevi, Zihang Qiao, Cong Hao
Martin Uray, Eduard Hirsch, Gerold Katzinger, Michael Gadermayr