Dahua Feng, Zhiming Xu, Rongxiang Wang, Felix Xiaozhu Lin
Tags: AI, Apple M2 Max, Apple M2 Pro, Apple M2 Ultra, Computer science, CUDA, Linear Algebra, LLM, Machine learning, nVidia, nVidia GeForce RTX 4090, nVidia GeForce RTX 2080 Ti, nVidia Quadro RTX 4000, nVidia RTX A6000, Performance, PyTorch
Jiaping Wang, Simiao Zhang, Qiao-Chu He, Yifan Chen
Tags: Benchmarking, Computer science, CUDA, LLM, Machine learning, nVidia, nVidia A100, nVidia RTX A6000, Package, Python, PyTorch
Taesu Kim, Jongho Lee, Daehyun Ahn, Sarang Kim, Jiwoong Choi, Minkyu Kim, Hyungjun Kim
Tags: Computer science, CUDA, Deep learning, Machine learning, Matrix multiplication, Mixed precision, nVidia, nVidia A100, nVidia GeForce RTX 4090, nVidia RTX A6000, Package
February 18, 2024 by hgpu
Bin Lei, Caiwen Ding, Le Chen, Pei-Hung Lin, Chunhua Liao
November 19, 2023 by hgpu
Jan Solanti, Michal Babej, Julius Ikkala, Pekka Jääskeläinen
September 6, 2023 by hgpu
Justus Henneberg, Felix Schuhknecht
Neha Jawalkar, Kanav Gupta, Arkaprava Basu, Nishanth Chandran, Divya Gupta, Rahul Sharma
February 26, 2023 by hgpu
Hanqiu Chen, Yahya Alhinai, Yihan Jiang, Eunjee Na, Cong Hao
Wenzel Jakob, Sébastien Speierer, Nicolas Roussel, Delio Vicini