Khoi N.M. Nguyen, Hoang Duy Nguyen Do, Huyen Thao Le, Thanh Tuan Dao
Jianling Li, Shangzhan Li, Zhenye Gao, Qi Shi, Yuxuan Li, Zefan Wang, Jiacheng Huang, Haojie Wang, Jianrong Wang, Xu Han, Zhiyuan Liu, Maosong Sun
Anne Ouyang, Simon Guo, Simran Arora, Alex L. Zhang, William Hu, Christopher Ré, Azalia Mirhoseini
February 24, 2025 by
hgpuRobert Tjarko Lange, Aaditya Prasad, Qi Sun, Maxence Faldor, Yujin Tang, David Ha
February 24, 2025 by
hgpuHeejun Lee, Geon Park, Jaduk Suh, Sung Ju Hwang
February 16, 2025 by
hgpuYouhe Jiang, Fangcheng Fu, Xiaozhe Yao, Guoliang He, Xupeng Miao, Ana Klimovic, Bin Cui, Binhang Yuan, Eiko Yoneki
February 10, 2025 by
hgpuDahua Feng, Zhiming Xu, Rongxiang Wang, Felix Xiaozhu Lin
Tags: AI, Apple M2 Max, Apple M2 Pro, Apple M2 Ultra, Computer science, CUDA, Linear Algebra, LLM, Machine learning, nVidia, nVidia GeForce RTX 4090, nVidia GeFroce RTX 2080 Ti, nVidia Quadro RTX 4000, nVidia RTX A6000, Performance, PyTorch
Ruijun Feng, Hammond Pearce, Pietro Liguori, Yulei Sui
Jiaping Wang, Simiao Zhang, Qiao-Chu He, Yifan Chen
Tags: Benchmarking, Computer science, CUDA, LLM, Machine learning, nVidia, nVidia A100, nVidia RTX A6000, Package, Python, PyTorch
Davide Italiano, Chris Cummins
Aman Chaturvedi, Daniel Nichols, Siddharth Singh, Abhinav Bhatele
Tags: Code generation, Computer science, CUDA, HIP, HPC, LLM, MPI, nVidia, nVidia A100, OpenMP, Package
December 24, 2024 by
hgpu