Kris Shengjun Dong, Sahil Modi, Dima Nikiforov, Sana Damani, Edward Lin, Siva Kumar Sastry Hari, Christos Kozyrakis
February 23, 2026 by
hgpuArijit Bhattacharjee, Heng Ping, Son Vu Le, Paul Bogdan, Nesreen K. Ahmed, Ali Jannesari
February 23, 2026 by
hgpuAaron Jarmusch, Connor Vitz, Sunita Chandrasekaran
February 16, 2026 by
hgpuRyo Mikasa, Shun-ichiro Hayashi, Daichi Mukunoki, Tetsuya Hoshino, Takahiro Katagiri
Tags: Benchmarking, Code generation, Computer science, CUDA, HPC, LLM, Matrix multiplication, nVidia, nVidia H100, OpenMP, Performance
February 16, 2026 by
hgpuZixi Zhang, Zhiwen Mo, Yiren Zhao, Robert Mullins
February 16, 2026 by
hgpuBastian Hagedorn, Alexander Collins, Tony Mongkolsmai, Vinod Grover
Bohua Zou, Debayan Roy, Dhimankumar Yogesh Airao, Weihao Xu, Binqi Sun, Yutao Liu, Haibo Chen
Kaixuan Zhang, Yunfan Cui, Shuhao Zhang, Chutong Ding, Shiyou Qian, Luping Wang, Jian Cao, Guangtao Xue, Cheng Huang, Guodong Yang, Liping Zhang
Tags: Computer science, CUDA, Heterogeneous systems, Machine learning, nVidia, nVidia A100, nVidia A40, nVidia H100, nVidia H20, nVidia H200, nVidia H800, nVidia L20, nVidia L40, nVidia RTX 6000 Ada, Performance, Triton
Ruifan Chu, Anbang Wang, Xiuxiu Bai, Shuai Liu, and Xiaoshe Dong
Genghan Zhang, Shaowei Zhu, Anjiang Wei, Zhenyu Song, Allen Nie, Zhen Jia, Nandita Vijaykumar, Yida Wang, Kunle Olukotun
December 29, 2025 by
hgpuRyan Swann, Muhammad Osama, Xiaohu Guo, Bryant Nelson, Lixun Zhang, Alex Brown, Yen Ong, Ali Yazdani, Sean Siddens, Ganesh Dasika, Alex Underwood
Tags: AMD, AMD Radeon Instinct MI300X, AMD Radeon Instinct MI350X, ATI, BLAS, Computer science, HPC, Package, Performance, ROCm, Triton
Burkhard Ringlein, Jan van Lunteren, Radu Stoica, Thomas Parnell
Tags: AMD Radeon Instinct MI250, AMD Radeon Instinct MI300X, ATI, Computer science, CUDA, DSL, HIP, LLM, nVidia, nVidia H100, Performance, Programming Languages, Triton
November 23, 2025 by
hgpu