hgpu.org » AMD Radeon Instinct MI355X
Musa Cim, Poovaiah Palangappa, Miro Hodak, Ravi Dwivedula, Meena Arunachalam, Mahmut Taylan Kandemir
Tags: AMD, AMD Radeon Instinct MI355X, Computer science, LLM, Precision, ROCm
May 20, 2026 by hgpu
William Hu, Drew Wadsworth, Sean Siddens, Stanley Winata, Daniel Y. Fu, Ryann Swann, Muhammad Osama, Christopher Ré, Simran Arora
November 16, 2025 by hgpu
Most viewed papers (last 30 days)
- CUDAHercules: Benchmarking Hardware-Aware Expert-level CUDA Optimization for LLMs
- MusaCoder: Native GPU Kernel Generation with Full-Stack Training on Moore Threads GPU
- KernelBenchX: A Comprehensive Benchmark for Evaluating LLM-Generated GPU Kernels
- Pretraining large language models with MXFP4 on Native FP4 Hardware
- KForge: LLM-Driven Cross-Platform Kernel Generation for AI Accelerators
- Analyzing the Impact of Kernel Fusion on GPU Tensor Operation Performance: A Systematic Performance Study
- CUDABeaver: Benchmarking LLM-Based Automated CUDA Debugging
- Source-to-Source Transformations for GPU Code Generation
- Towards Feedback-to-Plan Decisions for Self-Evolving LLM Agents in CUDA Kernel Generation
- CodegenBench: Can LLMs Write Efficient Code Across Architectures?