hgpu.org » AMD Radeon Instinct MI200
Muhammad Usman Tariq, Abhinav Jangda, Angelica Moreira, Madan Musuvathi, Tyler Sorensen
Tags: AI, AMD, AMD Radeon Instinct MI200, ATI, Computer science, CUDA, HIP, HLSL, LLM, NLP, nVidia, nVidia RTX A6000
December 29, 2025 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- DITRON: Distributed Multi-level Tiling Compiler for Parallel Tensor Programs
- KEET: Explaining Performance of GPU Kernels Using LLM Agents
- CuBridge: An LLM-Based Framework for Understanding and Reconstructing High-Performance Attention Kernels
- CUDAHercules: Benchmarking Hardware-Aware Expert-level CUDA Optimization for LLMs
- Kerncap: Automated Kernel Extraction and Isolation for AMD GPUs
- KernelBenchX: A Comprehensive Benchmark for Evaluating LLM-Generated GPU Kernels
- Pretraining large language models with MXFP4 on Native FP4 Hardware
- Microbenchmark-Driven Analytical Performance Modeling Across Modern GPU Architectures
- CUDABeaver: Benchmarking LLM-Based Automated CUDA Debugging
- Source-to-Source Transformations for GPU Code Generation
* * *



