hgpu.org » AMD Radeon Instinct MI300X
Krishna Teja Chitty-Venkata, Siddhisanket Raskar, Bharat Kale, Farah Ferdaus, Aditya Tanikanti, Ken Raffenetti, Valerie Taylor, Murali Emani, Venkatram Vishwanath
Tags: AI, AMD Radeon Instinct MI250, AMD Radeon Instinct MI300X, Artificial intelligence, ATI, Benchmarking, Computer science, CUDA, LLM, Machine learning, nVidia, nVidia A100, nVidia GH200, nVidia H100, OpenCL, Performance
November 10, 2024 by hgpu