hgpu.org » AMD Vega FE
Szilárd Páll, Artem Zhmurov, Paul Bauer, Mark Abraham, Magnus Lundborg, Alan Gray, Berk Hess, Erik Lindahl
Tags: Algorithms, AMD Radeon Instinct Mi50, AMD Vega FE, ATI, Chemistry, CUDA, Heterogeneous systems, Molecular dynamics, MPI, nVidia, nVidia GeForce RTX 2080, nVidia Quadro P 6000, OpenCL, Package, Tesla V100
June 21, 2020 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- Architecture-Aware LLM Inference Optimization on AMD Instinct GPUs: A Comprehensive Benchmark and Deployment Study
- AutoKernel: Autonomous GPU Kernel Optimization via Iterative Agent-Driven Search
- LLMQ: Efficient Lower-Precision LLM Training for Consumer GPUs
- An Efficient Heterogeneous Co-Design for Fine-Tuning on a Single GPU
- DRTriton: Large-Scale Synthetic Data Reinforcement Learning for Triton Kernel Generation
- MobileKernelBench: Can LLMs Write Efficient Kernels for Mobile Devices?
- CuTeGen: An LLM-Based Agentic Framework for Generation and Optimization of High-Performance GPU Kernels using CuTe
- KernelFoundry: Hardware-aware evolutionary GPU kernel optimization
- Mixed-precision numerics in scientific applications: survey and perspectives
- Triton-Sanitizer: A Fast and Device-Agnostic Memory Sanitizer for Triton with Rich Diagnostic Context
* * *




