hgpu.org » AMD Radeon R9
A.J. Lazaro-Munoz, J.M. Gonzalez-Linares, J. Gomez-Luna, N. Guil
Tags: AMD Radeon R9, ATI, Computer science, Heterogeneous systems, Intel Xeon Phi, nVidia, OpenCL, Performance, Task scheduling, Tesla K20
June 28, 2018 by hgpu
A.J. Lazaro-Munoz, J.M. Gonzalez-Linares, J. Gomez-Luna, N. Guil
Tags: AMD Radeon R9, ATI, Benchmarking, Computer science, Heterogeneous systems, Intel Xeon Phi, nVidia, OpenCL, Task scheduling, Tesla K20
June 17, 2017 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- Architecture-Aware LLM Inference Optimization on AMD Instinct GPUs: A Comprehensive Benchmark and Deployment Study
- AutoKernel: Autonomous GPU Kernel Optimization via Iterative Agent-Driven Search
- LLMQ: Efficient Lower-Precision LLM Training for Consumer GPUs
- CuTeGen: An LLM-Based Agentic Framework for Generation and Optimization of High-Performance GPU Kernels using CuTe
- DRTriton: Large-Scale Synthetic Data Reinforcement Learning for Triton Kernel Generation
- MobileKernelBench: Can LLMs Write Efficient Kernels for Mobile Devices?
- Mixed-precision numerics in scientific applications: survey and perspectives
- Triton-Sanitizer: A Fast and Device-Agnostic Memory Sanitizer for Triton with Rich Diagnostic Context
- SOL-ExecBench: Speed-of-Light Benchmarking for Real-World GPU Kernels Against Hardware Limits
- MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU
* * *



