hgpu.org » AMD Radeon R9
A.J. Lazaro-Munoz, J.M. Gonzalez-Linares, J. Gomez-Luna, N. Guil
Tags: AMD Radeon R9, ATI, Computer science, Heterogeneous systems, Intel Xeon Phi, nVidia, OpenCL, Performance, Task scheduling, Tesla K20
June 28, 2018 by hgpu
A.J. Lazaro-Munoz, J.M. Gonzalez-Linares, J. Gomez-Luna, N. Guil
Tags: AMD Radeon R9, ATI, Benchmarking, Computer science, Heterogeneous systems, Intel Xeon Phi, nVidia, OpenCL, Task scheduling, Tesla K20
June 17, 2017 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- DITRON: Distributed Multi-level Tiling Compiler for Parallel Tensor Programs
- KEET: Explaining Performance of GPU Kernels Using LLM Agents
- CuBridge: An LLM-Based Framework for Understanding and Reconstructing High-Performance Attention Kernels
- CUDAHercules: Benchmarking Hardware-Aware Expert-level CUDA Optimization for LLMs
- Kerncap: Automated Kernel Extraction and Isolation for AMD GPUs
- MusaCoder: Native GPU Kernel Generation with Full-Stack Training on Moore Threads GPU
- KernelBenchX: A Comprehensive Benchmark for Evaluating LLM-Generated GPU Kernels
- Pretraining large language models with MXFP4 on Native FP4 Hardware
- Microbenchmark-Driven Analytical Performance Modeling Across Modern GPU Architectures
- KForge: LLM-Driven Cross-Platform Kernel Generation for AI Accelerators
* * *



