hgpu.org » AMD Radeon R9
A.J. Lazaro-Munoz, J.M. Gonzalez-Linares, J. Gomez-Luna, N. Guil
Tags: AMD Radeon R9, ATI, Computer science, Heterogeneous systems, Intel Xeon Phi, nVidia, OpenCL, Performance, Task scheduling, Tesla K20
June 28, 2018 by hgpu
A.J. Lazaro-Munoz, J.M. Gonzalez-Linares, J. Gomez-Luna, N. Guil
Tags: AMD Radeon R9, ATI, Benchmarking, Computer science, Heterogeneous systems, Intel Xeon Phi, nVidia, OpenCL, Task scheduling, Tesla K20
June 17, 2017 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- CudaForge: An Agent Framework with Hardware Feedback for CUDA Kernel Optimization
- Scalable GPU-Based Integrity Verification for Large Machine Learning Models
- STARK: Strategic Team of Agents for Refining Kernels
- Tutoring LLM into a Better CUDA Optimizer
- INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats
- Collective Communication for 100k+ GPUs
- An MLIR pipeline for offloading Fortran to FPGAs via OpenMP
- Enhancing Transformer Performance and Portability through Auto-tuning Frameworks
- RDMA Point-to-Point Communication for LLM Systems
- A Study of Floating-Point Precision Tuning in Deep Learning Operators Implementations
* * *



