hgpu.org » AMD FirePro S9000
Ivan Grasso
Tags: AMD FirePro S9000, ATI, ATI Radeon HD 5870, Compilers, Computer science, Heterogeneous systems, Intel Xeon Phi, nVidia, nVidia GeForce GTX 480, OpenCL, Performance, Tesla K20, Thesis
July 25, 2017 by hgpu
Klaus Kofler, Biagio Cosenza, Thomas Fahringer
Tags: Algorithms, AMD FirePro S9000, ATI, Computer science, Data Structures and Algorithms, nVidia, nVidia GeForce GTX 480, OpenCL, Optimization, Tesla K20
June 17, 2015 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning
- Accurate Models of NVIDIA Tensor Cores
- Microbenchmarking NVIDIA's Blackwell Architecture: An in-depth Architectural Analysis
- TritonForge: Profiling-Guided Framework for Automated Triton Kernel Optimization
- PEAK: A Performance Engineering AI-Assistant for GPU Kernels Powered by Natural Language Transformations
- cuPilot: A Strategy-Coordinated Multi-agent Framework for CUDA Kernel Evolution
- Decoupled Triton: A Block-Level Decoupled Language for Writing and Exploring Efficient Machine-Learning Kernels
- Beyond Code Pairs: Dialogue-Based Data Generation for LLM Code Translation
- Tilus: A Tile-Level GPGPU Programming Language for Low-Precision Computation
- Hybrid Learning and Optimization-Based Dynamic Scheduling for DL Workloads on Heterogeneous GPU Clusters
* * *



