hgpu.org » real time systems
Ante Poljicak, Guillermo Botella, Carlos Garcia, Luka Kedmenec, Manuel Prieto-Matias
Tags: APU, ARM, ATI, ATI Radeon HD 6870, Image processing, nVidia, nVidia GeForce GTX 980, OpenCL, real time systems
March 10, 2018 by hgpu
Rajesh Gandham
Tags: ATI, ATI Radeon HD 7970, cfd, Computer science, CUDA, Fluid dynamics, Numerical simulation, nVidia, nVidia GeForce GTX Titan, OCCA, OpenCL, OpenMP, real time systems, Tesla C2050, Tesla K20, Tesla K40, Thesis
February 8, 2016 by hgpu
Sparsh Mittal
Tags: cache, cache partitioning, cpu, GPU, multitasking, real time systems, survey, WCET
December 15, 2015 by sparsh0mittal
Recent source codes
* * *
Most viewed papers (last 30 days)
- Architecture-Aware LLM Inference Optimization on AMD Instinct GPUs: A Comprehensive Benchmark and Deployment Study
- AutoKernel: Autonomous GPU Kernel Optimization via Iterative Agent-Driven Search
- LLMQ: Efficient Lower-Precision LLM Training for Consumer GPUs
- CuTeGen: An LLM-Based Agentic Framework for Generation and Optimization of High-Performance GPU Kernels using CuTe
- DRTriton: Large-Scale Synthetic Data Reinforcement Learning for Triton Kernel Generation
- MobileKernelBench: Can LLMs Write Efficient Kernels for Mobile Devices?
- Mixed-precision numerics in scientific applications: survey and perspectives
- Triton-Sanitizer: A Fast and Device-Agnostic Memory Sanitizer for Triton with Rich Diagnostic Context
- SOL-ExecBench: Speed-of-Light Benchmarking for Real-World GPU Kernels Against Hardware Limits
- MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU
* * *



