hgpu.org » Stream Computing
Gabriele Mencagli, Patrizio Dazzi, Massimo Coppola
Tags: Computer science, CUDA, DSP, nVidia, nVidia A30, Package, Stream Computing
August 4, 2024 by hgpu
I. Buck, T. Foley, D. Horn, J. Sugerman, K. Mike, H. Pat
Tags: ATI, ATI Radeon 9800 XT, ATI Stream, Brook, Computer science, High-level Languages, nVidia, nVidia GeForce FX 5900 Ultra, OpenGL, Stream Computing
November 3, 2010 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- Diagnosing FP4 inference: a layer-wise and block-wise sensitivity analysis of NVFP4 and MXFP4
- Architecture-Aware LLM Inference Optimization on AMD Instinct GPUs: A Comprehensive Benchmark and Deployment Study
- EvoScientist: Towards Multi-Agent Evolving AI Scientists for End-to-End Scientific Discovery
- LLMQ: Efficient Lower-Precision LLM Training for Consumer GPUs
- KernelSkill: A Multi-Agent Framework for GPU Kernel Optimization
- AutoKernel: Autonomous GPU Kernel Optimization via Iterative Agent-Driven Search
- An Efficient Heterogeneous Co-Design for Fine-Tuning on a Single GPU
- MobileKernelBench: Can LLMs Write Efficient Kernels for Mobile Devices?
- DRTriton: Large-Scale Synthetic Data Reinforcement Learning for Triton Kernel Generation
- KernelFoundry: Hardware-aware evolutionary GPU kernel optimization
* * *




