hgpu.org » nVidia GeForce GTX 860
Usman Ahmed, Jerry Chun-Wei Lin, Gautam Srivastava
Tags: Computer science, Heterogeneous systems, IoT, Machine learning, nVidia, nVidia GeForce GTX 760, nVidia GeForce GTX 860, OpenCL
July 17, 2022 by hgpu
Bilel Ben Romdhanne
May 29, 2015 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- Diagnosing FP4 inference: a layer-wise and block-wise sensitivity analysis of NVFP4 and MXFP4
- Architecture-Aware LLM Inference Optimization on AMD Instinct GPUs: A Comprehensive Benchmark and Deployment Study
- EvoScientist: Towards Multi-Agent Evolving AI Scientists for End-to-End Scientific Discovery
- LLMQ: Efficient Lower-Precision LLM Training for Consumer GPUs
- KernelSkill: A Multi-Agent Framework for GPU Kernel Optimization
- AutoKernel: Autonomous GPU Kernel Optimization via Iterative Agent-Driven Search
- An Efficient Heterogeneous Co-Design for Fine-Tuning on a Single GPU
- MobileKernelBench: Can LLMs Write Efficient Kernels for Mobile Devices?
- DRTriton: Large-Scale Synthetic Data Reinforcement Learning for Triton Kernel Generation
- KernelFoundry: Hardware-aware evolutionary GPU kernel optimization
* * *



