hgpu.org » nVidia GeForce RTX 4060
Erel Kaplan, Tomer Bitan, Lian Ghrayeb, Le Chen, Tom Yotam, Niranjan Hasabnis, Gal Oren
Tags: Code generation, Computer science, CUDA, LLM, nVidia, nVidia GeForce RTX 4060, OpenMP, Package
January 12, 2026 by hgpu
Performant Unified GPU Kernels for Portable Singular Value Computation Across Hardware and Precision
Evelyne Ringoot, Rabab Alomairy, Valentin Churavy, Alan Edelman
Tags: AMD Radeon Instinct MI250, Apple M1 Pro, ATI, Computer science, HIP, Intel, Intel Ponte Vecchio Max 1100, Kokkos, Linear Algebra, Machine learning, nVidia, nVidia A100, nVidia GeForce RTX 4060, nVidia H100, OpenCL, SYCL
August 17, 2025 by hgpu
Recent source codes
A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5
* * *
Most viewed papers (last 30 days)
- DICE: Diffusion Large Language Models Excel at Generating CUDA Kernels
- Accelerating Scientific Research with Gemini: Case Studies and Common Techniques
- Deep Kernel Fusion for Transformers
- Improving HPC Code Generation Capability of LLMs via Online Reinforcement Learning with Real-Machine Benchmark Rewards
- SciDef: Automating Definition Extraction from Academic Literature with Large Language Models
- StitchCUDA: An Automated Multi-Agents End-to-End GPU Programing Framework with Rubric-based Agentic Reinforcement Learning
- Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations
- Inside VOLT: Designing an Open-Source GPU Compiler (Tool)
- Execution-Centric Characterization of FP8 Matrix Cores, Asynchronous Execution, and Structured Sparsity on AMD MI300A
- HetCCL: Accelerating LLM Training with Heterogeneous GPUs
* * *




