hgpu.org » nVidia GeForce RTX 4060
Erel Kaplan, Tomer Bitan, Lian Ghrayeb, Le Chen, Tom Yotam, Niranjan Hasabnis, Gal Oren
Tags: Code generation, Computer science, CUDA, LLM, nVidia, nVidia GeForce RTX 4060, OpenMP, Package
January 12, 2026 by hgpu
Performant Unified GPU Kernels for Portable Singular Value Computation Across Hardware and Precision
Evelyne Ringoot, Rabab Alomairy, Valentin Churavy, Alan Edelman
Tags: AMD Radeon Instinct MI250, Apple M1 Pro, ATI, Computer science, HIP, Intel, Intel Ponte Vecchio Max 1100, Kokkos, Linear Algebra, Machine learning, nVidia, nVidia A100, nVidia GeForce RTX 4060, nVidia H100, OpenCL, SYCL
August 17, 2025 by hgpu
* * *
Most viewed papers (last 30 days)
- CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning
- PEAK: A Performance Engineering AI-Assistant for GPU Kernels Powered by Natural Language Transformations
- Hardware Acceleration for Neural Networks: A Comprehensive Survey
- cuPilot: A Strategy-Coordinated Multi-agent Framework for CUDA Kernel Evolution
- Tilus: A Tile-Level GPGPU Programming Language for Low-Precision Computation
- BoltzGen: Toward Universal Binder Design
- Beyond Code Pairs: Dialogue-Based Data Generation for LLM Code Translation
- The New Compiler Stack: A Survey on the Synergy of LLMs and Compilers
- AccelOpt: A Self-Improving LLM Agentic System for AI Accelerator Kernel Optimization
- SeedFold: Scaling Biomolecular Structure Prediction