hgpu.org » nVidia
Richard Vuduc, Aparna Chandramowlishwaran, Jee Choi, Murat Guney, Aashay Shringarpure
October 27, 2010 by hgpu
Nikolaj Leischner, Vitaly Osipov, Peter Sanders
Tags: Computer science, CUDA, Data Structures and Algorithms, nVidia, nVidia GeForce GTX 285, Sorting, Tesla C1060
October 27, 2010 by hgpu
Guobin Shen,Lihua Zhu,Shipeng Li,Heung-Yeung Shum,Ya-Qin Zhang
Tags: Computer science, nVidia, Video decoding
October 27, 2010 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning
- Accurate Models of NVIDIA Tensor Cores
- Microbenchmarking NVIDIA's Blackwell Architecture: An in-depth Architectural Analysis
- TritonForge: Profiling-Guided Framework for Automated Triton Kernel Optimization
- PEAK: A Performance Engineering AI-Assistant for GPU Kernels Powered by Natural Language Transformations
- cuPilot: A Strategy-Coordinated Multi-agent Framework for CUDA Kernel Evolution
- Decoupled Triton: A Block-Level Decoupled Language for Writing and Exploring Efficient Machine-Learning Kernels
- Beyond Code Pairs: Dialogue-Based Data Generation for LLM Code Translation
- Tilus: A Tile-Level GPGPU Programming Language for Low-Precision Computation
- Hybrid Learning and Optimization-Based Dynamic Scheduling for DL Workloads on Heterogeneous GPU Clusters
* * *



