hgpu.org » Text mining
Yongpeng Zhang, Frank Mueller, Xiaohui Cui, Thomas Potok
March 11, 2011 by hgpu
M. D. Lieberman, J. Sankaranarayanan, H. Samet
December 12, 2010 by hgpu
Joseph M. Cavanagh, Thomas E. Potok, Xiaohui Cui
November 21, 2010 by hgpu



