hgpu.org » Graph
Ka Wai Wu
Tags: Computer science, CUDA, Graph, Heterogeneous systems, Neural networks, nVidia, PyTorch, Tesla V100
December 24, 2024 by hgpu
Craig McMillan, Emma Hart, Kevin Chalmers
April 15, 2015 by craigmcmillan01
Recent source codes
* * *
Most viewed papers (last 30 days)
- PEAK: A Performance Engineering AI-Assistant for GPU Kernels Powered by Natural Language Transformations
- Hardware Acceleration for Neural Networks: A Comprehensive Survey
- Tilus: A Tile-Level GPGPU Programming Language for Low-Precision Computation
- AccelOpt: A Self-Improving LLM Agentic System for AI Accelerator Kernel Optimization
- The New Compiler Stack: A Survey on the Synergy of LLMs and Compilers
- SeedFold: Scaling Biomolecular Structure Prediction
- Memory-Efficient Acceleration of Block Low-Rank Foundation Models on Resource Constrained GPUs
- KernelEvolve: Scaling Agentic Kernel Coding for Heterogeneous AI Accelerators at Meta
- GPU Kernel Optimization Beyond Full Builds: An LLM Framework with Minimal Executable Programs
- Optimal Software Pipelining and Warp Specialization for Tensor Core GPUs
* * *



