hgpu.org » Graph
Ka Wai Wu
Tags: Computer science, CUDA, Graph, Heterogeneous systems, Neural networks, nVidia, PyTorch, Tesla V100
December 24, 2024 by hgpu
Craig McMillan, Emma Hart, Kevin Chalmers
April 15, 2015 by craigmcmillan01
Recent source codes
A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5
* * *
Most viewed papers (last 30 days)
- DICE: Diffusion Large Language Models Excel at Generating CUDA Kernels
- Accelerating Scientific Research with Gemini: Case Studies and Common Techniques
- BioAgent Bench: An AI Agent Evaluation Suite for Bioinformatics
- Deep Kernel Fusion for Transformers
- Improving HPC Code Generation Capability of LLMs via Online Reinforcement Learning with Real-Machine Benchmark Rewards
- Private LLM Inference on Consumer Blackwell GPUs: A Practical Guide for Cost-Effective Local Deployment in SMEs
- SciDef: Automating Definition Extraction from Academic Literature with Large Language Models
- ProfInfer: An eBPF-based Fine-Grained LLM Inference Profiler
- Nsight Python: A Python-First Profiling Toolkit for Seamless GPU Kernel Analysis (Tool)
- Generating Literature-Driven Scientific Theories at Scale
* * *



