hgpu.org » Distributed computing
Jiri Barnat, Lubos Brim, Milan Ceska
Tags: Computer science, CUDA, Distributed computing, nVidia, nVidia GeForce GTX 280, Software Engineering
November 9, 2010 by hgpu
Cyrille Rossant, Dan F. Goodman, Jonathan Platkiewicz, Romain Brette
Tags: Adaptive threshold, Biology, CUDA, Distributed computing, Neuroscience, nVidia, nVidia GeForce GTX 295, Optimization, Package
November 6, 2010 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- Revealing NVIDIA Closed-Source Driver Command Streams for CPU-GPU Runtime Behavior Insight
- Evaluating CUDA Tile for AI Workloads on Hopper and Blackwell GPUs
- FACT: Compositional Kernel Synthesis with a Three-Stage Agentic Workflow
- DITRON: Distributed Multi-level Tiling Compiler for Parallel Tensor Programs
- CuBridge: An LLM-Based Framework for Understanding and Reconstructing High-Performance Attention Kernels
- ARGUS: Agentic GPU Optimization Guided by Data-Flow Invariants
- Kerncap: Automated Kernel Extraction and Isolation for AMD GPUs
- KEET: Explaining Performance of GPU Kernels Using LLM Agents
- A Human–Machine Collaborative Tuning Framework for Triton Kernel Optimization on SIMD Platforms
- CUDAHercules: Benchmarking Hardware-Aware Expert-level CUDA Optimization for LLMs
* * *




