hgpu.org » Optimization
Daniel Cederman, Philippas Tsigas
Tags: Computer science, CUDA, nVidia, nVidia GeForce 8800 GT, nVidia GeForce 9600 GT, Optimization, Performance
November 5, 2010 by hgpu
Ogier Maitre, Laurent A. Baumes, Nicolas Lachiche, Avelino Corma, Pierre Collet
November 5, 2010 by hgpu
Shane Ryoo, Christopher I. Rodrigues, Sam S. Stone, Sara S. Baghsorkhi, Sain Z. Ueng, John A. Stratton, Wen mei
Tags: Computer science, CUDA, nVidia, nVidia GeForce 8800 GTX, Optimization, Performance, Programming techniques
November 1, 2010 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- Data-efficient LLM Fine-tuning for Code Generation
- LithOS: An Operating System for Efficient Machine Learning on GPUs
- Large Language Model Powered C-to-CUDA Code Translation: A Novel Auto-Parallelization Framework
- MSCCL++: Rethinking GPU Communication Abstractions for Cutting-edge AI Applications
- GigaAPI for GPU Parallelization
- Scalability Evaluation of HPC Multi-GPU Training for ECG-based LLMs
- A Power-Efficient Scheduling Approach in a Cpu-Gpu Computing System by Thread-Based Parallel Programming
- DeepCompile: A Compiler-Driven Approach to Optimizing Distributed Deep Learning Training
- InteropUnityCUDA: A Tool for Interoperability Between Unity and CUDA
- GPU-centric Communication Schemes for HPC and ML Applications
* * *