hgpu.org » nVidia GeForce GTX 780 M
Raphael Hiesgen, Dominik Charousset, Thomas C. Schmidt
Tags: Computer science, Data parallelism, Heterogeneous systems, Intel Xeon Phi, nVidia, nVidia GeForce GTX 780 M, OpenCL, Package, Tesla C2075
September 28, 2017 by hgpu
David S. Lawrie
March 10, 2016 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- Data-efficient LLM Fine-tuning for Code Generation
- LithOS: An Operating System for Efficient Machine Learning on GPUs
- Large Language Model Powered C-to-CUDA Code Translation: A Novel Auto-Parallelization Framework
- MSCCL++: Rethinking GPU Communication Abstractions for Cutting-edge AI Applications
- GigaAPI for GPU Parallelization
- Scalability Evaluation of HPC Multi-GPU Training for ECG-based LLMs
- A Power-Efficient Scheduling Approach in a Cpu-Gpu Computing System by Thread-Based Parallel Programming
- DeepCompile: A Compiler-Driven Approach to Optimizing Distributed Deep Learning Training
- InteropUnityCUDA: A Tool for Interoperability Between Unity and CUDA
- GPU-centric Communication Schemes for HPC and ML Applications
* * *