hgpu.org » AMD Radeon RX 6900 XT
Tsung-Wei Huang
Tags: AMD Radeon RX 6900 XT, ATI, Computer science, Deep learning, nVidia, nVidia GeForce RTX 3090, OpenCL, performance portability, PyTorch, SYCL
September 11, 2022 by hgpu
Recent source codes
* * *
Most viewed papers (last 30 days)
- Data-efficient LLM Fine-tuning for Code Generation
- LithOS: An Operating System for Efficient Machine Learning on GPUs
- Large Language Model Powered C-to-CUDA Code Translation: A Novel Auto-Parallelization Framework
- MSCCL++: Rethinking GPU Communication Abstractions for Cutting-edge AI Applications
- GigaAPI for GPU Parallelization
- Scalability Evaluation of HPC Multi-GPU Training for ECG-based LLMs
- A Power-Efficient Scheduling Approach in a Cpu-Gpu Computing System by Thread-Based Parallel Programming
- DeepCompile: A Compiler-Driven Approach to Optimizing Distributed Deep Learning Training
- InteropUnityCUDA: A Tool for Interoperability Between Unity and CUDA
- GPU-centric Communication Schemes for HPC and ML Applications
* * *