Most viewed papers (last 30 days)
- Data-efficient LLM Fine-tuning for Code Generation
- LithOS: An Operating System for Efficient Machine Learning on GPUs
- Dynamic Memory Management on GPUs with SYCL
- LIFT: LLM-Based Pragma Insertion for HLS via GNN Supervised Fine-Tuning
- MSCCL++: Rethinking GPU Communication Abstractions for Cutting-edge AI Applications
- Efficient deep learning inference on end devices
- DeepCompile: A Compiler-Driven Approach to Optimizing Distributed Deep Learning Training
- InteropUnityCUDA: A Tool for Interoperability Between Unity and CUDA
- Mìmir: A real-time interactive visualization library for CUDA programs
- Efficient Graph Embedding at Scale: Optimizing CPU-GPU-SSD Integration
* * *