Recent source codes
* * *
Most viewed papers (last 30 days)
- Acceleration as a Service (XaaS) Source Containers
- Omniwise: Predicting GPU Kernels Performance with LLMs
- Exploring SYCL as a Portability Layer for High-Performance Computing on CPUs
- All You Need Is Binary Search! A Practical View on Lightweight Database Indexing on GPUs
- CUDA-LLM: LLMs Can Write Efficient CUDA Kernels
- Engineering Supercomputing Platforms for Biomolecular Applications
- GCStack+GCScaler: Fast and Accurate GPU Performance Analyses Using Fine-Grained Stall Cycle Accounting and Interval Analysis
- chemtrain-deploy: A parallel and scalable framework for machine learning potentials in million-atom MD simulations
- A First Look at Bugs in LLM Inference Engines
- LiteGD: Lightweight and dynamic GPU Dispatching for Large-scale Heterogeneous Clusters
* * *