Sorry, but there aren't any posts in the Downloads category yet.
These might be of interest, though:
- No More Shading Languages: Compiling C++ to Vulkan Shaders
- GCStack+GCScaler: Fast and Accurate GPU Performance Analyses Using Fine-Grained Stall Cycle Accounting and Interval Analysis
- Omniwise: Predicting GPU Kernels Performance with LLMs
- Survey of HPC in US Research Institutions
- WiLLM: An Open Wireless LLM Communication System
- Engineering Supercomputing Platforms for Biomolecular Applications
- A First Look at Bugs in LLM Inference Engines
- A CPU+FPGA OpenCL Heterogeneous Computing Platform for Multi-Kernel Pipeline
- A Novel Compiler Transformation for Fast Sparse Matrix Multiplication in GPUs
- LiteGD: Lightweight and dynamic GPU Dispatching for Large-scale Heterogeneous Clusters
- CUDA-LLM: LLMs Can Write Efficient CUDA Kernels
- HPCTransCompile: An AI Compiler Generated Dataset for High-Performance CUDA Transpilation and LLM Preliminary Exploration
- GPU Acceleration of SQL Analytics on Compressed Data
- Enabling Profile Guided Optimizations (PGO) for Graphics
- chemtrain-deploy: A parallel and scalable framework for machine learning potentials in million-atom MD simulations