Sorry, but there aren't any posts in the Downloads category yet.
These might be of interest, though:
- Enhancing Deployment-Time Predictive Model Robustness for Code Analysis and Optimization
- Finding Missed Code Size Optimizations in Compilers using LLMs
- A comparison of HPC-based quantum computing simulators using Quantum Volume
- Performant Automatic BLAS Offloading on Unified Memory Architecture with OpenMP First-Touch Style Data Movement
- Debunking the CUDA Myth Towards GPU-based AI Systems
- Scalable Access-Pattern Aware I/O Acceleration and Multi-Tiered Data Management for HPC and AI Workloads
- Development of a new framework for high performance volunteer computing
- A survey on FPGA-based accelerator for ML models
- TorchQC – A framework for efficiently integrating machine and deep learning methods in quantum dynamics and control
- Asynchronous-Many-Task Systems: Challenges and Opportunities – Scaling an AMR Astrophysics Code on Exascale machines using Kokkos and HPX
- CPPJoules: An Energy Measurement Tool for C++
- Utilizing Tensor Cores in Futhark
- Reproducible Study and Performance Analysis of GPU Programming Paradigms: OpenACC vs. CUDA in Key Linear Algebra Computations
- HPC-Coder-V2: Studying Code LLMs Across Low-Resource Parallel Languages
- Accelerating Sparse Graph Neural Networks with Tensor Core Optimization