Scaling GPU-Accelerated Databases beyond GPU Memory Size
Microsoft
Proceedings of the VLDB Endowment, Volume 18, No. 11, 2025
@article{wei2025scaling,
title={Scaling GPU-Accelerated Databases beyond GPU Memory Size},
author={Wei, Yinan Li Bailu Ding Ziyun and Al-Ghosien, Lukas M Maas Momin and Blanas, Spyros and Bruno, Nicolas and Johannes Gehrke,CCMICPKR},
year={2025}
}
There has been considerable interest in leveraging GPUs’ computational power and high memory bandwidth for analytical database workloads. However, their limited memory capacity remains a fundamental limitation for databases whose sizes far exceed the GPU memory size. This challenge is exacerbated by the slow PCIe data transfer speed, that creates a bottleneck in overall system performance. In this work, we introduce a hybrid CPU-GPU query processing strategy that leverages the distinct strengths of CPU and GPU to alleviate the data transfer bottleneck. Our approach performs highly efficient data filtering on the CPU, which substantially reduces the volume of data transferred to the GPU via PCIe, and offloads compute-intensive operators such as joins to the GPU for further processing. Our evaluation on the TPC-H benchmark at scale factors up to 1000 (1TB), using a single A100 GPU with 80GB memory, demonstrates that our approach can effectively handle datasets significantly larger than the GPU memory size. Moreover, it substantially outperforms a state-of-the-art CPU-only database system in both performance and cost-effectiveness.
August 31, 2025 by hgpu
Your response
You must be logged in to post a comment.