GPU Auto-tuning Framework for Optimal Performance and Power Consumption

Sunbal Cheema, Gul N. Khan
Department of Electrical, Computer and Biomedical Engineering, Toronto Metropolitan University, Toronto ON, Canada
15th Workshop on General Purpose Processing Using GPU, Montreal, Canada, (GPGPU ’23), 2023


   title={GPU Auto-tuning Framework for Optimal Performance and Power Consumption},

   author={Cheema, Sunbal and Khan, Gul},

   booktitle={Proceedings of the 15th Workshop on General Purpose Processing Using GPU},




Download Download (PDF)   View View   Source Source   



An auto-tuning framework for GPU devices is presented for tuning application kernels of OpenCL. The GPU tuner employs multi-objective optimization methodology to improve the performance and power consumption of applications. It efficiently explores a user defined solution space comprising of possible tunable algorithmic and hardware counter variations through code transformations. The methodology targets GPU code tuning situations where performance and energy consumption are critical. The proposed framework is evaluated for 2D convolution kernels. It utilizes a non-dominated sorting Genetic Algorithm with hardware power sensor data for application code transformation through code rewrite and validation. Various algorithmic variations such as loop unrolling, caching, workgroup size and memory utilization are applied. The final pareto optimal configurations code utilized around 30% less power and 4% faster execution time. The analysis shows the convergence of optimization, and 45% improvement in standard deviation.
No votes yet.
Please wait...

* * *

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors

Contact us: