Auto-Generation and Auto-Tuning of 3D Stencil Codes on GPU Clusters

Yongpeng Zhang, Frank Mueller
North Carolina State University, Raleigh, NC 27695-7534
International Symposium on Code Generation and Optimization (CGO’2012), 2012


   title={Auto-Generation and Auto-Tuning of 3D Stencil Codes on GPU Clusters},

   author={Zhang, Y. and Mueller, F.},



Download Download (PDF)   View View   Source Source   



This paper develops and evaluates search and optimization techniques for auto-tuning 3D stencil (nearest-neighbor) computations on GPUs. Observations indicate that parameter tuning is necessary for heterogeneous GPUs to achieve optimal performance with respect to a search space. Our proposed framework takes a most concise specification of stencil behavior from the user as a single formula, auto-generates tunable code from it, systematically searches for the best configuration and generates the code with optimal parameter configurations for different GPUs. This auto-tuning approach guarantees adaptive performance for different generations of GPUs while greatly enhancing programmer productivity. Experimental results show that the delivered floating point performance is very close to previous handcrafted work and outperforms other auto-tuned stencil codes by a large margin.
No votes yet.
Please wait...

* * *

* * *

HGPU group © 2010-2021 hgpu.org

All rights belong to the respective authors

Contact us: