Autotuning Stencil-Based Computations on GPUs

Azamat Mametjanov, Daniel Lowell, Ching-Chen Ma, Boyana Norris
Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL 60439
Preprint ANL/MCS-P2094-0512, 2012









Finite-difference, stencil-based discretization approaches are widely used in the solution of partial differential equations describing physical phenomena. Newton-Krylov iterative methods commonly used in stencil-based solutions generate matrices that exhibit diagonal sparsity patterns. To exploit these structures on modern GPUs, we extend the standard diagonal sparse matrix representation and define new matrix and vector data types in the PETSc parallel numerical toolkit. We create tunable CUDA implementations of the operations associated with these types after identifying a number of GPU-specific optimizations and tuning parameters for these operations. We discuss our implementation of GPU autotuning capabilities in the Orio framework and present performance results for several kernels, comparing them with vendor-tuned library implementations.
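
To make the diagonal sparsity pattern mentioned above concrete, here is a minimal sketch of a matrix-vector product over the standard diagonal (DIA) storage format. It is an illustration only: it shows the plain DIA layout, not the extended representation or the PETSc data types described in the paper, and it runs serially in Python rather than as a CUDA kernel. The function name `dia_matvec` and the row-indexed layout (`data[d][i]` holds `A[i][i + k]`) are assumptions chosen for this example.

```python
def dia_matvec(offsets, data, x):
    """Compute y = A @ x for a square matrix stored in diagonal (DIA) format.

    offsets[d] is the offset k of diagonal d (0 = main, k > 0 above, k < 0 below).
    data[d][i] holds A[i][i + k]; entries that fall outside the matrix are padding.
    """
    n = len(x)
    y = [0.0] * n
    for k, diag in zip(offsets, data):
        # Only rows i with 0 <= i + k < n contribute for this diagonal.
        lo = max(0, -k)
        hi = min(n, n - k)
        for i in range(lo, hi):
            y[i] += diag[i] * x[i + k]
    return y


# Three-point 1-D stencil (-1, 2, -1) on 4 grid points: three dense diagonals.
offsets = [-1, 0, 1]
data = [
    [0.0, -1.0, -1.0, -1.0],  # subdiagonal (row 0 is padding)
    [2.0, 2.0, 2.0, 2.0],     # main diagonal
    [-1.0, -1.0, -1.0, 0.0],  # superdiagonal (last row is padding)
]
x = [1.0, 2.0, 3.0, 4.0]
y = dia_matvec(offsets, data, x)
print(y)
```

Because each diagonal is stored contiguously, the inner loop reads `diag` and `x` with unit stride, which is the kind of regular access pattern that makes stencil-generated matrices attractive candidates for GPU execution.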
