Acceleration of tensor-product operations for high-order finite element methods
Department of Mathematics, Virginia Tech, McBryde Hall, 24061 Blacksburg, VA, USA
arXiv:1711.00903 [cs.MS], (2 Nov 2017)
@article{swirydowicz2017acceleration,
title={Acceleration of tensor-product operations for high-order finite element methods},
author={Swirydowicz, Kasia and Chalmers, Noel and Karakus, Ali and Warburton, Timothy},
year={2017},
month={nov},
archivePrefix={"arXiv"},
primaryClass={cs.MS}
}
This paper is devoted to GPU kernel optimization and performance analysis of three tensor-product operators arising in finite element methods. We provide a mathematical background to these operations and implementation details. Achieving close-to-the-peak performance for these operators requires extensive optimization because of the operators’ properties: low arithmetic intensity, tiered structure, and the need to store intermediate results inside the kernel. We give a guided overview of optimization strategies and we present a performance model that allows us to compare the efficacy of these optimizations against an empirically calibrated roofline.
November 7, 2017 by hgpu