Evaluating the impact of reordering unstructured meshes on the performance of finite volume GPU solvers
Depto. Lenguajes y Sistemas Informaticos, Universidad de Granada
12th International Conference on Computational and Mathematical Methods in Science and Engineering (CMMSE 2012), 2012
@article{de2012evaluating,
title={Evaluating the impact of reordering unstructured meshes on the performance of finite volume GPU solvers},
author={de la Asunci{‘o}n, M. and Mantas, J.M. and Castro, M.J.},
year={2012}
}
In this work, we study the impact of renumbering the cells of unstructured triangular finite volume meshes on the performance of CUDA implementations of several finite volume schemes to simulate two-layer shallow water systems. We have used several numerical schemes with different demands of computational power whose CUDA implementations exploit the texture and L1 cache units of the GPU multiprocessors. Two different reordering schemes based on reducing the bandwidth of the adjacency matrix for the volume mesh have been used. Several numerical experiments performed on a Fermi-class GPU show that enforcing an ordering which enhances the data locality can have a significant impact on the runtime, and this impact is higher when the numerical scheme is computationally expensive.
June 21, 2012 by hgpu