Unstructured grid applications on GPU: performance analysis and improvement

Lizandro Solano-Quinde, Zhi Jian Wang, Brett Bode, Arun K. Somani
Iowa State University, Ames, IA
Proceedings of the Fourth Workshop on General Purpose Processing on Graphics Processing Units, GPGPU-4, 2011

@inproceedings{solano2011unstructured,
   title={Unstructured grid applications on GPU: performance analysis and improvement},
   author={Solano-Quinde, L. and Wang, Z.J. and Bode, B. and Somani, A.K.},
   booktitle={Proceedings of the Fourth Workshop on General Purpose Processing on Graphics Processing Units},
   pages={13},
   year={2011},
   organization={ACM}
}

The performance of applications running on GPUs is mainly affected by hardware occupancy and global memory latency. Scientific applications that rely on unstructured-grid analysis could benefit from the high performance of GPUs; however, their memory access patterns and algorithms limit the potential benefits. In this paper we analyze the algorithm for unstructured grid analysis in terms of hardware occupancy and memory access efficiency. In general, the algorithm can be divided into three stages (cell-oriented analysis, edge-oriented analysis, and information update), each of which presents a different memory access pattern. Based on this analysis, we modify the algorithm to make it suitable for GPUs. The proposed algorithm aims for high hardware occupancy and efficient global memory access. Finally, through implementation we show that our design achieves a speedup of up to 88 times over the sequential CPU version.
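
To make the three stages named in the abstract concrete, here is a minimal sequential sketch in Python. It is not the authors' code or their GPU design: the toy mesh, the diffusion-like flux, and all function names (`cell_stage`, `edge_stage`, `update_stage`, `step`) are assumptions chosen only to show why the stages touch memory differently. The cell and update stages read each cell's own entry in order, while the edge stage reaches cells indirectly through an edge list, which is the kind of irregular access that is harder to make efficient on a GPU.

```python
# Illustrative only: a toy three-stage pass over an unstructured grid.
# The mesh, the flux formula, and all names here are assumptions.

def cell_stage(u):
    # Cell-oriented: one entry per cell, visited in storage order.
    # Here it only sets up a zero residual; a real solver would do
    # per-cell work (e.g. reconstruct local quantities) at this point.
    return [0.0 for _ in u]

def edge_stage(u, edges, residual):
    # Edge-oriented: each edge looks up its two cells by index, so the
    # reads and writes follow the mesh connectivity, not storage order.
    for a, b in edges:
        flux = u[b] - u[a]      # toy diffusion-like flux across the edge
        residual[a] += flux     # cell a gains what flows in from b
        residual[b] -= flux     # cell b loses the same amount
    return residual

def update_stage(u, residual, dt):
    # Information update: again one entry per cell, in storage order.
    return [ui + dt * ri for ui, ri in zip(u, residual)]

def step(u, edges, dt):
    residual = cell_stage(u)
    residual = edge_stage(u, edges, residual)
    return update_stage(u, residual, dt)

# Four cells; cells 0, 1, 2 form a triangle and cell 3 hangs off cell 2.
edges = [(0, 1), (1, 2), (2, 0), (2, 3)]
u = [1.0, 0.0, 0.0, 0.0]
u_new = step(u, edges, 0.1)
```

Because every edge adds to one cell exactly what it subtracts from the other, the total of `u` is unchanged by a step. On a GPU, that same scatter in `edge_stage` is where concurrent writes to a shared cell would need care, which is one reason to treat the stages separately.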

* * *

HGPU group © 2010-2024 hgpu.org

All rights belong to the respective authors
