A structured parallel periodic arnoldi shooting algorithm for RF-PSS analysis based on GPU platforms
University of California, Riverside, CA
Proceedings of the 16th Asia and South Pacific Design Automation Conference, ASPDAC ’11
@inproceedings{liu2011structured,
title={A structured parallel periodic arnoldi shooting algorithm for RF-PSS analysis based on GPU platforms},
author={Liu, X.X. and Yu, H. and Relles, J. and Tan, S.X.D.},
booktitle={Proceedings of the 16th Asia and South Pacific Design Automation Conference},
pages={13–18},
year={2011},
organization={IEEE Press}
}
The recent multi/many-core CPUs or GPUs have provided an ideal parallel computing platform to accelerate the time-consuming analysis of radio-frequency/millimeter-wave (RF/MM) integrated circuit (IC). This paper develops a structured shooting algorithm that can fully take advantage of parallelism in periodic steady state (PSS) analysis. Utilizing periodic structure of the state matrix of RF/MM-IC simulation, a cyclic-block-structured shooting-Newton method has been parallelized and mapped onto recent GPU platforms. We first present the formulation of the parallel cyclic-block-structured shooting-Newton algorithm, called periodic Arnoldi shooting method. Then we will present its parallel implementation details on GPU. Results from several industrial examples show that the structured parallel shooting-Newton method on Tesla’s GPU can lead to speedups of more than 20x compared to the state-of-the-art implicit GMRES methods under the same accuracy on the CPU.
August 27, 2011 by hgpu