Parallelizing Alternating Direction Implicit Solver on GPUs

Zhangping Wei, Byunghyun Jang, Yaoxin Zhang, Yafei Jia
National Center for Computational Hydroscience & Engineering, The University of Mississippi, University, MS 38677 U.S.A.
Procedia Computer Science, Volume 18, Pages 389-398, 2013














Download Download (PDF)   View View   Source Source   



We present a parallel Alternating Direction Implicit (ADI) solver on GPUs. Our implementation significantly improves ex- isting implementations in two aspects. First, we address the scalability issue of existing Parallel Cyclic Reduction (PCR) implementations by eliminating their hardware resource constraints. As a result, our parallel ADI, which is based on PCR, no longer has the maximum domain size limitation. Second, we optimize inefficient data accesses of parallel ADI solver by leveraging hardware texture memory and matrix transpose techniques. These memory optimizations further make already parallelized ADI solver twice faster, achieving overall more than 100 times speedup over a highly optimized CPU version. We also present the analysis of numerical accuracy of the proposed parallel ADI solver.
No votes yet.
Please wait...

* * *

* * *

HGPU group © 2010-2020 hgpu.org

All rights belong to the respective authors

Contact us: