https://hgpu.org/?p=8004
Parallelizing flow-accumulation calculations on graphics processing units - From iterative DEM preprocessing algorithm to recursive multiple-flow-direction algorithm