https://hgpu.org/?p=19121
Parallelizing Multiple Flow Accumulation Algorithm using CUDA and OpenACC