https://hgpu.org/?p=16560
A parallel pattern for iterative stencil + reduce