https://hgpu.org/?p=27940
EPSILOD: efficient parallel skeleton for generic iterative stencil computations in distributed GPUs