https://hgpu.org/?p=2632
Weak execution ordering - exploiting iterative methods on many-core GPUs