https://hgpu.org/?p=6886
A Hybrid Circular Queue Method for Iterative Stencil Computations on GPUs