https://hgpu.org/?p=1531
SBLOCK: A Framework for Efficient Stencil-Based PDE Solvers on Multi-core Platforms