https://hgpu.org/?p=4803
An auto-tuning framework for parallel multicore stencil computations