https://hgpu.org/?p=7633
Tuning a Finite Difference Computation for Parallel Vector Processors