https://hgpu.org/?p=20024
Data Movement Optimization for High-Performance Computing