https://hgpu.org/?p=25765
Improving Performance and Energy Efficiency of GPUs through Locality Analysis