https://hgpu.org/?p=5062
Cache Miss Analysis for GPU Programs Based on Stack Distance Profile