https://hgpu.org/?p=24941
Performance analysis and optimization of highly diverging algorithms on GPUs