https://hgpu.org/?p=12528
Performance-efficient mechanisms for managing irregularity in throughput processors