https://hgpu.org/?p=1241
A Practical Quicksort Algorithm for Graphics Processors