https://hgpu.org/?p=4182
High performance comparison-based sorting algorithm on many-core GPUs