https://hgpu.org/?p=923
Fast parallel GPU-sorting using a hybrid algorithm