https://hgpu.org/?p=18432
A study of integer sorting on multicores