https://hgpu.org/?p=15901
Performance Evaluation of Parallel Count Sort using GPU Computing with CUDA