https://hgpu.org/?p=15041
A Study of Parallel Sorting Algorithms Using CUDA and OpenMP