https://hgpu.org/?p=12959
A Performance Comparison of Sort and Scan Libraries for GPUs