https://hgpu.org/?p=1091
On sorting and load balancing on GPUs