https://hgpu.org/?p=1250
Optimization of linked list prefix computations on multithreaded GPUs using CUDA