https://hgpu.org/?p=6651
Implementation of a Parallel Tree Method on a GPU