https://hgpu.org/?p=1168
Treecode and fast multipole method for N-body simulation with CUDA