https://hgpu.org/?p=5935
Optimization strategies for parallel CPU and GPU implementations of a meshfree particle method