https://hgpu.org/?p=9168
A Many-core Machine Model for Designing Algorithms with Minimum Parallelism Overheads