https://hgpu.org/?p=10762
Understanding the Costs of Many-Task Computing Workloads on Intel Xeon Phi Coprocessors