https://hgpu.org/?p=11379
A scheduling and runtime framework for a cluster of heterogeneous machines with multiple accelerators