https://hgpu.org/?p=5759
An Execution Model and Runtime For Heterogeneous Many-Core Systems