https://hgpu.org/?p=9106
Synchronization and Ordering Semantics in Hybrid MPI+GPU Programming