https://hgpu.org/?p=24324
Exploiting BSP Abstractions for Compiler Based Optimizations of GPU Applications on multi-GPU Systems