https://hgpu.org/?p=6135
Exploring Fine-Grained Task-based Execution on Multi-GPU Systems