https://hgpu.org/?p=8287
Exploiting Concurrent GPU Operations for Efficient Work Stealing on Multi-GPUs