https://hgpu.org/?p=24236
Nimble: Lightweight and Parallel GPU Task Scheduling for Deep Learning