https://hgpu.org/?p=14856
Free Launch: Optimizing GPU Dynamic Kernel Launches through Thread Reuse