https://hgpu.org/?p=15277
A Case for Work-stealing on FPGAs with OpenCL Atomics