https://hgpu.org/?p=5965
Efficient Synchronization Primitives for GPUs