https://hgpu.org/?p=5894
Towards scalar synchronization in SIMT architectures