https://hgpu.org/?p=18388
Block-Size Independence for GPU Programs