https://hgpu.org/?p=1801
GPU histogram computation