https://hgpu.org/?p=6218
Parallel Implementation of Niblack's Binarization Approach on CUDA