https://hgpu.org/?p=9637
Non-Uniformly Partitioned Block Convolution on Graphics Processing Units