https://hgpu.org/?p=27591
Fast convolution kernels on pascal GPU with high memory efficiency