https://hgpu.org/?p=17271
Optimizing Memory Efficiency for Convolution Kernels on Kepler GPUs