https://hgpu.org/?p=24348
I/O Lower Bounds for Auto-tuning of Convolutions in CNNs