https://hgpu.org/?p=27035
CPU-GPU Layer-Switched Low Latency CNN Inference