https://hgpu.org/?p=18601
Workload-aware Automatic Parallelization for Multi-GPU DNN Training