https://hgpu.org/?p=28638
Novel Parallelization Strategies for High-Performance DNN Training on HPC Systems