https://hgpu.org/?p=18920
Leader Stochastic Gradient Descent for Distributed Training of Deep Learning Models