https://hgpu.org/?p=24880
Efficient Large-Scale Language Model Training on GPU Clusters