https://hgpu.org/?p=28720
Redco: A Lightweight Tool to Automate Distributed Training of LLMs on Any GPU/TPUs