https://hgpu.org/?p=24932
How to Train BERT with an Academic Budget