https://hgpu.org/?p=19141
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations