https://hgpu.org/?p=7737
Large, Pruned or Continuous Space Language Models on a GPU for Statistical Machine Translation