https://hgpu.org/?p=26833
Lossless Acceleration for Seq2seq Generation with Aggressive Decoding