https://hgpu.org/?p=7384
An Efficient Block Cipher Implementation on Many-Core Graphics Processing Units