https://hgpu.org/?p=12881
A massively parallel algorithm for constructing the BWT of large string sets