https://hgpu.org/?p=12121
cuBLASTP: Fine-Grained Parallelization of Protein Sequence Search on a GPU