https://hgpu.org/?p=13183
SiftCU: An Accelerated Cuda Based Implementation of SIFT