https://hgpu.org/?p=3137
Accelerating H.264 inter prediction in a GPU by using CUDA