https://hgpu.org/?p=6078
Efficient Implementation of Optical Flow Algorithm Based on Directional Filters on a GPU Using CUDA