https://hgpu.org/?p=10565
Performing DCT8x8 Computation on GPU Using NVIDIA CUDA Technology