Performing DCT8x8 Computation on GPU Using NVIDIA CUDA Technology
Jawaharlal Nehru Krrishi Vishwavidyalaya Jabalpur
International Journal of Engineering Research and Application, Vol. 3, Issue 5, pp.225-228, 2013
@article{srivastava2013performing,
title={Performing DCT8x8 Computation on GPU Using NVIDIA CUDA Technology},
author={Srivastava, Jagdamb Behari and Singh, RB and Jain, Jitendra},
year={2013}
}
In this paper, we have proposed sequential and parallel Discrete Cosine Transform (DCT) in compute unified device architecture (CUDA) libraries. The introduction of programmable pipeline in the graphics processing units (GPU) has enabled configurability. GPU which is available in every computer has a tremendous feat of highly parallel SIMD processing, but its capability is often under-utilized. The two-dimensional variation of the transform that operates on 8×8 blocks (DCT8x8) is widely used in image and video coding because it exhibits high signal de-correlation rates and can be easily implemented on the majority of contemporary computing architectures. Performing DCT8x8 computation on GPU using NVIDIA CUDA technology gives significant performance boost even compared to a modern CPU.
September 21, 2013 by hgpu