H.264/AVC motion estimation implementation on Compute Unified Device Architecture (CUDA)

Wei-Nien Chen, Hsueh-Ming Hang
Department of Electronics Engineering, National Chiao-Tung University, Taiwan
2008 IEEE International Conference on Multimedia and Expo (2008), pp. 697-700

Owing to the rapid growth in graphics processing unit (GPU) processing capability, using the GPU as a coprocessor to assist the central processing unit (CPU) with massive data computation has become essential. In this paper, we present an efficient block-level parallel algorithm for variable block size motion estimation (ME) in H.264/AVC with fractional pixel refinement on the Compute Unified Device Architecture (CUDA) platform, developed by NVIDIA in 2007. CUDA enhances the programmability and flexibility of general-purpose computation on the GPU. We decompose the H.264 ME algorithm into 5 steps so that we can achieve highly parallel computation with a low external memory transfer rate. Experimental results show that, with the assistance of the GPU, processing is 12 times faster than using the CPU alone.
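To make the idea of block-level motion estimation concrete, the following is a minimal CPU-side sketch of integer-pel block matching using the sum of absolute differences (SAD). It is an illustration only and not the authors' five-step decomposition, which the abstract does not enumerate; it also omits variable block sizes and fractional pixel refinement. The function names `sad` and `full_search`, the toy 8x8 frames, and the search range are all assumptions made for this example.

```python
def sad(cur, ref, bx, by, dx, dy, n):
    """Sum of absolute differences between the n x n block of `cur` at (bx, by)
    and the block of `ref` displaced by (dx, dy). Hypothetical helper."""
    total = 0
    for y in range(n):
        for x in range(n):
            total += abs(cur[by + y][bx + x] - ref[by + dy + y][bx + dx + x])
    return total


def full_search(cur, ref, bx, by, n, search_range):
    """Return (best_sad, (dx, dy)) for one block via exhaustive integer-pel search.
    Generic full search, assumed here for illustration."""
    height, width = len(ref), len(ref[0])
    best = None
    for dy in range(-search_range, search_range + 1):
        for dx in range(-search_range, search_range + 1):
            # Skip candidates that fall outside the reference frame.
            if not (0 <= bx + dx and bx + dx + n <= width
                    and 0 <= by + dy and by + dy + n <= height):
                continue
            cost = sad(cur, ref, bx, by, dx, dy, n)
            if best is None or cost < best[0]:
                best = (cost, (dx, dy))
    return best


# Toy 8x8 frames: `cur` is `ref` shifted by one pixel in x and y,
# so the 4x4 block at (2, 2) matches the reference at offset (1, 1).
ref = [[(7 * x + 13 * y) % 256 for x in range(8)] for y in range(8)]
cur = [[ref[min(y + 1, 7)][min(x + 1, 7)] for x in range(8)] for y in range(8)]
result = full_search(cur, ref, bx=2, by=2, n=4, search_range=2)
```

Each call to `full_search` reads the frames but writes nothing shared, so the searches for different blocks are independent of one another. That independence is the property a block-level parallel mapping onto a GPU can exploit, for example by assigning blocks or candidate displacements to separate threads.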