Multiprocessing Acceleration of H.264/AVC Motion Estimation Full Search Algorithm under CUDA Architecture
Federal University of Rio Grande do Sul (UFRGS), Porto Alegre, Brazil
26th South Symposium on Microelectronics (SIM 2011), 2011
@article{monteiro2011multiprocessing,
title={Multiprocessing Acceleration of H. 264/AVC Motion Estimation Full Search Algorithm under CUDA Architecture},
author={Monteiro, E.R. and Vizzotto, B.B. and Diniz, C.M. and Zatt, B. and Bampi, S.},
year={2011}
}
This work presents a parallel GPU-based solution for the Motion Estimation (ME) process in a videoencoding system. We propose a way to partition the steps of Full Search block matching algorithm in the CUDA architecture, and to compare the performance with a theoretical model and two implementations (sequential and parallel using OpenMP library). We obtained a O(n2/log2n) speed-up which fits the theoretical model considering different search areas. It represents up to 600x gain compared to the serial implementation, and 66x compared to the parallel OpenMP implementation.
December 6, 2011 by hgpu