Parallel Deblocking Filtering in MPEG-4 AVC/H.264 on Massively-Parallel Architectures
Multimedia Lab, ELIS, Ghent University IBBT, Belgium
IEEE Transactions on Circuits and Systems for Video Technology (2011)
@article{pieters2010parallel,
title={Parallel Deblocking Filtering in MPEG-4 AVC/H. 264 on Massively-Parallel Architectures},
author={Pieters, B. and Hollemeersch, C.F. and De Cock, J. and Lambert, P. and De Neve, W. and Van de Walle, R.},
journal={IEEE Transactions on Circuits and Systems for Video Technology},
pages={1–7},
year={2010}
}
The deblocking filter in the MPEG-4 AVC/H.264 standard is computationally complex because of its high content adaptivity, resulting in a significant number of data dependencies. These data dependencies interfere with parallel filtering of multiple macroblocks on massively-parallel architectures. In this paper, we introduce a novel macroblock partitioning scheme for concurrent deblocking in the MPEG-4 AVC/H.264 standard, based on our idea of Deblocking Filter Independency, a corrected version of the Limited Error Propagation Effect proposed in the literature. Our proposed scheme enables concurrent macroblock deblocking of luma samples with limited synchronization effort, independently of slice configuration, and is compliant with the MPEG-4 H.264/AVC standard. We implemented the method on the massively-parallel architecture of the Graphics Processing Unit (GPU). Experimental results show that our GPU implementation achieves faster-than real-time deblocking at 1309 frames per second for 1080p video pictures. Both software-based deblocking filters and state-of-the-art GPU-enabled algorithms are outperformed in terms of speed by factors up to 10.2 and 19.5 respectively for 1080p video pictures.
January 14, 2011 by hgpu