Efficient Motion Estimation and Mode Decision Algorithms for Advanced Video Coding
Total Page:16
File Type:pdf, Size:1020Kb
University of Windsor Scholarship at UWindsor Electronic Theses and Dissertations Theses, Dissertations, and Major Papers 2011 Efficient Motion Estimation and Mode Decision Algorithms for Advanced Video Coding Mohammed Golam Sarwer University of Windsor Follow this and additional works at: https://scholar.uwindsor.ca/etd Recommended Citation Sarwer, Mohammed Golam, "Efficient Motion Estimation and Mode Decision Algorithms for Advanced Video Coding" (2011). Electronic Theses and Dissertations. 439. https://scholar.uwindsor.ca/etd/439 This online database contains the full-text of PhD dissertations and Masters’ theses of University of Windsor students from 1954 forward. These documents are made available for personal study and research purposes only, in accordance with the Canadian Copyright Act and the Creative Commons license—CC BY-NC-ND (Attribution, Non-Commercial, No Derivative Works). Under this license, works must always be attributed to the copyright holder (original author), cannot be used for any commercial purposes, and may not be altered. Any other use would require the permission of the copyright holder. Students may inquire about withdrawing their dissertation and/or thesis from this database. For additional inquiries, please contact the repository administrator via email ([email protected]) or by telephone at 519-253-3000ext. 3208. Efficient Motion Estimation and Mode Decision Algorithms for Advanced Video Coding by Mohammed Golam Sarwer A Dissertation Submitted to the Faculty of Graduate Studies through the Department of Electrical and Computer Engineering in Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy at the University of Windsor Windsor, Ontario, Canada 2011 © 2011, Mohammed Golam Sarwer All Rights Reserved. No part of this document may be reproduced, stored or otherwise retained in a retrieval system or transmitted in any form, on any medium by any means without prior written permission of the author. Declaration of Co-Authorship / Previous Publication I. Co-Authorship Declaration I hereby declare that this thesis incorporates material that is result of joint research, as follows: This thesis also incorporates the outcome of a joint research undertaken by me under the supervision of Professor Dr. Jonathan Wu. The collaboration is covered in Chapter 3, 4, 5, 6, and 7 of the thesis. In all cases, the key ideas, primary contributions, experimental designs, data analysis and interpretation, were performed by the author, and the contribution of co-author was primarily through the provision of valuable suggestions and helping in comprehensive analysis of the simulation results for publication. I am aware of the University of Windsor Senate Policy on Authorship and I certify that I have properly acknowledged the contribution of other researchers to my thesis, and have obtained written permission from each of the co-author(s) to include the above material(s) in my thesis. I certify that, with the above qualification, this thesis, and the research to which it refers, is the product of my own work. II. Declaration of Previous Publication This thesis includes 16 original papers that have been previously published/submitted for publication in peer reviewed journals and conferences, as follows: Thesis Publication title/full citation Publication Chapter status* Mohammed Golam Sarwer, and Q.M. Jonathan Wu published “Efficient Two Step Edge based Partial Distortion Search for Fast Block Motion Estimation” IEEE Transactions on Consumer Electronics, Vol. 55, No. 4, Nov. 2009, pp. 2154- 2162. Mohammed Golam Sarwer, and Q.M. Jonathan Wu, “Fast published Block Motion Estimation by edge based partial distortion search”, Proceedings of the IEEE International Conference on Image Processing 2009 (ICIP-2009), Cairo, Egypt, Chapter 3 November 7-10, 2009, pp. 1573 - 1576. Mohammed Golam Sarwer, and Q.M. Jonathan Wu, published “Efficient Partial Distortion Search Algorithm for Block based Motion Estimation”, Proceedings of the IEEE CCECE 2009, St John’s, NL, pp. 890-893. Mohammed Golam Sarwer, and Q.M. Jonathan Wu published “Adaptive Variable Block-Size Early motion estimation iv termination algorithm for H.264/AVC Video Coding Standard.” IEEE Transaction on Circuit and System for Video Technology, volume 19, number 8, August 2009, pp 1196-1201. Mohammed Golam Sarwer, Thanh Minh Nguyen and Q.M. published Jonathan Wu, “Fast Motion estimation of H.264/AVC by adaptive early termination”, Proceedings of the 10th IASTED International Conference SIP 2008, HI, USA, pp. Chapter 4 140-145. Mohammed Golam Sarwer, and Q.M. Jonathan Wu, published “Region Based Searching for Early Terminated Motion Estimation Algorithm of H.264/AVC Video Coding Standard”, Proceedings of the IEEE CCECE 2009, St John’s, NL, pp. 468-471. Mohammed Golam Sarwer, and Q.M. Jonathan Wu, " An published Efficient search range Decision Algorithm for Motion Estimation of H.264/AVC, " International Journal of Circuits, Systems and Signal Processing, vol. 3, issue 4, pp. 173-180, 2009. Mohammed Golam Sarwer, and Q.M. Jonathan Wu, published Chapter 5 “Adaptive Search Area Selection of Variable Block-Size Motion Estimation of H.264/AVC Video Coding Standard”, IEEE International Symposium on Multimedia ISM 2009, San Diego, California, USA, December 14-16, 2009, pp. 100-105. Mohammed Golam Sarwer, and Q.M. Jonathan Wu In Press “Improved Intra Prediction of H.264/AVC” accepted in the book "Video Coding", ISBN 978-953-7619-X-X, IN-TECH publisher. Mohammed Golam Sarwer, and Q.M. Jonathan Wu, published “Enhanced Intra Coding of H.264/AVC Advanced Video Coding Standard with Adaptive Number of Modes”, International Conferences on Active Media Technology (AMT 2010), Toronto, Canada, LNCS 6335, pp. 361–372, August 2010. Mohammed Golam Sarwer, and Q.M. Jonathan Wu, “A published Novel Bit Rate Reduction Method of H.264/AVC Intra Chapter 6 Coding”, International Congress on Image and Signal Processing (CISP'10), 16-18 October 2010, Yantai, China, pp. 24-28. Mohammed Golam Sarwer, and Q. M. Jonathan Wu, " Accepted Improved DC Prediction for H.264/AVC intra Coding " 2011 International Conference on Communication and Electronics Information - ICCEI 2011, Haikou, China. Mohammed Golam Sarwer, and Q.M. Jonathan Wu Submitted “Performance Improvement of Intra Coding in H.264/AVC v Advanced Video Coding Standard”. Submitted to Journal of Visual Communication and Image Representation. Chapter 7 Mohammed Golam Sarwer, and Q.M. Jonathan Wu Submitted “Enhanced SATD based Cost Function for Mode Selection of H.264/AVC Intra Coding”. Submitted to Springer Journal of Signal, Image and Video Processing. Mohammed Golam Sarwer, and Q.M. Jonathan Wu Accepted “Enhanced low complex cost function of H.264/AVC intra mode decision”. International Conference on Multimedia and Signal Processing (CMSP'11), China, Mohammed Golam Sarwer, Q.M. Jonathan Wu, X. P Zhang Submitted “Efficient rate-distortion optimization of H.264/AVC intra coder”. Submitted to International Conference on Image Proceeding, 2011 I certify that I have obtained permission from the copyright owner(s) to include the above published material(s) in my thesis. I certify that the above material describes work completed during my registration as graduate student at the University of Windsor. I declare that, to the best of my knowledge, my thesis does not infringe upon anyone’s copyright nor violate any proprietary rights and that any ideas, techniques, quotations, or any other material from the work of other people included in my thesis, published or otherwise, are fully acknowledged in accordance with the standard referencing practices. Furthermore, to the extent that I have included copyrighted material that surpasses the bounds of fair dealing within the meaning of the Canada Copyright Act, I certify that I have obtained a written permission from the copyright owner(s) to include such material(s) in my thesis. I declare that this is a true copy of my thesis, including any final revisions, as approved by my thesis committee and the Graduate Studies office, and that this thesis has not been submitted for a higher degree to any other University or Institution. vi ABSTRACT H.264/AVC video compression standard achieved significant improvements in coding efficiency, but the computational complexity of the H.264/AVC encoder is drastically high. The main complexity of encoder comes from variable block size motion estimation (ME) and rate-distortion optimized (RDO) mode decision methods. This dissertation proposes three different methods to reduce computation of motion estimation. Firstly, the computation of each distortion measure is reduced by proposing a novel two step edge based partial distortion search (TS-EPDS) algorithm. In this algorithm, the entire macroblock is divided into different sub-blocks and the calculation order of partial distortion is determined based on the edge strength of the sub-blocks. Secondly, we have developed an early termination algorithm that features an adaptive threshold based on the statistical characteristics of rate-distortion (RD) cost regarding current block and previously processed blocks and modes. Thirdly, this dissertation presents a novel adaptive search area selection method by utilizing the information of the previously computed motion vector differences (MVDs). In H.264/AVC intra coding, DC mode is used to predict regions with no unified direction and the predicted pixel values are same and thus smooth varying regions are not well de-correlated. This dissertation proposes