Motion Vector Forecast and Mapping (MV-Fmap) Method for Entropy Coding Based Video Coders Julien Le Tanou, Jean-Marc Thiesse, Joël Jung, Marc Antonini
Total Page:16
File Type:pdf, Size:1020Kb
Motion Vector Forecast and Mapping (MV-FMap) Method for Entropy Coding based Video Coders Julien Le Tanou, Jean-Marc Thiesse, Joël Jung, Marc Antonini To cite this version: Julien Le Tanou, Jean-Marc Thiesse, Joël Jung, Marc Antonini. Motion Vector Forecast and Mapping (MV-FMap) Method for Entropy Coding based Video Coders. MMSP’10 2010 IEEE International Workshop on Multimedia Signal Processing, Oct 2010, Saint Malo, France. pp.206. hal-00531819 HAL Id: hal-00531819 https://hal.archives-ouvertes.fr/hal-00531819 Submitted on 3 Nov 2010 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Motion Vector Forecast and Mapping (MV-FMap) Method for Entropy Coding based Video Coders Julien Le Tanou #1, Jean-Marc Thiesse #2, Joël Jung #3, Marc Antonini ∗4 # Orange Labs 38 rue du G. Leclerc, 92794 Issy les Moulineaux, France 1 [email protected] {2 jeanmarc.thiesse,3 joelb.jung}@orange-ftgroup.com ∗ I3S Lab. University of Nice-Sophia Antipolis/CNRS 2000 route des Lucioles, 06903 Sophia Antipolis, France 4 [email protected] Abstract—Since the finalization of the H.264/AVC standard between motion vectors of neighboring frames and blocks, we and in order to meet the target set by both ITU-T and MPEG propose in this paper a method for motion vector coding based to define a new standard that reaches 50% bit rate reduction on a motion vector residuals forecast followed by an adaptive compared to H.264/AVC, many tools have efficiently improved the texture coding and the motion compensation accuracy. These mapping of residuals. This scheme is noted MV-FMap for improvements have resulted in increasing the proportion of bit Motion Vector Forecast and Mapping. rate allocated to motion information. Thus, the bit rate reduction The remaining of this article is organized as follows: we of this information becomes a key subject of research. This first introduce a state of the art of motion vector coding in paper proposes a method for motion vector coding based on an Section II, then we describe the proposed scheme in Section adaptive redistribution of motion vector residuals before entropy coding. Motion information is gathered to forecast a list of III. Finally, we present the experimental results in Section IV. motion vector residuals which are redistributed to unexpected II. STATE OF THE ART residuals of lower coding cost. Compared to H.264/AVC, this scheme provides systematic gain on tested sequences, and 2.3% A. H.264/AVC Motion Vector Coding in average, reaching up to 4.9% for a given sequence. The H.264/AVC reference codec is based on a competition I. INTRODUCTION between multiple Intra and Inter prediction coding modes, The H.264/AVC standard [1], which is the result of the joint through the calculation of a rate-distortion (RD) criterion [4]. work of the Video Coding Experts Group (VCEG) and Moving For Inter prediction, a block matching motion estimation is Picture Experts Group (MPEG), has achieved significant com- performed in order to exploit the temporal redundancies of pression gain over its predecessor, H.263 and MPEG-4 part 2. video sequence. A predictive coding of the motion vector is This comes from the improvement of existing tools and the then applied, with the motion vector residual r defined by introduction of new ones, especially: various Intra predictors, r = mv − mvp, where mv is the motion vector, mvp the variable block sizes, multiple reference frames and ¼ pel motion vector predictor defined by a median of the motion motion accuracy for motion compensation, deblocking filter vectors of the spatially neighboring blocks. The accuracy of and Context Adaptive Binary Arithmetic Coding (CABAC). the motion vector prediction is sensitive to irregular motion Today, new standardization activities in video coding are and remains a key feature of the motion vector coding. launched. VCEG and MPEG work collaboratively within the Finally, an entropy coding is independently applied to each Joint Collaborative Team on Video Coding (JCT-VC) and component of the residual of the optimally selected motion have issued a Call for Proposals with answers expected by vector. CABAC is currently the most powerful as it enables April 2010. The objective is a standard that reaches 50% to adapt, for each block, the coding table applied to residual bit rate reduction for the same subjective quality, with an vectors. However, this table is defined from a finite number allowed increase of the complexity by a factor 2 or 3. Several of probability models. These models are selected by a context improvements are already known and gathered in the JM KTA criterion which can be insufficient. Furthermore, although the (Key Technical Area) [2]. In particular, MV-Comp [3] effi- probabilities defined by these models are updated according to ciently acts on the motion coding. However, the coding cost of the occurrence statistics of each bit of the binarized residual, motion vectors remains high and its reduction is a major way this coding is essentially a solution based model and can be to improve the next standard. Based on several observations supplemented by a method more driven by a motion vector that characterize the movement, such as the strong correlation forecast in addition to these observed data. B. Recent Motion Vector Coding Improvements MMSP’10, October 4-6, 2010, Saint-Malo, France. Based on the coding scheme described above, several tools 978-1-4244-8112-5/10/$26.00 ©2010 IEEE. have been proposed to improve the efficiency of motion vector USB 978-1-4244-8111-8/10/$26.00 ©2010 IEEE 206 coding. In [3], a proportion of the motion information in the motion vector residuals is performed: the forecasted residuals total bit stream of nearly 35% is reported at low bit rate. To are distributed to residuals positions of lower coding cost. reduce the cost of this motion information, a method based on a spatio-temporal motion vector predictor competition (MV- Comp) is proposed. It takes advantage from the use of spatial and temporal redundancies in the motion vector fields, where the simple spatial median usually fails. Although an index transmission is required to signal the selected predictor, this coding scheme provides a very good compromise between coding efficiency and added complexity. On top of the MV-Comp framework, several contributions on the motion vectors prediction (MVP) have been recently made. In [5], the set of predictor candidates is first adaptively determined based on the characteristics of neighboring motion Figure 1. Proposed scheme overview. vectors, secondly the optimal predictor is chosen from this set to minimize the number of bits used to encode motion vector This mapping is performed during the motion estimation difference (MVD). An index is then optimally transmitted to and is computed at the decoder side: no side information is signal the selected predictor according to the rules defined in necessary. The three steps are explained in the following parts. [6] and a boundary-matching which is applied at the decoder side in order to reduce predictor signalization cost. A similar B. Step 1 : Construction of the Motion Information scheme is proposed in [7] without consuming any additional Ideally, we wish to know the list of the motion vectors se- bits to inform the choice of motion vector predictor to the lected for the prediction of the current frame. This infomation decoder. It slightly improves MV-Comp yet suffers from a de- is of course not available: we consequently gather observed crease of the motion vector prediction quality. However, none motion information. We define Dmv and Dr the two definition of these methods manage to significantly improve upon MV- domains, respectively, of motion vectors and of residuals, such 2 2 Comp and to tackle its inherent signalization cost problem. as Dmv ⊂ Z ;Dr ⊂ Z . In [8], a new Inter coding mode, using quantized motion A histogram (h) of the motion vectors previously encoded is vectors, is introduced. Competing with other modes, it takes computed for each block. h gives the frequency of each motion care of better distributing bits ressources between texture and vector, mv = (mvx; mvy) 2 Dmv. It is computed from the n residual motion vectors. However, this scheme is limited by previously coded frames and also the previously coded blocks the optimal choice of the quantization step used for the motion of the current frame. For the ith block, the frequency of each vector coding and its signalization. motion vector is noted hi, hi(mv) ≥ 0. The related frame and Finally, [9] addresses both the MVP and the MVD coding. block positions of each motion vector are also stored. The predictor is firstly adaptively coded with additional bits h is initialized as follows: based on the distribution of reference vectors. Secondly, an appropriate code table for the MVD is adaptively selected out h0(mv) = 0; 8mv 2 Dmv: (1) of a couple of predefined tables. The latter approach seems The h update for the i + 1th block is performed as follows: promising as it tries to adapt the table for small or large MVDs. i i hi+1(mv ) = hi(mv ) + 1; (2) III. PROPOSED SCHEME: MV-FMAP where mvi is the motion vector of the ith block. A. Scheme Overview For each current block to encode, a motion vector residual The main idea of the scheme comes on the one hand from r = (rx; ry) 2 Dr; is associated to each vector mv of h: the observation of the high correlation of the motion vectors between successive frames and blocks, and on the other hand r = mv − mvpi; (3) from the choice to correct a posteriori the error prediction of where mvpi is the motion vector predictor of the ith block.