Signal Processing for Improved MPEG-Based Communication Systems
Citation for published version (APA):
Eerenberg, O. (2015). Signal processing for improved MPEG-based communication systems. Technische Universiteit Eindhoven.

Document status and date:
Published: 09/12/2015

Document Version:
Publisher's PDF, also known as Version of Record (includes final page, issue and volume numbers)

Please check the document version of this publication:
• A submitted manuscript is the version of the article upon submission and before peer-review. There can be important differences between the submitted version and the official published version of record. People interested in the research are advised to contact the author for the final version of the publication, or visit the DOI to the publisher's website.
• The final author version and the galley proof are versions of the publication after peer review.
• The final published version features the final layout of the paper including the volume, issue and page numbers.

General rights
Copyright and moral rights for the publications made accessible in the public portal are retained by the authors and/or other copyright owners and it is a condition of accessing publications that users recognise and abide by the legal requirements associated with these rights.
• Users may download and print one copy of any publication from the public portal for the purpose of private study or research.
• You may not further distribute the material or use it for any profit-making activity or commercial gain.
• You may freely distribute the URL identifying the publication in the public portal.
If the publication is distributed under the terms of Article 25fa of the Dutch Copyright Act, indicated by the “Taverne” license above, please follow the link below for the End User Agreement: www.tue.nl/taverne

Take down policy
If you believe that this document breaches copyright please contact us at: [email protected] providing details and we will investigate your claim.

Download date: 04. Oct. 2021

Signal Processing for Improved MPEG-based Communication Systems

DISSERTATION

submitted to obtain the degree of doctor at the Technische Universiteit Eindhoven, on the authority of the rector magnificus prof.dr.ir. F.P.T. Baaijens, to be defended in public before a committee appointed by the College voor Promoties (Doctorate Board) on Wednesday 9 December 2015 at 16:00

by

Onno Eerenberg

born in Zwolle

This dissertation has been approved by the promotor, and the composition of the doctoral committee is as follows:

chairman: prof.dr.ir. J.W.M. Bergmans
1st promotor: prof.dr.ir. P.H.N. de With
co-promotor(s): prof.dr. R.M. Aarts
members: prof.dr. C. Hentschel (Brandenb. Univ. of Technol. Cottbus, Germany)
prof.dr. S. Sherratt (Univ. of Reading, United Kingdom)
prof.dr.ir. J.J. Lukkien
advisor(s): dr. P. Hofman (Philips IP&S)

The research or design described in this dissertation has been carried out in accordance with the TU/e Gedragscode Wetenschapsbeoefening (TU/e Code of Scientific Conduct).

Nulla tenaci invia est via: for the tenacious, no road is impassable (Spyker Automobielen N.V.).

CIP-DATA LIBRARY TECHNISCHE UNIVERSITEIT EINDHOVEN

Onno Eerenberg
Signal Processing for Improved MPEG-based Communication Systems / by Onno Eerenberg. - Eindhoven : Technische Universiteit Eindhoven, 2015.
A catalogue record is available from the Eindhoven University of Technology Library
ISBN: 978-90-386-3979-6
NUR: 959
Keywords: video compression / personal video recording / MPEG-2 / H.264/MPEG4-AVC / trick play / DVB-H / video coding artifact detection.
Subject headings: conventional video navigation / advanced video navigation / MPEG-2-compliant video navigation / DVB-H link layer / mosquito and ringing artifact location detection.

Cover design: D.J.M. Frishert.
Printed by: Dereumaux.

Copyright © 2015 by O. Eerenberg
All rights reserved. No part of this material may be reproduced or transmitted in any form or by any means, electronic, mechanical, including photocopying, recording or by any information storage and retrieval system, without the prior permission of the copyright owner.

Summary

Signal Processing for Improved MPEG-based Communication Systems

This thesis describes improvements for MPEG-based consumer communication systems and focuses on three areas. The first area addresses intra-program video navigation for disk-based storage media, where three forms of video navigation are investigated enabling conventional as well as more advanced forms of video navigation. The second area presents an efficient and robust data link layer for DVB-H, a standard targeting battery-powered mobile television reception. The improved link layer results in a higher robustness and efficiency. The third area addresses picture quality for digital-television reception, presenting two detection systems for locating visual-coding artifact regions, which are potentially contaminated with either mosquito noise or ringing. The location information is used to attenuate the detected coding noise. The emphasis of the presented work is on embedded system solutions to be integrated into existing consumer platforms. The three areas are briefly summarized below.

In this thesis, three navigation techniques are presented for disk-based storage systems. The first navigation technique equals that of full-frame fast-search and slow-motion playback and is suitable for a push-based architecture, enabling deployment in a networked client-server system setup.
Networked full-frame fast-search video navigation is based on re-using intra-coded MPEG-compressed normal-play video pictures. The proposed solution divides the signal processing for navigation between the recording and navigation-playback operation modes. It is based on finding characteristic point information during recording, revealing the storage locations of intra-coded pictures, which are then re-used for generation of a fast-search navigation sequence. Furthermore, in order to adapt the frame rate, refresh-rate, bit rate and playback speed, the solution employs repetition pictures, which repeat normal-play reference pictures. On the basis of field-based repetition pictures, rendering control at field level is obtained, enabling efficient removal of field-based video information (interlace kill), thereby avoiding motion judder during navigation and enabling the navigation method to be applied to both progressive and interlaced video formats. Slow-motion video navigation is implemented in a similar fashion. When applying repetition pictures to control the Quality of Experience (QoE) during navigation, the refresh-rate should not drop below 1/3 of the frame rate; otherwise a slide-show effect occurs, whereby the viewer loses the fast-search navigation experience. For a typical video broadcast at SD resolution and 4 Mbit/s, recording requires a DSP cycle frequency of 5 MHz, while full-frame fast-search video playback with speed-up factor 12 requires 22 MHz and slow-motion playback at speed 0.5 requires 5 MHz. Both fast-search and slow-motion video navigation can be operated with a reduced refresh-rate and thus a considerably lower cycle frequency.

Based on drawbacks associated with full-resolution fast-search trick play, a video navigation technique is presented based on a hierarchical mosaic screen representation.
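The interplay between speed-up factor, repetition pictures and refresh-rate in the full-frame fast-search technique can be illustrated with a small sketch. It is not the thesis implementation: the function name, the 25 Hz frame rate and the nearest-picture selection rule are assumptions made for illustration, while the 2 Hz intra-picture rate is the figure quoted elsewhere in this summary and the limit of two repetition pictures per intra picture follows from the 1/3 refresh-rate guideline.

```python
import math
from fractions import Fraction


def fast_search_schedule(speed, repeats, n_out, frame_rate=25, intra_rate=2):
    """Sketch of a fast-search output sequence built from intra pictures.

    Returns n_out entries: ("I", k) re-uses the k-th recorded intra picture,
    ("R", k) is a repetition picture that repeats intra picture k.
    speed is an integer speed-up factor; frame_rate (Hz) is an assumed
    value; intra_rate (Hz) is the 2 Hz figure quoted in the summary.
    """
    hold = repeats + 1  # frame periods during which one intra picture is shown
    if repeats < 0 or hold > 3:
        # Guideline from the text: refresh-rate >= frame rate / 3.
        raise ValueError("refresh-rate would drop below 1/3 of the frame rate")
    # Normal-play time covered per refreshed picture, in intra-picture periods.
    step = Fraction(speed * hold * intra_rate, frame_rate)
    out, pos = [], Fraction(0)
    while len(out) < n_out:
        k = math.floor(pos + Fraction(1, 2))  # nearest recorded intra picture
        out.append(("I", k))
        out.extend(("R", k) for _ in range(repeats))
        pos += step
    return out[:n_out]


print(fast_search_schedule(12, repeats=2, n_out=6))
# [('I', 0), ('R', 0), ('R', 0), ('I', 3), ('R', 3), ('R', 3)]
```

With speed-up factor 12 and two repetition pictures per intra picture, roughly every third recorded intra picture is shown, and under the assumed 25 Hz frame rate the refresh-rate is 25/3 Hz, the lowest value the guideline allows.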
This screen is composed of re-used MPEG-compressed subpictures, avoiding transcoding of pictorial data. Hierarchical mosaic screens enable instant overview of video information associated with a large temporal interval, thereby eliminating the individual picture refresh-rate from the final video navigation rendering. Each subpicture is coded at a fixed bit cost, thereby simplifying the construction of the final mosaic screen in the compressed domain. A fixed-cost subpicture is achieved by dividing each subpicture into a set of “mini slices”, which are also encoded at a fixed bit cost. Furthermore, when subpictures use P-type coding syntax, new mosaic screens can be constructed using predictive coding, based on re-used subpictures available at the MPEG decoder. Both aspects clearly reduce the complexity of the implementation. The continuous derivation of subpictures for mosaic screens requires only a small fraction of the computational complexity, because intra-coded normal-play pictures appear at a rate of only 2 Hz, which results in a processing load of 0.3 Hz when a scene duration of 3 seconds is used. It was found that this system can be implemented with the same architecture as the first navigation solution, because the processing required for the construction of a mosaic screen closely resembles the fast-search full-frame navigation solution. We therefore expect that the involved playback processing for mosaic-screen navigation will show a similar throughput and DSP cycle load.

Finally, an audio-enhanced dual-window video navigation technique is presented, combining normal-play audiovisual fragments with a down-scaled fast-search information signal. This representation addresses human perception, which employs both visual and auditory cues. Hereby detailed normal-play information is rendered in a main window, while a coarse overview is provided by fast-search information rendered in a second Picture-in-Picture (PiP) window.
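As a rough check on the mosaic-screen figures quoted above, the following sketch reproduces the arithmetic under one reading of the text: one subpicture is derived per scene, so a 3-second scene duration gives about 0.3 subpictures per second. It also shows why a fixed bit cost simplifies addressing. The function names, the flat storage layout and the example sizes are hypothetical and are not taken from the thesis.

```python
from fractions import Fraction

INTRA_RATE_HZ = Fraction(2)      # intra-picture rate quoted in the text
SCENE_DURATION_S = Fraction(3)   # scene duration quoted in the text


def subpicture_rate_hz(scene_duration_s=SCENE_DURATION_S):
    """Assumed reading: one subpicture per scene, so 1/duration per second."""
    return 1 / Fraction(scene_duration_s)


def subpicture_offset(index, mini_slices, mini_slice_bytes):
    """Byte offset of subpicture `index` in a hypothetical flat store.

    Every mini slice has the same size, so every subpicture has the same
    size and its position follows from a single multiplication.
    """
    return index * mini_slices * mini_slice_bytes


rate = subpicture_rate_hz()
print(rate)                  # 1/3, i.e. roughly 0.3 Hz
print(INTRA_RATE_HZ / rate)  # 6 intra pictures per derived subpicture
```

Under these assumptions only one in six intra-coded pictures needs to be turned into a subpicture, which is consistent with the low processing load stated in the text.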
Due to simultaneous rendering, a viewer can switch between the two information signals, perceiving either a fast- or a detailed overview, guiding