Description-Driven Media Resource Adaptation

Total Page:16

File Type:pdf, Size:1020Kb

Description-Driven Media Resource Adaptation Beschrijvingsgedreven aanpassing van mediabronnen Description-Driven Adaptation of Media Resources Wesley De Neve Promotor: prof. dr. ir. R. Van de Walle Proefschrift ingediend tot het behalen van de graad van Doctor in de Ingenieurswetenschappen: Computerwetenschappen Vakgroep Elektronica en Informatiesystemen Voorzitter: prof. dr. ir. J. Van Campenhout Faculteit Ingenieurswetenschappen Academiejaar 2006 - 2007 ISBN 978-90-8578-132-5 NUR 980, 986 Wettelijk depot: D/2007/10.500/6 Dankwoord Met het indienen van dit proefschrift sluit ik een periode van ruim vier jaar af. In deze tijdspanne heb ik de mogelijkheid gekregen om in een boeiend domein aan wetenschappelijk onderzoek te doen. Dit is dan ook een uitgelezen kans om de mensen te bedanken zonder wie dit werk niet mogelijk zou zijn geweest. In de eerste plaats wil ik mijn promotor, prof. dr. ir. Rik Van de Walle bedanken. Dankzij zijn onuitputtelijke inzet is het Multimedia Lab als onder- zoeksgroep uitgegroeid tot een gevestigde waarde op nationaal en internatio- naal vlak. Deze omgeving vormde voor mij dan ook het ideale kader voor het behalen van mijn doctoraat. Verder wil ik mijn promotor eveneens bedanken voor de kansen die hij mij geboden heeft om mijn onderzoek voor te stellen op verschillende internationale conferenties. Het bijwonen van deze conferenties heeft me als onderzoeker, maar ook als mens, enorm veel bijgebracht. Ook mijn vroegere (Boris Rogge, Sam Lerouge) en huidige collega’s van het Multimedia Lab wil ik uitgebreid bedanken. Het was voor mij erg aange- naam in een dergelijke open en vriendschappelijke omgeving te kunnen wer- ken. Een aantal collega’s hebben grote delen van dit proefschrift heel grondig nagelezen. Hun opmerkingen en suggesties hebben de kwaliteit van dit werk in grote mate verhoogd. Ik wil hen dan ook in het bijzonder danken voor deze inspanningen. Ook Rita Breems en Ellen Lammens bedank ik van harte voor de administratieve ondersteuning en de leuke babbels tijdens het middageten. Een bijzonder woord van dank gaat ook uit naar Davy Van Deursen voor de bijdrage die hij, als thesisstudent en als collega, tot dit werk heeft gele- verd. Ook Peter Lambert en Davy De Schrijver wens ik te bedanken voor de talrijke en boeiende discussies over H.264/AVC en MPEG-21 BSDL. In deze context zou ik eveneens Gary J. Sullivan (Microsoft Corporation) en Miska M. Hannuksela (Nokia) willen danken, en dit voor de tips en de informatie die zij verstrekt hebben met betrekking tot het gebruik van sub-sequences in de H.264/AVC-standaard. Tevens wens ik Danny Hong (Columbia Universi- ty) te bedanken voor het delen van zijn kennis met betrekking tot de werking van XFlavor. Ook de anonieme reviewers van het door Elsevier gepubliceerde ii manuscript over BFlavor wil ik uitdrukkelijk bedanken voor de uitgebreide en opbouwende feedback. Ik wens ook mijn ouders te bedanken voor de vele kansen die zij mij ga- ven, alsook voor hun raad, steun en vertrouwen in mij. Verder wil ik ook mijn zus Lindsey bedanken voor haar relativerende kijk op haar grote broer. Ook wens ik al mijn vrienden van harte te bedanken voor de vele boeiende en leuke momenten, zowel in de online als offline wereld. Zij zorgden ervoor dat ik regelmatig mijn zinnen kon verzetten en dat de vele uren voor het computer- scherm draagbaar bleven. Tot slot wil ik nog een aantal bijzondere mensen bedanken die ik heb le- ren kennen dankzij mijn onderzoek, zij het in een meer informele context: Charles-Frederik Hollemeersch, Elisabeth Hollemeersch, en Mieke De Kete- laere, voor de onvergetelijke deelname aan de Imagine Cup in Brazilie;¨ R.A. Skip Campbell, voor de avontuurlijke roadtrip in Coloroda; Leyda en Florence (Xiafen Zhang), voor jullie boeiende verhalen over China; en Sehee Lee, voor het gidsen in het Land van de Ochtendkalmte. Wesley M. De Neve 31 Augustus 2006 Summary The last decade has witnessed a significant number of innovative developments in the multimedia ecosystem. Advanced media formats have emerged for the efficient representation, storage, and presentation of digital media resources. New network technologies have been devised, wired and wireless, providing access to audio-visual information services such as online music stores, movie download services, and video blogs. A plethora of networked mobile devices has popped up as well, ranging from cell phones to personal entertainment de- vices, often having sufficient processing power for the playback of multimedia presentations. From these observations, it is clear that the multimedia land- scape is characterized by a vast diversity in terms of media formats, network capabilities, and device properties. The ever-increasing heterogeneity in the multimedia consumption chain poses a number of new challenges. One such challenge is the realization of the Universal Multimedia Access paradigm, which is the notion that multimedia content should be accessible at any place, at any time, and with any device. As acknowledged by the Moving Picture Experts Group (MPEG), the successful realization of ubiquitous and seamless access to multimedia content requires an appropriate reaction from different knowledge domains. • The answer of MPEG’s coding community consists of the specification of scalable or layered coding schemes. Indeed, the picture rate and spa- tial resolution of scalable video resources can for instance be adapted in a straightforward way to meet the different constraints that are imposed by a particular usage context (e.g., constraints in terms of available band- width, screen resolution, and so on). • The answer of MPEG’s metadata community consists of the develop- ment of a number of description tools. These tools are for example used to describe the properties of media resources and the capabilities or con- straints of usage environments. The resulting descriptions enable the construction of a format-agnostic content adaptation system that is able iv to maximize the user experience, for the consumption of a particular multimedia presentation in a well-defined usage environment. In this dissertation, we have first studied a number of concepts and de- sign principles of the state-of-the-art H.264/MPEG-4 Advanced Video Coding standard, typically abbreviated as H.264/AVC. The H.264/AVC specification incorporates the latest advances in standardized video coding technology. As demonstrated by our experiments, as well as by other scientific and technical sources, H.264/AVC provides up to 50% bit rate savings for equivalent per- ceptual quality relative to the performance of prior video coding standards. The design of the H.264/AVC standard is, besides efficiency, also char- acterized by a flexibility for use over a broad variety of network types and application domains1. In this context, we have reviewed several content adap- tation tools that are part of the initial version of the H.264/AVC specification. Examples include the exploitation of multi-layered temporal scalability and the use of Flexible Macroblock Ordering (FMO) for region of interest coding. The emphasis of our review was put on providing a complete and de- tailed picture regarding H.264/AVC’s temporal adaptivity provisions. As such, our overview contains an extensive discussion with respect to the use of sub- sequences and sub-sequence layers, coding patterns based on hierarchical bidi- rectionally coded pictures (B pictures), and Supplemental Enhancement Infor- mation messages (SEI messages) for communicating the bitstream structure to a bitstream extractor or decoder. Sub-sequences are employed to constrain H.264/AVC’s coding flexibility in a minimal way. This allows the execution of meaningful adaptations in the temporal dimension. More powerful adaptivity features and SEI messages are incorporated in a newly developed amendment to the H.264/AVC specification, which is com- monly referred to as H.264/AVC Scalable Video Coding (SVC). This amend- ment includes explicit support for spatial and quality scalability; temporal adaptivity tools are inherited from the first version of the H.264/AVC standard. Further, the principles of Bitstream Syntax Description-driven (BSD- driven) content adaptation were also discussed in this dissertation. A BSD con- tains a description of the high-level structure of a binary media resource, typ- ically expressed using the eXtensible Markup Language (XML). This XML- based description, i.e. the BSD, can be transformed to reflect a desired adap- tation of the binary media resource. The transformed BSD can subsequently be used to automatically create an adapted media resource by relying on a format-independent content adaptation engine. This adapted media resource is then suited for consumption in a particular usage environment. 1Efficient and flexible are two terms that are often used in the context of H.264/AVC. v Two different approaches for BSD-driven content adaptation were studied in more detail: a standardized framework driven by the MPEG-21 Bitstream Syntax Description Language (MPEG-21 BSDL) and a framework based on the use of the Formal Language for Audio-Visual Object Representation, ex- tended with XML features (XFlavor). Both technologies provide different means for the automatic translation of the structure of a binary media resource into an XML-based BSD, and for the subsequent generation of a tailored bit- stream using a transformed BSD. The high-level structure of a number of common video coding and con- tainer formats was described using MPEG-21 BSDL and XFlavor. Particular attention was paid to the construction of a description in MPEG-21 BSDL
Recommended publications
  • Advanced Video Editing with Mpeg Video Wizard
    ADVANCED VIDEO EDITING WITH MPEG VIDEO WIZARD. Start the program. TO EDIT OUT COMMERCIALS: Click on the ‘Files’ tab and then the ‘Open’ icon. Browse to the file you want to edit. For the purpose of this tutorial I will be editing a program stream from Digital TV called ‘Johnny Cash’. A thumbnail of the file will appear in the project window on the left. To start editing, drag the icon to the ‘Input’ window. Using the keyboard is the fastest way to edit. Slide the progress bar along till you get close to the point where you want to mark a starting point for a selection. You can use the left and right arrow keys to move slowly to locate the starting frame. The up and down arrow keys jump through in large steps. You can press d to start playing and again to pause. Press s once to speed up the play and again to go super fast, once more to return to normal play. This also applies to the navigation with the arrow keys so when you have marked the start press s once to change to fast forward or twice to super fast forward and then use the right arrow key to move very quickly to the end of the area where you want to put a mark out. A short key of the left arrow key will take you back one frame at a time as you will mostly zoom past the spot where you want to finish. If you have been zooming super fast just press the s key again and you can move back at normal speed to fine tune the selection.
    [Show full text]
  • MJPEG HOWTO − an Introduction to the MJPEG−Tools MJPEG HOWTO − an Introduction to the MJPEG−Tools Table of Contents
    MJPEG HOWTO − An introduction to the MJPEG−tools MJPEG HOWTO − An introduction to the MJPEG−tools Table of Contents MJPEG HOWTO − An introduction to the MJPEG−tools............................................................................1 Praschinger Bernhard...............................................................................................................................1 1. Introduction..........................................................................................................................................1 2. Unsorted list of useful Hints................................................................................................................2 2.1 Some books we found usefull............................................................................................................4 3. Recording videos.................................................................................................................................4 3.1 lavrec examples..................................................................................................................................4 3.2 Other recording hints.........................................................................................................................6 3.3 Some information about the typical lavrec output while recording...................................................6 3.4 Notes about "interlace field order − what can go wrong and how to fix it"......................................7 There are three bad things that can happen
    [Show full text]
  • Vicbond007's Guide to Working with DVD Footage
    VicBond007’s Guide to Working with DVD Footage for Use in Anime Music Videos Copying a DVD Copying a DVD The first step to allow you to use DVD footage as your source, is to copy the DVD to your hard drive. Commercial DVDs are protected by a Content Scrambling System, abbreviated as CSS. CSS is a set of 5, 2-digit hexidecimal keys that are only supposedly readable by set-top DVD players, and DVD player software. The purpose of this scrambling code is to prevent you from directly accessing the contents of the disc on your pc, otherwise you could easily make a perfect digital copy by just dragging and drop- ping the contents of the disk to your computer, or decoding the video stream directly off the Figure 1: DVD decrypter version 3.2.3.0 detecting the presence of my laptop’s DVD drive, as well as the disc to an AVI or some other file format. fact that Wolf’s Rain: Disc 8 is in the drive. If you tried to decode the contents of a DVD directly off the disc, the picture would be unrec- ognizable and scattered with various pink and green boxes. If you try to drag the .vob files off the disc, in most situations, it will transfer, however the data will just be unreadable and appear corrupt. If you want to know all the geeky information about CSS, then you can find out all about it at the following address: http://www.alleged.com/mirrors/DVD-CSS/ The solution is to copy the DVD to our drive, and remove the encryption in the process.
    [Show full text]
  • MPEG-2 Video Decoders Comparison
    MPEG-2 Video Decoders Comparison Project head: Dr. Dmitriy Vatolin, Dr. Maxim Smirnov Measurements, analysis: Dmitriy Kulikov Translation: Dr. Maxim Smirnov Verification: Alexander Parshin Decoders: • bitcontrol MPEG-2 Video Decoder • DScaler MPEG2 Video Decoder • Elecard MPEG-2 Video Decoder • ffdshow MPEG-4 Video Decoder (libavcodec) • InterVideo Video Decoder • Ligos MPEG Video Decoder • MainConcept MPEG Video Decoder • Pinnacle MPEG-2 Decoder May 2006 CS MSU Graphics&Media Lab Video Group http://www.compression.ru/video/ [email protected] MPEG-2 VIDEO DECODERS COMPARISON CS MSU GRAPHICS&MEDIA LAB VIDEO GROUP MOSCOW, MAY 2006 Contents Contents.......................................................................................................................2 Thanks .........................................................................................................................4 Overview ......................................................................................................................5 Decoders ............................................................................................................................... 5 Sequences............................................................................................................................. 5 Tasks and Test Rules ..................................................................................................6 MPEG-2 Video Codecs Comparison Objectives ..................................................................
    [Show full text]
  • Processing Videos for Use with ELAN
    Mar/2020 Processing videos for use with ELAN. Digital videos can be encoded in many different ways, with a variety of codecs and containers. Depending on the platform (Windows/Mac OS X/Linux) ELAN supports different file formats, largely determined by the available media player frameworks. Each framework supports one or more proprietary formats but also a few formats that can be considered to be more or less cross-platform. Although there can be good reasons to use one of the proprietary formats, we recommend to choose a cross-platform, “standardized” format where possible. In this document we present some recommendations for encoding MPEG-1, MPEG-2 and MP4 (H264) files for use in ELAN. Although MPEG-2 is not usually supported without installing additional codecs or plugins, we include it here because of its high quality. The settings presented here are general recommendations which don’t guarantee good playback performance on all systems, you might still have to experiment which settings that work best for your requirements. Note that this document does not contain any specific recommendation for video & audio files for use with The Language Archive. Although the encoding settings do produce valid video & audio files, one may use different settings to achieve the best quality for archiving. Index: 1. Recommended encoding settings for videos (mp4, mpeg2, mpg1) 2. Free software for processing media 3. Commercial software for processing media 4. Audio visualization in ELAN 1. Recommended encoding settings for video The following settings per encoding/format are general (tool independent) recommendations. Tool specific issues are listed at the end of this chapter.
    [Show full text]
  • Compression Tools
    Compression Tools As you have probably gathered by now, quite a lot of compression tools are available these days. This chapter outlines major multipurpose tools; interesting tools that are single-format or have special features for that format are covered in the chapter on that format. While the dream of “ the single tool that does everything ” hasn’t been realized, most compressionists wind up with a few go-to tools that work well with the sources and platforms they target. Generally, the broader the range of jobs you need to handle, the greater the variety of tools you’ll want to have to do them well. I ’m focusing on software-based tools for transcoding. Where products include live ingest or encoding I’ll call that out, but I am not covering live-only encoders here. What to Look for in a Compression Tool There are a lot of factors to consider in choosing a compression tool, and different factors will be more or less important depending on one’s needs. Class Different compression tools target different usage and pricing models, and what you need varies a lot depending on what you’re trying to do. I break it down into a few general classes, as described in the following sections. Consumer Consumer encoding tools are aimed at the casual users doing their own content. Increasingly these are web-based, not a traditional local app. We’re seeing a lot of adoption of free, open source tools for consumer transcoding, particularly for legally questionable tasks like transcoding from encrypted DVDs.
    [Show full text]
  • Avisynth 2.5 Selected External Plugin Reference
    Avisynth 2.5 Selected External Plugin Reference Wilbert Dijkhof Avisynth 2.5 Selected External Plugin Reference Table of Contents 1 Avisynth 2.5 Selected External Plugin Reference.........................................................................................1 1.1 General info.......................................................................................................................................1 1.2 Deinterlacing & Pulldown Removal..................................................................................................2 1.3 Spatio−Temporal Smoothers.............................................................................................................3 1.4 Spatial Smoothers..............................................................................................................................4 1.5 Temporal Smoothers.........................................................................................................................4 1.6 Sharpen/Soften Plugins......................................................................................................................4 1.7 Resizers..............................................................................................................................................5 1.8 Subtitle (source) Plugins....................................................................................................................5 1.9 MPEG Decoder (source) Plugins.......................................................................................................5
    [Show full text]
  • Overview on Digital Video
    Overview on Digital Video http://atg.wfu.edu/ yip 10/11/2001 For ? •Demos • Lectures • Synchronized Multimedia Presentation • Pre-recorded materials – news, movies • Live streaming of events Complaints • It is so small “stamp size movies” - “lower resolution” (smaller) than TV/VCR • It is so fuzzy from CD-ROM – heavy compression for 1-4x CD-ROM drive • Takes so long to download – not streamed • Stop every once a while – network congestion • Like a slide show instead of movie – fps • Can only be done in IT*(ACSes) / Lib ITC MM Lab / STARS / IS / Biomed Comm. Picky, picky • Digital ≠ Perfect Quality • Blocky artifact • Lossy compression • Degree of compression (4-15Mbps DVD) • Pure dark Advantages 1 • Stored on optical media for distribution – no download • Streamed – no download for large files • No degradation for storage • No generation loss from duplication of original materials “Advantages” 2 • Random start/stop, jump to sections • Programmable controls embedded within web pages for additional information – background, links • Mixed with animations, voice-over, screen capture (not covered – not video “movies”) 5 steps Acquire/ Capture/ Edit Store/ Deliver Create Transfer Make Acquire • From TV – Antenna/Cable/Satellite • From Analog Tape – VHS, SVHS, 8mm, Hi-8 • From Digital Tape – miniDV, Digital 8 • From LaserDisc, VHS-C, Betamax • NOT from DVD – copy protected Quality • SDTV Antenna < Cable (330 lines) • VHS < SVHS (250-410 lines) • 8mm < Hi-8 (250-410 lines) • Connection: Composite < RCA < S-Video < Component • DVB - Satellite, Digital Cable •DVD • DV (miniDV etc.), Digital 8 (500+ lines) S-VHS ET (S-VHS Quasi Playback – SQPB deck needed) Video Size • Video CD format – 352x240 MPEG-1 • DVD Video format – 720x480 MPEG-2 Quality demo Quality 2 - Pro • Misc.
    [Show full text]
  • Cinelerra CV Manual Non-Linear Video Editor for GNU/Linux Community Version 2.1 Edition 1.55.EN
    Cinelerra CV Manual Non-linear video editor for GNU/Linux Community Version 2.1 Edition 1.55.EN Heroine Virtual Ltd Cinelerra CV Team Copyright c 2003, 2004, 2005, 2006 Adam Williams - Heroine Virtual Ltd. Copyright c 2003, 2004, 2005, 2006, 2007 Cinelerra CV Team. This manual is free; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 2 of the License, or (at your option) any later version. This document is distributed in the hope that it will be useful, but WITHOUT ANY WAR- RANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details. You should have received a copy of the GNU General Public License along with this program; if not, write to the Free Software Foundation, Inc., 51 Franklin St, Fifth Floor, Boston, MA 02110, USA. i Table of Contents 1 Introduction ::::::::::::::::::::::::::::::::::::::::::::::::::: 1 1.1 About Cinelerra ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: 1 1.2 The two versions of Cinelerra :::::::::::::::::::::::::::::::::::::::::::::::::::::: 1 1.3 About this manual :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: 2 1.4 Getting help::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: 2 1.5 Tutorials :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: 3 1.6 HOWTOs ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: 3 1.7 Miscellaneous
    [Show full text]
  • VCD Audio + Stills Guide
    VideoCD Audio + Stills A solution compatible with DVD players 1. INTRODUCTION This manual is a translation into English from the original Spanish document available in www.videoedicion.org and www.vcdsp.com, firstly published on June-2002. The procedure described in this manual is a modification from the one entitled VideoCD Audio: an alternative to MP3 CD compatible with DVD players, also available in Spanish in the quoted web. In that manual it was explained how to make a non-standard VCD with many hours of music, up to 99 themes directly reachable through menus, that could be played in many DVD. During the music reproduction the TV screen appeared in black (no image). The main feature added here consists in the possibility of associating a still picture to each music theme (or if you prefer, a music theme to each picture ;-). This will require more place in the CD so that there will be less room available for music, but it will allow you to display on the TV the scanned original CD cover, or any text or image you want, while listening to music. Anyway, the goal is to put several hours of music time in a single CD (from 3,5 up to 6 hours in an 80 minutes CD-R) compatible with (almost) any DVD player. This new method can be seen as the optimisation, for music and stills storage, of the non-standard VCD format called CVCD. When DVD movies are backed to CD you need to store a lot of video information and a comparatively smaller part of audio data.
    [Show full text]
  • Agradecería Que, Una Vez Usado El Manual, Me Indicaras Qué Te Ha Parecido. GRACIAS
    Agradecería que, una vez usado el manual, me indicaras qué te ha parecido. GRACIAS http://www.videoedicion.org/foro/index.php?board=7;action=display;threadid=212 Si después de leer el manual sigues teniendo dudas, puedes usar el foro. Por favor, haz tus preguntas en la categoría adecuada. GRACIAS http://foro.videoedicion.org http://www.videoedicion.org ©Ramón Cutanda López (videoed) Si quieres pasar tus archivos de vídeo MPEG sea cual sea su origen (edición, ripeo de DVD...) a CD o DVD y no te interesa aprender DE VERDAD a usar TMPGEnc y tan sólo quieres aprender a "salir del paso" puedes ahorrarte toda la explicación de los menús y pasar directamente al apartado 4. Creando VCD's, SVCD's o DVD's. También puedes usar la siguiente Tabla de configuración de TMGPEnc por Francisco González 0. Acerca de TMPGEnc 1. File (archivo) 1.1 MPEG Tools 1.1.1 Realizar un MPEG video+audio a partir de un archivo de vídeo y un archivo de audio separados (multiplexar) 1.1.2 Obtener archivos de vídeo y audio separados a partir de un MPEG vídeo+audio 1.1.3 Recortar un archivo MPEG o unir dos MPEG's en uno 2 Option (Opciones) 3. Definiendo el tipo de MPEG de salida: el meollo de la cuestión 3.1 Setting/Vídeo 3.2 Setting/Advanced 3.3 Setting/GOP Structure 3.4 Setting/Quantize matrix 3.5 Setting/Audio 3.6 Setting/System 4. Creando VCD's, SVCD's o DVD's 5. DVD's y TMPGEnc 6. TMPGEnc como compresor de audio 7.
    [Show full text]
  • Making Professional DVD from Authoring to Replication Table Of
    Making Professional DVD from Authoring to Replication Table of Contents 1.0 Introduction ........................................................................................................................................... 3 1.1 Some Basic DVD Concepts ................................................................................................................ 3 1.1.1 How many kinds of DVDs are there? ........................................................................................ 3 1.1.2 Are DVDs all the same around the world?................................................................................ 4 1.1.3 What do I need to know as the DVD author? ........................................................................... 4 2.0 The Workflow ........................................................................................................................................ 6 2.1 Video Capturing (Video Transfer) ................................................................................................... 6 2.2 Video Editing ..................................................................................................................................... 6 2.3 Video Encoding .................................................................................................................................. 7 2.4 DVD Authoring .................................................................................................................................. 7 2.5 DVD Mastering ..................................................................................................................................
    [Show full text]