MPEG STANDARDS EVOLUTION and IMPACT on CABLE Dr

Total Page:16

File Type:pdf, Size:1020Kb

MPEG STANDARDS EVOLUTION and IMPACT on CABLE Dr MPEG STANDARDS EVOLUTION AND IMPACT ON CABLE Dr. Paul Moroney, Dr. Ajay Luthra Motorola Broadband Communications Sector Abstract continues in that role to this day. MPEG-1 [1] targeted stored content up to a 1.5 Mbps rate, The MPEG standards process has evolved matching the parameters necessary to store on from its beginnings in 1988 through today to CDs. MPEG-1 addressed audio compression, cover a wide range of multimedia delivery video compression, and a storage-oriented technology. The entire phenomenon of digital systems layer to combine them with other television and the standards involved have higher-level information, and the resulting affected our culture through increased content standard earned an Emmy award. MPEG-2 choices for the consumer, increased [2] shifted the focus to entertainment competition, and new types of products. television, added more video compression Cable delivery has seen a huge impact, in tools, multi-channel audio, and a new systems particular from MPEG-2 based digital layer more tuned to the need for a transport television. If we can extrapolate from the definition for broadcast. This standard has evolution of the MPEG process, more change been deployed very successfully worldwide, is coming! and earned an unprecedented second Emmy for MPEG. The designation “MPEG-3” had INTRODUCTION been set aside for HDTV extensions, but MPEG-2 worked so well for this application No one would dispute that the cable that MPEG-3 was never needed. industry has been dramatically impacted by the transition to digital television and related MPEG-4 [3] began slowly, while MPEG-2 services, whether one considers the advent of based systems were deployed, and has effective DBS services, or the distribution of developed into a true next generation digital television to cable headends, or the multimedia system. MPEG-4 covers a much distribution of digital services through the broader scope than its predecessors, and cable plant to consumers. Although many achieves dramatic improvements in the factors have played a role in the process, underlying source coding technology as well. certainly the development of true standards was and continues to be a true driver. The MPEG has also begun efforts in two ISO/IEC sponsored Motion Picture Experts related areas, not focused per se on Group [MPEG] has been the primary multimedia compression. MPEG-7 [4] is organization for the formation of the basic targeted as multimedia search and retrieval, multimedia source coding standards. This enabling the creation of search engines for paper examines where MPEG has been, and such content. MPEG-21 [5] (the 21 suffix where it is going, to build a vision of the representing the 21st century) addresses the future of digital services over cable. goal of true universal multimedia access. This Particular attention is paid to video group focuses on defining a multimedia compression. framework so that broad classes of content can be created, delivered, and consumed in an The MPEG process began in October of interoperable manner. 1988, under the very effective leadership of its convener, Leonardo Chiariglione, who MPEG-1 AND MPEG-2 The last key aspect is the inclusion of MPEG-1 and MPEG-2 are defined in three motion estimation and compensation, where main parts, addressing video source coding, the above approach applies to only the audio source coding, and a systems layer. difference between a current frame (macroblock) region, and its best possible Video match in the prior frame. The offsets representing the best match, to the nearest x MPEG video source coding is designed to and y half-pixel, become motion vectors for remove redundancy in video content both the region, and are included in as header frame to frame, and within a frame. information. Algorithms that achieve these goals without degradation in the reconstructed video images Figure 1 shows the most basic block are “lossless,” that is, reversible. Algorithms diagram of this compression process. Note that sacrifice quality in the interests of bit rate the presence of the decode (decompress) loop, reduction are termed “lossy.” Lossy so that the compressor will be able to algorithms attempt to hide errors, or artifacts, transform the pixel differences required for based upon models of human video motion compensation. perception. For example, it is more difficult to detect artifacts under high motion. It is also true that human vision chroma resolution is lower than human vision luminance DCT Quantize VLC resolution. Compression systems, such as MPEG, that seek to address a wide range of applications typically employ both lossless Decode and lossy algorithms. MPEG video compression at its base employs transformation, quantization, and variable length coding (VLC), in that order. Figure 1: Basic MPEG Compressor Pixel domain representations of video frames are transformed through an 8 by 8 Discrete Although not shown in the Figure above, Cosine Transform (DCT) to a frequency MPEG also allows motion estimation from a domain representation. The resulting spatial future frame, or from both the previous and frequency coefficients are scanned in a zigzag future frames, in combination (See Figure 2). manner, and quantized. The resulting In MPEG parlance, a frame without motion sequence of amplitudes and runs of zero compensation is an I frame, one with values are then Huffman coded. The bits compensation from only the previous frame is generated are grouped under a syntax a P frame, and one with bi-directional motion organized roughly at the sequence level (a is a B frame. Note that in order to decode a B number of consecutive frames), the picture frame, coded frames must be sent out of (typically frame) level, the slice level (rows of (display) order. A decoder must have decoded blocks), the macroblock level (4 luminance the future frame before that frame can be used and 2 chrominance blocks) and the DCT block to “anchor” any frame decoded with bi- level, including header information directional motion. appropriate to each. film frame 4 becomes NTSC field 9 and 10. MPEG allows compressors to drop the I B B P duplicate fields, and code the film frames directly. An overhead bit called “repeat field” Figure 2: B Frame Prediction informs the decompressor, or decoder, how to restore NTSC for display. The MPEG-2 standard introduced various enhancements, but two major changes were Given the descriptions above, one can deduce the addition of frame/field mode and direct the philosophy adopted by the MPEG support for film content. In order to compress planners. The maximal benefit to equipment interlaced video signals intended to be suppliers, and consumers, taken together, was ultimately displayed on interlaced televisions, determined to be achieved through a standard (see Figure 3) efficiency is enhanced by that defines decoders to the extent required to allowing the 8 by 8 pixels to be assembled be able to receive all signals that are “MPEG from alternate (interlaced) fields. In fact, compliant” according to the standard syntax. throughout the interlaced content, some Thus all aspects of the decoder are defined, regions compress better in this fashion [frame with the exception of error recovery and the mode] and some compress better by taking the specifics of the display generation. 8 by 8 pixel block from each field separately! Thus the standard allows either approach, The compression process, however, signaled by header bits. provides substantial room for creativity and invention, within the context of the standard. Any compressor can be built, so long as an MPEG compliant decoder can decode the Field 1 signals produced. Compression is, thus, far Field 2 more of an art form, than decompression. Consider the various degrees of freedom. Which frame regions are to be predicted, and how? For predicted regions, what are the best motion vectors? Which frames do you select as I frames, or which slices are not predicted Figure 3: Interlaced Block Pair at all, so that decompression equipment can acquire a signal after a channel change, or after an error? How much should any given For film content carried in NTSC television block or region coefficients be quantized to signals, common practice has been to convert minimize perceptual loss yet achieve some the 24 frames per second progressive film to desired net output bit rate (rate control)? the NTSC 29.97 frames per second interlace How does the equipment estimate such loss? (59.94 fields per second) through a telecine Which regions are best coded as fields, rather process known as 3:2 pull down. The first than frames? What is the best way to scan the film frame is divided into two fields, and the coefficients (there is more than one option first field is repeated in the NTSC signal as a provided in the standard)? “field 3”. Frame 2 of the film becomes “field 4” and “field 5,” without any repeat. Film MPEG-2 introduced the concept of profiles frame 3 becomes fields 6 and 7, and field 6 is and levels, to allow the broad range of repeated as a field 8. To complete the cycle, compression tools and bit rates to be targeted at classes of applications. This allowed also defined a new non-backward compatible products to be compliant at only a specific algorithm called Advanced Audio Coding profile and level; thus low cost consumer (AAC). AAC provides the best quality overall devices were possible. No one device was audio coding approach, at added complexity, burdened with implementing all the tools of both for multi-channel surround and for MPEG, at all bit rates. regular stereo audio. With AAC, 64 kbps stereo audio is generally perceived as excellent. Audio Systems MPEG audio compression bears several MPEG defined a systems layer to combine similarities to the video compression the various audio and video streams into a described above.
Recommended publications
  • Versatile Video Coding – the Next-Generation Video Standard of the Joint Video Experts Team
    31.07.2018 Versatile Video Coding – The Next-Generation Video Standard of the Joint Video Experts Team Mile High Video Workshop, Denver July 31, 2018 Gary J. Sullivan, JVET co-chair Acknowledgement: Presentation prepared with Jens-Rainer Ohm and Mathias Wien, Institute of Communication Engineering, RWTH Aachen University 1. Introduction Versatile Video Coding – The Next-Generation Video Standard of the Joint Video Experts Team 1 31.07.2018 Video coding standardization organisations • ISO/IEC MPEG = “Moving Picture Experts Group” (ISO/IEC JTC 1/SC 29/WG 11 = International Standardization Organization and International Electrotechnical Commission, Joint Technical Committee 1, Subcommittee 29, Working Group 11) • ITU-T VCEG = “Video Coding Experts Group” (ITU-T SG16/Q6 = International Telecommunications Union – Telecommunications Standardization Sector (ITU-T, a United Nations Organization, formerly CCITT), Study Group 16, Working Party 3, Question 6) • JVT = “Joint Video Team” collaborative team of MPEG & VCEG, responsible for developing AVC (discontinued in 2009) • JCT-VC = “Joint Collaborative Team on Video Coding” team of MPEG & VCEG , responsible for developing HEVC (established January 2010) • JVET = “Joint Video Experts Team” responsible for developing VVC (established Oct. 2015) – previously called “Joint Video Exploration Team” 3 Versatile Video Coding – The Next-Generation Video Standard of the Joint Video Experts Team Gary Sullivan | Jens-Rainer Ohm | Mathias Wien | July 31, 2018 History of international video coding standardization
    [Show full text]
  • Digital Video Quality Handbook (May 2013
    Digital Video Quality Handbook May 2013 This page intentionally left blank. Executive Summary Under the direction of the Department of Homeland Security (DHS) Science and Technology Directorate (S&T), First Responders Group (FRG), Office for Interoperability and Compatibility (OIC), the Johns Hopkins University Applied Physics Laboratory (JHU/APL), worked with the Security Industry Association (including Steve Surfaro) and members of the Video Quality in Public Safety (VQiPS) Working Group to develop the May 2013 Video Quality Handbook. This document provides voluntary guidance for providing levels of video quality in public safety applications for network video surveillance. Several video surveillance use cases are presented to help illustrate how to relate video component and system performance to the intended application of video surveillance, while meeting the basic requirements of federal, state, tribal and local government authorities. Characteristics of video surveillance equipment are described in terms of how they may influence the design of video surveillance systems. In order for the video surveillance system to meet the needs of the user, the technology provider must consider the following factors that impact video quality: 1) Device categories; 2) Component and system performance level; 3) Verification of intended use; 4) Component and system performance specification; and 5) Best fit and link to use case(s). An appendix is also provided that presents content related to topics not covered in the original document (especially information related to video standards) and to update the material as needed to reflect innovation and changes in the video environment. The emphasis is on the implications of digital video data being exchanged across networks with large numbers of components or participants.
    [Show full text]
  • (A/V Codecs) REDCODE RAW (.R3D) ARRIRAW
    What is a Codec? Codec is a portmanteau of either "Compressor-Decompressor" or "Coder-Decoder," which describes a device or program capable of performing transformations on a data stream or signal. Codecs encode a stream or signal for transmission, storage or encryption and decode it for viewing or editing. Codecs are often used in videoconferencing and streaming media solutions. A video codec converts analog video signals from a video camera into digital signals for transmission. It then converts the digital signals back to analog for display. An audio codec converts analog audio signals from a microphone into digital signals for transmission. It then converts the digital signals back to analog for playing. The raw encoded form of audio and video data is often called essence, to distinguish it from the metadata information that together make up the information content of the stream and any "wrapper" data that is then added to aid access to or improve the robustness of the stream. Most codecs are lossy, in order to get a reasonably small file size. There are lossless codecs as well, but for most purposes the almost imperceptible increase in quality is not worth the considerable increase in data size. The main exception is if the data will undergo more processing in the future, in which case the repeated lossy encoding would damage the eventual quality too much. Many multimedia data streams need to contain both audio and video data, and often some form of metadata that permits synchronization of the audio and video. Each of these three streams may be handled by different programs, processes, or hardware; but for the multimedia data stream to be useful in stored or transmitted form, they must be encapsulated together in a container format.
    [Show full text]
  • On the Accuracy of Video Quality Measurement Techniques
    On the accuracy of video quality measurement techniques Deepthi Nandakumar Yongjun Wu Hai Wei Avisar Ten-Ami Amazon Video, Bangalore, India Amazon Video, Seattle, USA Amazon Video, Seattle, USA Amazon Video, Seattle, USA, [email protected] [email protected] [email protected] [email protected] Abstract —With the massive growth of Internet video and therefore measuring and optimizing for video quality in streaming, it is critical to accurately measure video quality these high quality ranges is an imperative for streaming service subjectively and objectively, especially HD and UHD video which providers. In this study, we lay specific emphasis on the is bandwidth intensive. We summarize the creation of a database measurement accuracy of subjective and objective video quality of 200 clips, with 20 unique sources tested across a variety of scores in this high quality range. devices. By classifying the test videos into 2 distinct quality regions SD and HD, we show that the high correlation claimed by objective Globally, a healthy mix of devices with different screen sizes video quality metrics is led mostly by videos in the SD quality and form factors is projected to contribute to IP traffic in 2021 region. We perform detailed correlation analysis and statistical [1], ranging from smartphones (44%), to tablets (6%), PCs hypothesis testing of the HD subjective quality scores, and (19%) and TVs (24%). It is therefore necessary for streaming establish that the commonly used ACR methodology of subjective service providers to quantify the viewing experience based on testing is unable to capture significant quality differences, leading the device, and possibly optimize the encoding and delivery to poor measurement accuracy for both subjective and objective process accordingly.
    [Show full text]
  • Security Solutions Y in MPEG Family of MPEG Family of Standards
    1 Security solutions in MPEG family of standards TdjEbhiiTouradj Ebrahimi [email protected] NET working? Broadcast Networks and their security 16-17 June 2004, Geneva, Switzerland MPEG: Moving Picture Experts Group 2 • MPEG-1 (1992): MP3, Video CD, first generation set-top box, … • MPEG-2 (1994): Digital TV, HDTV, DVD, DVB, Professional, … • MPEG-4 (1998, 99, ongoing): Coding of Audiovisual Objects • MPEG-7 (2001, ongo ing ): DitiDescription of Multimedia Content • MPEG-21 (2002, ongoing): Multimedia Framework NET working? Broadcast Networks and their security 16-17 June 2004, Geneva, Switzerland MPEG-1 - ISO/IEC 11172:1992 3 • Coding of moving pictures and associated audio for digital storage media at up to about 1,5 Mbit/s – Part 1 Systems - Program Stream – Part 2 Video – Part 3 Audio – Part 4 Conformance – Part 5 Reference software NET working? Broadcast Networks and their security 16-17 June 2004, Geneva, Switzerland MPEG-2 - ISO/IEC 13818:1994 4 • Generic coding of moving pictures and associated audio – Part 1 Systems - joint with ITU – Part 2 Video - joint with ITU – Part 3 Audio – Part 4 Conformance – Part 5 Reference software – Part 6 DSM CC – Par t 7 AAC - Advance d Au dio Co ding – Part 9 RTI - Real Time Interface – Part 10 Conformance extension - DSM-CC – Part 11 IPMP on MPEG-2 Systems NET working? Broadcast Networks and their security 16-17 June 2004, Geneva, Switzerland MPEG-4 - ISO/IEC 14496:1998 5 • Coding of audio-visual objects – Part 1 Systems – Part 2 Visual – Part 3 Audio – Part 4 Conformance – Part 5 Reference
    [Show full text]
  • The H.264 Advanced Video Coding (AVC) Standard
    Whitepaper: The H.264 Advanced Video Coding (AVC) Standard What It Means to Web Camera Performance Introduction A new generation of webcams is hitting the market that makes video conferencing a more lifelike experience for users, thanks to adoption of the breakthrough H.264 standard. This white paper explains some of the key benefits of H.264 encoding and why cameras with this technology should be on the shopping list of every business. The Need for Compression Today, Internet connection rates average in the range of a few megabits per second. While VGA video requires 147 megabits per second (Mbps) of data, full high definition (HD) 1080p video requires almost one gigabit per second of data, as illustrated in Table 1. Table 1. Display Resolution Format Comparison Format Horizontal Pixels Vertical Lines Pixels Megabits per second (Mbps) QVGA 320 240 76,800 37 VGA 640 480 307,200 147 720p 1280 720 921,600 442 1080p 1920 1080 2,073,600 995 Video Compression Techniques Digital video streams, especially at high definition (HD) resolution, represent huge amounts of data. In order to achieve real-time HD resolution over typical Internet connection bandwidths, video compression is required. The amount of compression required to transmit 1080p video over a three megabits per second link is 332:1! Video compression techniques use mathematical algorithms to reduce the amount of data needed to transmit or store video. Lossless Compression Lossless compression changes how data is stored without resulting in any loss of information. Zip files are losslessly compressed so that when they are unzipped, the original files are recovered.
    [Show full text]
  • AVC, HEVC, VP9, AVS2 Or AV1? — a Comparative Study of State-Of-The-Art Video Encoders on 4K Videos
    AVC, HEVC, VP9, AVS2 or AV1? | A Comparative Study of State-of-the-art Video Encoders on 4K Videos Zhuoran Li, Zhengfang Duanmu, Wentao Liu, and Zhou Wang University of Waterloo, Waterloo, ON N2L 3G1, Canada fz777li,zduanmu,w238liu,[email protected] Abstract. 4K, ultra high-definition (UHD), and higher resolution video contents have become increasingly popular recently. The largely increased data rate casts great challenges to video compression and communication technologies. Emerging video coding methods are claimed to achieve su- perior performance for high-resolution video content, but thorough and independent validations are lacking. In this study, we carry out an in- dependent and so far the most comprehensive subjective testing and performance evaluation on videos of diverse resolutions, bit rates and content variations, and compressed by popular and emerging video cod- ing methods including H.264/AVC, H.265/HEVC, VP9, AVS2 and AV1. Our statistical analysis derived from a total of more than 36,000 raw sub- jective ratings on 1,200 test videos suggests that significant improvement in terms of rate-quality performance against the AVC encoder has been achieved by state-of-the-art encoders, and such improvement is increas- ingly manifest with the increase of resolution. Furthermore, we evaluate state-of-the-art objective video quality assessment models, and our re- sults show that the SSIMplus measure performs the best in predicting 4K subjective video quality. The database will be made available online to the public to facilitate future video encoding and video quality research. Keywords: Video compression, quality-of-experience, subjective qual- ity assessment, objective quality assessment, 4K video, ultra-high-definition (UHD), video coding standard 1 Introduction 4K, ultra high-definition (UHD), and higher resolution video contents have en- joyed a remarkable growth in recent years.
    [Show full text]
  • Video Quality Assessment: Subjective Testing of Entertainment Scenes Margaret H
    1 Video Quality Assessment: Subjective testing of entertainment scenes Margaret H. Pinson, Lucjan Janowski, and Zdzisław Papir This article describes how to perform a video quality subjective test. For companies, these tests can greatly facilitate video product development; for universities, removing perceived barriers to conducting such tests allows expanded research opportunities. This tutorial assumes no prior knowledge and focuses on proven techniques. (Certain commercial equipment, materials, and/or programs are identified in this article to adequately specify the experimental procedure. In no case does such identification imply recommendation or endorsement by the National Telecommunications and Information Administration, nor does it imply that the program or equipment identified is necessarily the best available for this application.) Video is a booming industry: content is embedded on many Web sites, delivered over the Internet, and streamed to mobile devices. Cisco statistics indicate that video exceeded 50% of total mobile traffic for the first time in 2012, and predicts that two-thirds of the world’s mobile data traffic will be video by 2017 [1]. Each company must make a strategic decision on the correct balance between delivery cost and user experience. This decision can be made by the engineers designing the service or, for increased accuracy, by consulting users [2]. Video quality assessment requires a combined approach that includes objective metrics, subjective testing, and live video monitoring. Subjective testing is a critical step to ensuring success in operational management and new product development. Carefully conducted video quality subjective tests are extremely reliable and repeatable, as is shown in Section 8 of [3]. This article provides an approachable tutorial on how to conduct a subjective video quality experiment.
    [Show full text]
  • ITC Confplanner DVD Pages Itcconfplanner
    Comparative Analysis of H.264 and Motion- JPEG2000 Compression for Video Telemetry Item Type text; Proceedings Authors Hallamasek, Kurt; Hallamasek, Karen; Schwagler, Brad; Oxley, Les Publisher International Foundation for Telemetering Journal International Telemetering Conference Proceedings Rights Copyright © held by the author; distribution rights International Foundation for Telemetering Download date 25/09/2021 09:57:28 Link to Item http://hdl.handle.net/10150/581732 COMPARATIVE ANALYSIS OF H.264 AND MOTION-JPEG2000 COMPRESSION FOR VIDEO TELEMETRY Kurt Hallamasek, Karen Hallamasek, Brad Schwagler, Les Oxley [email protected] Ampex Data Systems Corporation Redwood City, CA USA ABSTRACT The H.264/AVC standard, popular in commercial video recording and distribution, has also been widely adopted for high-definition video compression in Intelligence, Surveillance and Reconnaissance and for Flight Test applications. H.264/AVC is the most modern and bandwidth-efficient compression algorithm specified for video recording in the Digital Recording IRIG Standard 106-11, Chapter 10. This bandwidth efficiency is largely derived from the inter-frame compression component of the standard. Motion JPEG-2000 compression is often considered for cockpit display recording, due to the concern that details in the symbols and graphics suffer excessively from artifacts of inter-frame compression and that critical information might be lost. In this paper, we report on a quantitative comparison of H.264/AVC and Motion JPEG-2000 encoding for HD video telemetry. Actual encoder implementations in video recorder products are used for the comparison. INTRODUCTION The phenomenal advances in video compression over the last two decades have made it possible to compress the bit rate of a video stream of imagery acquired at 24-bits per pixel (8-bits for each of the red, green and blue components) with a rate of a fraction of a bit per pixel.
    [Show full text]
  • Using H.264 and H.265 Codecs for 4K Transmissions
    ISSN 1843-6188 Scientific Bulletin of the Electrical Engineering Faculty – Year 15 No.1 (29) OBJECTIVE VIDEO QUALITY ASSESMENT: USING H.264 AND H.265 CODECS FOR 4K TRANSMISSIONS NICOLETA ANGELESCU Department of Electronics, Telecommunications and Energetics, Valahia University of Targoviste E-mail: [email protected] Abstract. H.265 codecs has attracted a lot of attention solution can be applied only to MPEG 2 video files to for transmission of digital video content with 4k reduce storage space. In [5], the H.265 and H.264 codecs resolution (3840x2160 pixels), since offers necessary are evaluated on video quality for 4k resolution. The bandwidths saving in comparison with H.264 codec. We authors use three objective quality metrics for evaluation. compare both codecs using three objective video quality The authors conclude that H.265 codec offer the best metrics on five standard 4k video test sequences for improvement comparing to H.264 codec for low bit-rate. digital TV transmissions using both codecs. In [6], a solution to reduce bit rate for MPEG 2 and H.264 transmission is proposed. The base layer at MPEG Keywords: H.264, HECV, VQM, SSIM, objective video 2 standard resolution is multiplexed with the enhanced quality layer (which is the difference between scaled MPEG 2 video content and original high resolution video captured) compressed with H.264 codec and transmitted to user 1. INTRODUCTION destination. The same solution can be applied for H.264 and H.265 content. The users with receivers which Nowadays, the majority of video content is stored in support Full HD resolution view the 4K content scaled lossy compressed form.
    [Show full text]
  • On Evaluating Perceptual Quality of Online User-Generated Videos Soobeom Jang, and Jong-Seok Lee, Senior Member, IEEE
    JOURNAL OF LATEX CLASS FILES, VOL. 13, NO. 9, SEPTEMBER 2014 1 On Evaluating Perceptual Quality of Online User-Generated Videos Soobeom Jang, and Jong-Seok Lee, Senior Member, IEEE Abstract—This paper deals with the issue of the perceptual quality assessment by employing multiple human subjects. quality evaluation of user-generated videos shared online, which However, subjective quality assessment is not feasible for is an important step toward designing video-sharing services online videos because of a tremendous amount of videos that maximize users’ satisfaction in terms of quality. We first analyze viewers’ quality perception patterns by applying graph in online video-sharing services. An alternative is objective analysis techniques to subjective rating data. We then examine quality assessment, which uses a model that mimics the human the performance of existing state-of-the-art objective metrics for perceptual mechanism. the quality estimation of user-generated videos. In addition, we The traditional objective quality metrics to estimate video investigate the feasibility of metadata accompanied with videos quality have two limitations. First, the existence of the ref- in online video-sharing services for quality estimation. Finally, various issues in the quality assessment of online user-generated erence video, i.e., the pristine video of the given video, is videos are discussed, including difficulties and opportunities. important. In general, objective quality assessment frameworks are classified into three groups: full-reference (FR), reduced- Index Terms—User-generated video, paired comparison, qual- ity assessment, metadata. reference (RR), and no-reference (NR). In the cases of FR and RR frameworks, full or partial information about the reference is provided.
    [Show full text]
  • The H.264/MPEG4 Advanced Video Coding Standard and Its Applications
    SULLIVAN LAYOUT 7/19/06 10:38 AM Page 134 STANDARDS REPORT The H.264/MPEG4 Advanced Video Coding Standard and its Applications Detlev Marpe and Thomas Wiegand, Heinrich Hertz Institute (HHI), Gary J. Sullivan, Microsoft Corporation ABSTRACT Regarding these challenges, H.264/MPEG4 Advanced Video Coding (AVC) [4], as the latest H.264/MPEG4-AVC is the latest video cod- entry of international video coding standards, ing standard of the ITU-T Video Coding Experts has demonstrated significantly improved coding Group (VCEG) and the ISO/IEC Moving Pic- efficiency, substantially enhanced error robust- ture Experts Group (MPEG). H.264/MPEG4- ness, and increased flexibility and scope of appli- AVC has recently become the most widely cability relative to its predecessors [5]. A recently accepted video coding standard since the deploy- added amendment to H.264/MPEG4-AVC, the ment of MPEG2 at the dawn of digital televi- so-called fidelity range extensions (FRExt) [6], sion, and it may soon overtake MPEG2 in further broaden the application domain of the common use. It covers all common video appli- new standard toward areas like professional con- cations ranging from mobile services and video- tribution, distribution, or studio/post production. conferencing to IPTV, HDTV, and HD video Another set of extensions for scalable video cod- storage. This article discusses the technology ing (SVC) is currently being designed [7, 8], aim- behind the new H.264/MPEG4-AVC standard, ing at a functionality that allows the focusing on the main distinct features of its core reconstruction of video signals with lower spatio- coding technology and its first set of extensions, temporal resolution or lower quality from parts known as the fidelity range extensions (FRExt).
    [Show full text]