Multimedia Systems Video III (Video Coding Standards)

Total Page:16

File Type:pdf, Size:1020Kb

Multimedia Systems Video III (Video Coding Standards) Course Presentation Multimedia Systems Video III (Video Coding Standards) Mahdi Amiri December 2015 Sharif University of Technology Video Coding Standards Motivation Motivation Page 1 Multimedia Systems, Mahdi Amiri, Video III Video Coding Standards Standardization Organizations Two organizations have dominated video compression standardization. ITU-T Video Coding Experts Group ( VCEG ) International Telecommunications Union –Telecommunications Standardization Sector (ITU-T, a United Nations Organization, formerly CCITT), Study Group 16, Question 6. ISO/IEC Moving Picture Experts Group ( MPEG ) International Standardization Organization and International Electrotechnical Commission, Joint Technical Committee Number 1, Subcommittee 29, Working Group 11. Page 2 Multimedia Systems, Mahdi Amiri, Video III Video Coding Standards Dynamics VCEG is older and more focused on conventional (esp. low-delay) video coding goals (e.g. good compression and packet-loss/error resilience) MPEG is larger and takes on more ambitious goals (e.g. “object oriented video”, “synthetic-natural hybrid coding”, and digital cinema) Sometimes the major organizations team up (e.g. ISO, IEC and ITU teamed up for both MPEG-2 and JPEG) Relatively little industry consortium activity (DV and organizations that tweak the video coding standards in minor ways, such as DVD, 3GPP, 3GPP2, SMPTE, IETF, etc.) Growing activity for internet streaming media outside of formal standardization (e.g., Microsoft, Real Networks, Quicktime) Page 3 Multimedia Systems, Mahdi Amiri, Video III Video Coding Standards The Scope of Picture and Video Coding Standardization A Video standard specifically do not define an encoder; rather, they define the output that an encoder should produce. A decoding method is defined in each standard (only the Bitstream Syntax and Decoding Process are standardized): e.g. use IDCT, but not how to implement the IDCT. Permits optimization beyond the obvious. Permits complexity reduction for implementability. Provides no guarantees of Quality - only interoperability. Ensuring interoperability : Enabling communication between devices made by different manufacturers Page 4 Multimedia Systems, Mahdi Amiri, Video III Video Quality Evaluation Objective A computer algorithm judges the distortion between videos. Attempts to model a human observer. There is currently no standard method. Page 5 Multimedia Systems, Mahdi Amiri, Video III Video Quality Evaluation Objective Metrics: PSNR Peak Signal-To-Noise Ratio (PSNR). Used widely in evaluating coding performance. Purely mathematical difference. Can be tricked quite easily. Root Mean Squared Error (RMSE) 255 = 2^n – 1 n: the number of bits per image sample Page 6 Multimedia Systems, Mahdi Amiri, Video III Video Quality Evaluation PSNR, Example Original PSNR 35.4 [dB] PSNR 29.0 [dB] Page 7 Multimedia Systems, Mahdi Amiri, Video III ABC AB Original PSNR 45.53 [dB] PSNR 36.81 [dB] PSNR 31.45 [dB] Page 8 Multimedia Systems, Mahdi Amiri, Video III Video Quality Evaluation Mahalanobis Distance The Mahalanobis distance differs from Euclidean Prasanta Chandra Mahalanobis distance in that it takes into account the 1893-1972 Euclid, Floruit 300 BC (Statue of Euclid in the Oxford University correlations of the data set and is scale-invariant. Museum of Natural History) T Ref.: www.aiaccess.net T −1 ()()x −μ x − μ ()()x −μ Σ xμ − Euclidian distance (Squared) Mahalanobis distance (Squared) Page 9 Multimedia Systems, Mahdi Amiri, Video III Video Quality Evaluation Objective Metrics: PSNR How to trick PSNR Take a natural image Give more bits to areas you look at more Give less bits to areas you look at less Subjective rating will be high, PSNR low Original Attention Map Example Test (High subjective rating, low PSNR) Page 10 Multimedia Systems, Mahdi Amiri, Video III Video Quality Evaluation Subjective: MOS Mean Opinion Score (MOS) A human “subject” rates the video on a scale. A numerical indication of the perceived quality of the media received after being transmitted and eventually compressed using codecs. MOS is expressed in one number, from 1 to 5, 1 being the worst and 5 the best. Page 11 Multimedia Systems, Mahdi Amiri, Video III Video Quality Evaluation Subjective In the ITU recommendations, there are many subjective quality test methods. Absolute Category Rating ( ACR ) Degradation Category Rating ( DCR ) The Double-Stimulus Continuous Quality-Scale method ( DSCQS ) Page 12 Multimedia Systems, Mahdi Amiri, Video III Video Quality Evaluation Subjective: ACR and DCR Absolute Category Rating ( ACR ) No reference sequence. Subjects are asked to rate the quality of the presentation based on the level of the quality they have in their opinion for it after viewing or listening it (Single Stimulus). Degradation Category Rating ( DCR ) Known reference sequence. Test sequences are presented in pairs. The first stimulus presented in each pair is always the source reference without any impairments (Double Stimulus). Ref.: www.irisa.fr/armor/lesmembres/Mohamed /Thesis/node147.html Page 13 Multimedia Systems, Mahdi Amiri, Video III Double Stimulus Continuous Quality Scale Method (DSCQS) Video Quality Evaluation Subjective: DSCQS Unknown reference sequence. For having fidelity test the observers are not told which is the reference sequence. Ref.: www.irisa.fr/armor/lesmembres/Mohamed /Thesis/node147.html Page 14 Multimedia Systems, Mahdi Amiri, Video III R-D Curve of Video Codecs ABC Page 15 Multimedia Systems, Mahdi Amiri, Video III Reminder CIF-size image R-D Curve of Video Codecs 352 ×288 ABC Page 16 Multimedia Systems, Mahdi Amiri, Video III R-D Curve of Video Codecs ABC Page 17 Multimedia Systems, Mahdi Amiri, Video III R-D Curve of Video Codecs R-D PerformanceABC of MPEG Codecs 50 48 46 44 42 PSNR(Y) 40 38 36 34 32 350 450 550 650 750 850 950 1050 Bit rate (kbps) MPEG-1 MPEG-2 MPEG-4 H.264 Page 18 Multimedia Systems, Mahdi Amiri, Video III R-D Curve of Video Codecs ABC Page 19 Multimedia Systems, Mahdi Amiri, Video III Video Coding Standards Gary J. Sullivan, Ph.D. ITU-T VCEG Rapporteur/Chairman ISO/IEC MPEG Video Rapporteur/Co-Chairman ITU/ISO/IEC JVT Rapporteur/Co-Chairman VIDEO CODECS STANDARDIZATION HISTORY Page 20 Multimedia Systems, Mahdi Amiri, Video III Video Coding Standards History The Society of Motion Picture and Television Engineers, SMPTE (pron. simpti) is an internationally en.wikipedia.org/wiki/Data_compression recognized standards organizations founded in 1916 (en.wikipedia.org/wiki/SMPTE). Page 21 Multimedia Systems, Mahdi Amiri, Video III Video Coding Standards Major Video Compression Applications Page 22 Multimedia Systems, Mahdi Amiri, Video III Video Coding Standards Moving Picture Experts Group (MPEG) A working group of ISO/IEC in charge of the development of standards for coded representation of digital audio and video and related data. Established in 1988 26 years of activity The number of independent standards: more than 125 Ref.: en.wikipedia.org/wiki/Moving_Picture_Experts_Group Page 23 Multimedia Systems, Mahdi Amiri, Video III MPEG-1 The standard on which such products as Video CD and MP3 are based MPEG-2 The standard on which such products as Digital Television set top boxes and DVD are Videobased; Coding Standards MPEG-4 The standard for multimedia for the fixed and mobile web; MPEG-7 The standard for description and search of audio and visual content; MPEG-21 The Multimedia Framework; MPEG-A The standard providing application-specific formats by integrating multiple MPEG technologies; MPEG-B A collection of Systems specific standards MPEG-C A collection of Video specific standards MPEG-D A collection of Audio specific standards MPEG-E A standard (M3W) providing support to download and execution of multimedia applications MPEG-H A standard (HEVC) providing a significantly increased video compression performance MPEG-M A standard (MXM) for packaging and reusability of MPEG technologies MPEG-U A standard for rich-media user interface MPEG-V A standard for interchange with virtual worlds Page 24 Multimedia Systems, Mahdi Amiri, Video III Video Coding Standards Video Coding Experts Group (VCEG) Part of study group 16 (Multimedia coding, systems and applications) of the ITU-T. Established in 1984 H.120 The first digital video coding standard H.261 Was the first practical digital video coding standard. H.262 It is identical in content to the video part of the ISO/IEC MPEG-2 standard. H.263 Provided a suitable replacement for H.261 at all bitrates. H.263v2 Also known as H.263+, Enhanced robustness against data loss in the transmission channel. H.264 The ITU-T H.264 standard and the ISO/IEC MPEG-4 Part 10 standard (formally, ISO/IEC 14496- 10) are technically identical. H.265 Not yet developed; expected 2012 or later. H.271 Video back channel messages for conveyance of status information and requests from a video receiver to a video sender. Page 25 Multimedia Systems, Mahdi Amiri, Video III Video Coding Standards H.120 The First Digital Video Coding Standard ITU-T (ex-CCITT) Rec. H.120: 1984 v1 (1984) had conditional replenishment, DPCM, scalar quantization , variable-length coding, switch for quincunx sampling v2 (1988) added motion compensation and background prediction Operated at 1.544 (NTSC) and 2.048 (PAL) Mbits/s Few units made, essentially not in use today Conditional Replenishment : Can signal to leave a block area of the image unchanged, or replace it with new data (using a threshold value). Quincunx sampling : In a digital video system, a sampling structure with an array of samples where
Recommended publications
  • Efficient Multi-Codec Support for OTT Services: HEVC/H.265 And/Or AV1?
    Efficient Multi-Codec Support for OTT Services: HEVC/H.265 and/or AV1? Christian Timmerer†,‡, Martin Smole‡, and Christopher Mueller‡ ‡Bitmovin Inc., †Alpen-Adria-Universität San Francisco, CA, USA and Klagenfurt, Austria, EU ‡{firstname.lastname}@bitmovin.com, †{firstname.lastname}@itec.aau.at Abstract – The success of HTTP adaptive streaming is un- multiple versions (e.g., different resolutions and bitrates) and disputed and technical standards begin to converge to com- each version is divided into predefined pieces of a few sec- mon formats reducing market fragmentation. However, other onds (typically 2-10s). A client first receives a manifest de- obstacles appear in form of multiple video codecs to be sup- scribing the available content on a server, and then, the client ported in the future, which calls for an efficient multi-codec requests pieces based on its context (e.g., observed available support for over-the-top services. In this paper, we review the bandwidth, buffer status, decoding capabilities). Thus, it is state of the art of HTTP adaptive streaming formats with re- able to adapt the media presentation in a dynamic, adaptive spect to new services and video codecs from a deployment way. perspective. Our findings reveal that multi-codec support is The existing different formats use slightly different ter- inevitable for a successful deployment of today's and future minology. Adopting DASH terminology, the versions are re- services and applications. ferred to as representations and pieces are called segments, which we will use henceforth. The major differences between INTRODUCTION these formats are shown in Table 1. We note a strong differ- entiation in the manifest format and it is expected that both Today's over-the-top (OTT) services account for more than MPEG's media presentation description (MPD) and HLS's 70 percent of the internet traffic and this number is expected playlist (m3u8) will coexist at least for some time.
    [Show full text]
  • CALIFORNIA STATE UNIVERSITY, NORTHRIDGE Optimized AV1 Inter
    CALIFORNIA STATE UNIVERSITY, NORTHRIDGE Optimized AV1 Inter Prediction using Binary classification techniques A graduate project submitted in partial fulfillment of the requirements for the degree of Master of Science in Software Engineering by Alex Kit Romero May 2020 The graduate project of Alex Kit Romero is approved: ____________________________________ ____________ Dr. Katya Mkrtchyan Date ____________________________________ ____________ Dr. Kyle Dewey Date ____________________________________ ____________ Dr. John J. Noga, Chair Date California State University, Northridge ii Dedication This project is dedicated to all of the Computer Science professors that I have come in contact with other the years who have inspired and encouraged me to pursue a career in computer science. The words and wisdom of these professors are what pushed me to try harder and accomplish more than I ever thought possible. I would like to give a big thanks to the open source community and my fellow cohort of computer science co-workers for always being there with answers to my numerous questions and inquiries. Without their guidance and expertise, I could not have been successful. Lastly, I would like to thank my friends and family who have supported and uplifted me throughout the years. Thank you for believing in me and always telling me to never give up. iii Table of Contents Signature Page ................................................................................................................................ ii Dedication .....................................................................................................................................
    [Show full text]
  • Computational Complexity Reduction of Intra−Frame Prediction in Mpeg−2
    COMPUTATIONAL COMPLEXITY REDUCTION OF INTRA-FRAME PREDICTION IN MPEG-2/H.264 VIDEO TRANSCODERS Gerardo Fern ndez-Escribano, Pedro Cuenca, Luis Orozco-Barbosa and Antonio Garrido Instituto de Investigacin en Infor tica Universidad de Castilla-La Mancha, Ca pus Universitario, 02071 Albacete, SPAIN ,gerardo,pcuenca,lorozco,antonio-.info-ab.ucl .es ABSTRACT 1 :ualit1 video at about 1/3 to 1/2 of the bitrates offered b1 the MPEG-2 is the most widely digital video-encoding standard in MPEG-2 for at. 2hese significant band0idth savings open the use nowadays. It is being widely used in the development and ar5et to ne0 products and services, including 6423 services deployment of digital T services, D D and video-on-demand at lo0er bitrates. Further ore, given the relativel1 earl1 stage of services. However, recent developments have given birth to the video services in obile phones, obile phones 0ill be one of H.264/A C, offering better bandwidth to video )uality ratios the first ar5et seg ents to adopt 6.278 video. 6o0ever, these than MPEG2. It is expected that the H.264/A C will take over gains co e 0ith a significant increase in encoding and decoding the digital video market, replacing the use of MPEG-2 in most co pleAit1 ,3-. digital video applications. The complete migration to the new Bhile the 6.278 video standard is eApected to replace video-coding algorithm will take several years given the wide MPEG-2 video over the neAt several 1ears, a significant a ount scale use of MPEG-2 in the market place today.
    [Show full text]
  • RULING CODECS of VIDEO COMPRESSION and ITS FUTURE 1Dr.Manju.V.C, 2Apsara and 3Annapoorna
    International Journal of Application or Innovation in Engineering & Management (IJAIEM) Web Site: www.ijaiem.org Email: [email protected] Volume 8, Issue 7, July 2019 ISSN 2319 - 4847 RULING CODECS OF VIDEO COMPRESSION AND ITS FUTURE 1Dr.Manju.v.C, 2Apsara and 3Annapoorna 1Professor,KSIT 2Under-Gradute, Telecommunication Engineering, KSIT 3Under-Gradute, Telecommunication Engineering, KSIT Abstract Development of economical video codecs for low bitrate video transmission with higher compression potency has been a vigorous space of analysis over past years and varied techniques are projected to fulfill this need. Video Compression is a term accustomed outline a technique for reducing the information accustomed cipher digital video content. This reduction in knowledge interprets to edges like smaller storage needs and lower transmission information measure needs, for a clip of video content. In this paper, we survey different video codec standards and also discuss about the challenges in VP9 codec Keywords: Codecs, HEVC, VP9, AV1 1. INTRODUCTION Storing video requires a lot of storage space. To make this task manageable, video compression technology was developed to compress video, also known as a codec. Video coding techniques provide economic solutions to represent video data in a more compact and stout way so that the storage and transmission of video will be completed in less value in less cost in terms of bandwidth, size and power consumption. This technology (video compression) reduces redundancies in spatial and temporal directions. Spatial reduction physically reduces the size of the video data by selectively discarding up to a fourth or more of unneeded parts of the original data in a frame [1].
    [Show full text]
  • Platforms for Handling and Development of Audiovisual Data
    Faculdade de Engenharia da Universidade do Porto Platforms for Handling and Development of Audiovisual Data José Pedro Sousa Horta Project/Dissertation concerning Mestrado Integrado em Engenharia Informática Advisor: Professor Eurico Carrapatoso July 2008 © José Pedro Sousa Horta, 2008 Platforms for Handling and Development of Audiovisual Data José Pedro Sousa Horta Project/Dissertation concerning Mestrado Integrado em Engenharia Informática Approved in public display by the jury: President: Prof. Doutor António Fernando Coelho _________________________________________________ Examiner: Prof. Doutor Nuno Magalhães Ribeiro Vowel: Prof. Doutor Eurico Manuel Carrapatoso July 17th 2008 Abstract Everywhere around us, digital is replacing analogue. Such is especially true in the audiovisual: be it in consumer or professional market, the advantages of computer-based media have quickly persuaded investors. To choose in which technologies, software should be based, proper understanding of the major aspects behind digital media is essential. An overview of the state of the art regarding the compression of digital video, the existing container formats and the multimedia frameworks (that enable easier playback and editing of audiovisual content) will be given, preceded by an introduction of the main topics. The professional video market is particularly focused on this document, due to the context of its elaboration, predominantly MPEG-2, MXF and DirectShow. In order to obtain more accurate results, fruit of the practical contact with the technology, a hands-on programming project was accomplished using DirectShow to playback both local and remote WMV files. Compression algorithms are continuously being improved and new ones invented, the designers of container formats are paying increased attention to metadata and some serious alternatives to DirectShow are emerging.
    [Show full text]
  • Implementation of H.264/Mpeg-4 Advanced Video Coding Standard
    Indian Journal of Emerging Electronics in Computer Communications, Volume.3, Issue.1, 2016, Page.538-541 www.scientistlink.com/ijeecc Implementation of H.264/Mpeg-4 Advanced Video Coding Standard A.Princy1, C.Sridhar2 1PG Scholar, 2Assistant Professor Electronics and Communication Engineering S.K.R Engineering college, Chennai, India. [email protected],2 [email protected] Abstract: H.264/AVC is newest video coding standard of the ITU-T Video Coding Experts Group and the ISO/IEC Moving Picture Experts Group. The main goals of the H.264/AVC standardization effort have been enhanced compression performance and provision of a “network-friendly” video representation addressing “conversational” (video telephony) and “nonconversational” (storage, broadcast, or streaming) applications.mH.264/AVC has achieved a significant improvement in rate-distortion efficiency relative to existing standards. This article provides an overview of the technical features of H.264/AVC, describes profiles and applications for the standard, and outlines the history of the standardization process. It covers all common video applications ranging from mobile services and videoconferencing to IPTV, HDTV, and HD video storage. This article discusses the technology behind the new H.264/MPEG4-AVC standard, focusing on the main distinct features of its core coding technology and its first set of extensions, known as the fidelity range extensions. In addition, this article also discusses the current status of adoption and deployment of the new standard in various application areas. Keywords- Advanced video codec, H.264, High Definition,MPEG-4,Standard Definition. I. INTRODUCTION Video processing systems require high performance processing elements and an efficient memory hierarchy design to reach real –time performance in the decoding of high definition video sequences.
    [Show full text]
  • H.265 (Hevc) Bitstream to H.264 (Mpeg 4 Avc)
    H.265 (HEVC) BITSTREAM TO H.264 (MPEG 4 AVC) BITSTREAM TRANSCODER by DEEPAK HINGOLE Presented to the Faculty of the Graduate School of The University of Texas at Arlington in Partial Fulfillment of the Requirements for the Degree of MASTER OF SCIENCE IN ELECTRICAL ENGINEERING THE UNIVERSITY OF TEXAS AT ARLINGTON December 2015 Copyright © by Deepak Hingole 2015 All Rights Reserved ACKNOWLEDGEMENTS I would like to express my heartfelt gratitude to my advisor Dr. K. R. Rao for his unwavering support, encouragement, supervision and valuable inputs throughout this research work. He has been a constant source of inspiration for me to pursue this research work. I would also like to extend my gratitude to my colleagues in Adobe Systems Incorporated for their invaluable industrial insight and experience to help me understand and grow in the field of digital video processing. Additionally I would like to thank Dr. Schizas and Dr. Dillon for serving as members of my graduate committee. A big thank you to Vasavee, Rohith, Maitri, Shwetha, Srikanth and Uma, my Multimedia Processing Lab mates for providing valuable suggestions during the course of my research work. Last but not the least; I would like to thank my parents, my siblings and my close friends for believing in me and supporting me in this undertaking. I wish for your continued support in future. November 25, 2015 ABSTRACT H.265 (HEVC) BITSTREAM TO H.264 (MPEG 4 AVC) BITSTREAM TRANSCODER Deepak Hingole, MS The University of Texas at Arlington, 2015 Supervising Professor: K. R. Rao With every new video coding standard the general rule of thumb has been to maintain same video quality at a reduced bit rate of about 50% as compared to the previous standard.
    [Show full text]
  • Multimedia Production
    Multimedia Production Lecture 4 Digital Video Dr. Somsak Phattarasukol Faculty of Management Science UDON THANI RAJABHAT UNIVERSITY Digital Video ● Digital video is a representation of moving visual images in the form of encoded digital data. – the images being displayed are called frames – the rate at which frames are displayed in is called frame rate – the number of bits used to indicate a pixel's color is called color depth Sources: Wikipedia.org, dpbestflow.org, Mediacollege.com and Videomaker.com Digital Video ● An example video has – a duration of 1 hour (3600 sec), – a frame size of 640 x 480 pixels, – a color depth of 24 bits and – a frame rate of 25 frames per second would have a file size of 3600 x 640 x 480 x 24 x 25 = 82.8 Gbytes Sources: Wikipedia.org, dpbestflow.org, Mediacollege.com and Videomaker.com Digital Video Files ● Digital video file is a file for storing digital video and other data on a computer system. – It normally consists of a container containing encoded video data alongside encoded audio data as well as other data such as synchronization information and subtitles. – A program (or hardware) that encodes and decodes video or audio is called a codec Sources: Wikipedia.org, dpbestflow.org, Mediacollege.com and Videomaker.com Container ● A container is a metafile describing how different elements of data and metadata coexist in a computer file. – an example of image containers: TIFF – an example of audio containers: WAV – examples of multimedia containers: AVI, MP4, Ogg Sources: Wikipedia.org, dpbestflow.org, Mediacollege.com and Videomaker.com Codec ● A codec is a device or computer program capable of encoding or decoding a digital data stream or signal.
    [Show full text]
  • Analysis of Video Compression Technique IR.Mahalakshmi, Iidr.S.K.Mahendran Idept
    International Journal of Advanced Research in ISSN : 2347 - 8446 (Online) Computer Science & Technology (IJARCST 2017) Vol. 5, Issue 1 (Jan. - Mar. 2017) ISSN : 2347 - 9817 (Print) Analysis of Video Compression Technique IR.Mahalakshmi, IIDr.S.K.Mahendran IDept. of Computer Science, Madurai Kamaraj University College, India IIDept. of Computer Science, Bharathiar University, India Abstract Video compression has been the object of intensive research in the last thirty years. Video compression technique is now mature as is proven by the large number of applications that make use of this technology. This paper gives the idea about different techniques available for video compression. H.264/AVC exhibits superior coding performance improvement over its predecessors. The next generation standards are being generated by both VCEG and MPEG. In this paper, we also summarized the progress of those next generation video coding standard projects and existing new video coding techniques. In this paper is organized as describes the various basic techniques available for the video compression; the implementation strategies of video compression techniques; various generations ; latest trend in video compression; different issues related to emerging technologies. Keywords H.264/AVC, JVT, Video Codec, Issues, KTA, MPEG, VCEG. using the DWT . 2 The second one is to use a full 3-D wavelet Introduction decomposition. These approaches have reported coding effi- Video coding techniques provide efficient solutions to represent ciency improvements with respect to the hybrid schemes. These video data in a more compact and robust way so that the storage schemes are intended to provide further functionalities such as and transmission of video can be realized in less cost in terms scalability and progressive transmission.
    [Show full text]
  • An Overview of Core Coding Tools in the AV1 Video Codec
    An Overview of Core Coding Tools in the AV1 Video Codec Yue Chen∗, Debargha Murherjee∗, Jingning Han∗, Adrian Grange∗, Yaowu Xu∗, Zoe Liu∗, Sarah Parker∗, Cheng Chen∗, Hui Su∗, Urvang Joshi∗, Ching-Han Chiang∗, Yunqing Wang∗, Paul Wilkins∗, Jim Bankoski∗, Luc Trudeauy, Nathan Eggey, Jean-Marc Valiny, Thomas Daviesz, Steinar Midtskogenz, Andrey Norkinx and Peter de Rivaz{ ∗Google, USA yMozilla, USA zCisco, UK and Norway xNetflix, USA {Argon Design, UK VP9 Abstract—AV1 is an emerging open-source and royalty-free video R: Recursive compression format, which is jointly developed and finalized in early 64x64 2018 by the Alliance for Open Media (AOMedia) industry consortium. The main goal of AV1 development is to achieve substantial compression R R gain over state-of-the-art codecs while maintaining practical decoding R R complexity and hardware feasibility. This paper provides a brief technical overview of key coding techniques in AV1 along with preliminary 128x128 compression performance comparison against VP9 and HEVC. Index Terms—Video Compression, AV1, Alliance for Open Media, Open-source Video Coding AV1 I. INTRODUCTION R R Video applications have become ubiquitous on the internet over R R the last decade, with modern devices driving a high growth in Fig. 1. Partition tree in VP9 and AV1 the consumption of high resolution, high quality content. Services II. AV1 CODING TECHNIQUES such as video-on-demand and conversational video are predominant A. Coding Block Partition bandwidth consumers, that impose severe challenges on delivery infrastructures and hence create even stronger need for high efficiency VP9 uses a 4-way partition tree starting from the 64×64 level video compression technology.
    [Show full text]
  • 11-755— Spring 2021 Large Scale Multimedia Processing
    11-755— Spring 2021 Large Scale Multimedia Processing 1 Lecture 5/6 Multimedia capture and storage Rita Singh Carnegie Mellon University In this lecture • Digital multimedia: Recording and devices – Audio – Images – Video – Text • Digital multimedia: Processing – Audio processing – Two generic processing techniques 3 The first video: 1877 • https://www.telegraph.co.uk/culture/culturevideo/8035681/The-worlds-first-films.html • The footage was shot from 1877, and projected by Muybridge's 'Zoopraxiscope' invented two years later in 1879. The dates mean that he was able to make and show films 16 years before the Lumiere Brothers used the more commercially successful cinematograph to popularise the art form. • The dozen films are only a few seconds long and were commissioned to illustrate Victorian experiments in animal and human motion. Instead of using celluloid passed across a lens for a fraction of a second, Muybridge spaced out still cameras triggered by tripwires broken when the animals ran along, then projected the resulting sequence. 4 Video cameras • Analog video/movie cameras: – Mechanical devices that captured images on moving film (multiple photographs per second on a continuous film) Movie Camera 35mm Mitchell Film Original, circa 1940s 5 Video cameras • Digital video/movie cameras: – Just like still image cameras – Take multiple shots automatically at a fixed rate per second – The rate is decided based on persistence of vision – Usually 30 photos/second 6 Video cameras • What happens to the sound? – Sound is recorded alongside and time synchronized with the photo frames https://zaxcom.com/the-hobbit-a-production-sound- perspective/ 7 What happens to the sound? shotgun microphones - boom mics for movies • Each film created its own set of challenges, but Johnson and his two boom operators, Corrin Ellingford and Steven Harris, consistently took the time to wire all the actors each day.
    [Show full text]
  • Evaluation of 4D Light Field Compression Methods
    Evaluation of 4D Light Field Compression Methods David Barina Tomas Chlubna Marek Solony Drahomir Dlabaja Pavel Zemcik Centre of Excellence IT4Innovations Faculty of Information Technology Brno University of Technology Bozetechova 1/2, Brno Czech Republic {ibarina,ichlubna,isolony,zemcik}@fit.vutbr.cz Abstract Light field data records the amount of light at multiple points in space, captured e.g. by an array of cameras or by a light-field camera that uses microlenses. Since the storage and transmission requirements for such data are tremendous, compression techniques for light fields are gaining momentum in recent years. Although plenty of efficient compression formats do exist for still and moving images, only a little research on the impact of these methods on light field imagery is performed. In this paper, we evaluate the impact of state-of-the-art image and video compression methods on quality of images rendered from light field data. The methods include recent video compression standards, especially AV1 and XVC finalised in 2018. To fully exploit the potential of common image compression methods on four-dimensional light field imagery, we have extended these methods into three and four dimensions. In this paper, we show that the four-dimensional light field data can be compressed much more than independent still images while maintaining the same visual quality of a perceived picture. We gradually compare the compression performance of all image and video compression methods, and eventually answer the question, "What is the best compression method for light field data?". Keywords Light field, Plenoptic imaging, Lossy compression, Image refocusing 1 INTRODUCTION To describe a three-dimensional scene from any possible (x;y;z) viewing position at any viewing angle, one could define f a plenoptic function P(x;y;z;f;y), where the (x;y;z) is the position and (f;y) is a viewing angle (in spherical coordinates) of a camera.
    [Show full text]