A Comparison of JEM and AV1 with HEVC

Total Page:16

File Type:pdf, Size:1020Kb

A Comparison of JEM and AV1 with HEVC A Comparison of JEM and AV1 with HEVC Thorsten Laude Leibniz Universität Hannover Institut für Informationsverarbeitung Overview Coding Coding Complexity Tools Efficiency Thorsten Laude 2 [email protected] History of Video Codecs ISO/IEC/ITU-T 1980s 1990s 2000s 2010s H.120 MPEG-1 AVC HEVC H.261 MPEG-2/H.262 (MPEG-4 Part 10/ (MPEG-H JEM H.263 H.264) Part 2/ MPEG-4 Part 2 H.265) Contenders 1980s 1990s 2000s 2010s VC-X TrueMotion S/RT/2 Dirac VP9 Real Video VP3-7 Daala AV1 VP8 Thor Real Video Real Video Comparison of the latest video codecs (JEM/AV1) with HEVC Thorsten Laude 3 [email protected] On the Difficulty of Comparing Video Codecs AV1 is up to 43% better than HEVC HEVC is 30% better than AV1 Source: Feldmann, “Multi-Codec DASH Source: Grois et al., “Performance Dataset: An Evaluation of AV1, AVC, HEVC Comparison of AV1, JEM, VP9 and HEVC and VP9”, Bitmovin Blog, 2018 Encoders”, Proceedings of SPIE, 2017 “In terms of PSNR, the average BD-rate savings of AV1 relative to […] x264 high […] are […] 45.8% […]On the other hand, the encoding computational complexity […] was increased by factors of […] 5869.9x” Source: Liu, “AV1 beats x264 and libvpx-vp9 in practical Source: Akyazi and Ebrahimi, “Comparison of use cases”, Facebook Blog, 2018 compression efficiency between HEVC/H.265 and VP9 based on subjective assessments”, QoMEX, 2018 Thorsten Laude 4 [email protected] On the Difficulty of Comparing Video Codecs • Reference implementations Standard vs. Encoder (HM/JEM/aomenc) • Optimized encoders (x264/x265) • Computing resources Codecs perform differently good Codec Codec • Applications: e.g. Sequences for different content Comparison Configurations Broadcasting, VoD, Social Media • Tuning (e.g. PSNR, visual • Luma BD-rates, weighted BD-rates Metrics • Quality metrics: PSNR, SSIM, VMAF • Subjective Tests Thorsten Laude 5 [email protected] Test Conditions for this Comparison Class Sequence • Reference implementations Tango2 Standard vs. A1 Drums100 Encoder (HM/JEM/aomenc) (4K) Campfire ToddlerFountain2 CatRobot A2 TrafficFlow (4K) DaylightRoad2 Rollercoaster2 Kimono B ParkScene • HM/JEM: Common (1080p) Cactus BasketballDrive Codec Codec Test Conditions (CTC) Sequences BQTerrace Comparison Configurations • AV1 BasketballDrill --auto-alt-ref=1 --psnr C BQMall --tune=psnr --i420 -p 1 (WVGA) PartyScene -t 1 --fps=<?> --bit- RaceHorses depth=<?> --input-bit- BasketballPass depth=<?> --cq- D BQSquare level=<?> --kf-min- (WQVGA) BlowingBubbles dist=<?> --kf-max- RaceHorses dist=<?> -w <?> -h <?> E FourPeople (720p) Johnny Metrics • Luma BD-rates KristenAndSara • Quality metrics: PSNR F BasketballDrillText (Screen/ ChinaSpeed Mixed SlideEditing Content) SlideShow Thorsten Laude 6 [email protected] Coding Tools JEM AV1 Partitioning Partitioning • Quaternary and binary splits • Quaternary and binary splits • Bigger block size • Bigger block size Inter coding Inter coding • Overlapped block motion compensation • Overlapped block motion compensation • Higher order motion model • Higher order motion models • Sub-CU MV prediction • Wedge mode partitioning • Compound intra-inter prediction Intra coding Intra Coding • Additional directions • Directional, Paeth, Smooth prediction • Cross-component linear model • Intra block copy • Palette mode Transform coding Transform coding • Adaptive multiple transforms • DCT, DST, Identity • Non-separable secondary transform • Independent horizontal/vertical • Signal-dependent transform transforms Thorsten Laude 7 [email protected] Coding Efficiency 75% 55% 35% 15% rate - Better BD -5% -25% -45% -65% JEM vs. HM JEM vs. AV1 AV1 vs. HM JEM vs. HM JEM vs. AV1 AV1 vs.HM All-intra Random Access Thorsten Laude 8 [email protected] Encoder Runtimes 60 e.g. 10 50 Total CPU time: ≈ 1 decade frames/day 40 30 Better Complexity increase Complexity 20 10 0 JEM AV1 JEM AV1 All-intra Random Access Class A1 Class A2 Class B Class C Class D Class E Class F Overall HM Relative factors to HM, i.e. HM=1 Thorsten Laude 9 [email protected] Decoder Runtimes 14 12 10 8 Better 6 Complexity increase Complexity 4 2 0 JEM AV1 JEM AV1 All-intra Random Access Class A1 Class A2 Class B Class C Class D Class E Class F Overall HM Relative factors to HM, i.e. HM=1 Thorsten Laude 10 [email protected] Runtime-memory Complexity Thorsten Laude 11 [email protected] Trade-off Coding Efficiency vs. Complexity Better Better Thorsten Laude 12 [email protected] Summary Coding Efficiency Runtimes Comparison vs. HM Comparison vs. HM All intra (AI) Encoder JEM: 20% gain JEM: 39×(AI)/10×(RA) slower AV1: 4% gain AV1: 9×(AI)/32×(RA) slower Random Access (RA) Decoder JEM: 28% gain JEM: 3×(AI)/7×(RA) slower AV1: 38% loss AV1: 2×faster (AI)/same (RA) Closing remarks • Results are a snapshot of summer 2017 → AV1 finalization in March 2018 and JVET CfP evaluation in April 2018 • Since last summer, AV1 has gained additional 5% (based on 80 preliminary data points) • Complexity: Reference implementations vs. product implementations Thorsten Laude 13 [email protected] Details: Laude, T., Adhisantoso, Y. G., Voges, J., Munderloh, M., & Ostermann, J. (2018). A Comparison of JEM and AV1 with HEVC: Coding Tools , Coding Efficiency and Complexity. In Picture Coding Symposium (PCS). Dipl.-Ing. Thorsten Laude 14 [email protected].
Recommended publications
  • Download Media Player Codec Pack Version 4.1 Media Player Codec Pack
    download media player codec pack version 4.1 Media Player Codec Pack. Description: In Microsoft Windows 10 it is not possible to set all file associations using an installer. Microsoft chose to block changes of file associations with the introduction of their Zune players. Third party codecs are also blocked in some instances, preventing some files from playing in the Zune players. A simple workaround for this problem is to switch playback of video and music files to Windows Media Player manually. In start menu click on the "Settings". In the "Windows Settings" window click on "System". On the "System" pane click on "Default apps". On the "Choose default applications" pane click on "Films & TV" under "Video Player". On the "Choose an application" pop up menu click on "Windows Media Player" to set Windows Media Player as the default player for video files. Footnote: The same method can be used to apply file associations for music, by simply clicking on "Groove Music" under "Media Player" instead of changing Video Player in step 4. Media Player Codec Pack Plus. Codec's Explained: A codec is a piece of software on either a device or computer capable of encoding and/or decoding video and/or audio data from files, streams and broadcasts. The word Codec is a portmanteau of ' co mpressor- dec ompressor' Compression types that you will be able to play include: x264 | x265 | h.265 | HEVC | 10bit x265 | 10bit x264 | AVCHD | AVC DivX | XviD | MP4 | MPEG4 | MPEG2 and many more. File types you will be able to play include: .bdmv | .evo | .hevc | .mkv | .avi | .flv | .webm | .mp4 | .m4v | .m4a | .ts | .ogm .ac3 | .dts | .alac | .flac | .ape | .aac | .ogg | .ofr | .mpc | .3gp and many more.
    [Show full text]
  • (A/V Codecs) REDCODE RAW (.R3D) ARRIRAW
    What is a Codec? Codec is a portmanteau of either "Compressor-Decompressor" or "Coder-Decoder," which describes a device or program capable of performing transformations on a data stream or signal. Codecs encode a stream or signal for transmission, storage or encryption and decode it for viewing or editing. Codecs are often used in videoconferencing and streaming media solutions. A video codec converts analog video signals from a video camera into digital signals for transmission. It then converts the digital signals back to analog for display. An audio codec converts analog audio signals from a microphone into digital signals for transmission. It then converts the digital signals back to analog for playing. The raw encoded form of audio and video data is often called essence, to distinguish it from the metadata information that together make up the information content of the stream and any "wrapper" data that is then added to aid access to or improve the robustness of the stream. Most codecs are lossy, in order to get a reasonably small file size. There are lossless codecs as well, but for most purposes the almost imperceptible increase in quality is not worth the considerable increase in data size. The main exception is if the data will undergo more processing in the future, in which case the repeated lossy encoding would damage the eventual quality too much. Many multimedia data streams need to contain both audio and video data, and often some form of metadata that permits synchronization of the audio and video. Each of these three streams may be handled by different programs, processes, or hardware; but for the multimedia data stream to be useful in stored or transmitted form, they must be encapsulated together in a container format.
    [Show full text]
  • Spb Tv Encoder
    SPB TV ENCODER INPUT IP Input Interfaces 2x Gigabit Ethernet SPB TV Encoder is a professional solution for the fast, high- that processes multi-format video streams for delivery to different quality transcoding of live and on-demand video streams from end-user devices: Mobile, Tablet, Desktop, and connected TV-sets. Optional Digital Input Interfaces SDI, DVB-S a single headend to multi-screen devices. Leverage your network SPB TV Encoding solution can be delivered in a flexible software- Input IP Streams UDP Unicast/Multicast, HTTP, HLS, RTSP, RTMP, MMS with a unified platform for IP TV, OTT TV and mobile TV services based configuration or as off the-shelf equipment. Supported Codecs MPEG-2, MPEG-4, H.264, MPEG-1, MPEG-1 Layer 3, AAC, HE AAC, AC3, WMV 7, WMV 8, WMV 9, WMA 1, WMA 2, WMA 9 Pro, On2 VP3, On2 VP5, On2 VP6, On2 VP8, PNG Filters Video cropping, Image insertion on input source loss, automatic audio loudness adjustment Up to 100 multi-rate channels FEC (Forward Error Correction) OUTPUT Live and on-demand video encoding 5.1 Audio, auto audio gain support IP Output Interfaces 2x Gigabit Ethernet Remote monitoring tools Keeping aspect ratio Output IP streams RTP Unicast/Multicast Image insertion on input loss Image overlay Output Codecs H.264 (Baseline, Main, High), MPEG-4, AAC, HE AAC, H.265 Web GUI and CLI interfaces Hardsubs overlay VoD Output Multi Track MP4 VoD Publishing FTP, SFTP SNMP 3D encoding support Thumbnails PNG, JPEG Managed updates to include the latest Multi output support codecs, formats, profiles and devices MANAGEMENT/MONITORING
    [Show full text]
  • Voxpro/Audionlabs Was Acquired by Wheatstone Corporation in the Fall of 2015
    NOTE: VoxPro/AudionLabs was acquired by Wheatstone Corporation in the fall of 2015. As a result the contact information and support links listed in this legacy document are no longer current. Should you require further assistance regarding the information presented here, please refer to the following contact info: EMAIL: [email protected] TEL: +1.252-638-7000 (9am-5pm EST) WEB: wheatstone.com [menu item: VoxPro] MAIL: Wheatstone Corporation 600 Industrial Drive New Bern, NC. 28562 USA 100716 Contents 3 Table of Contents 0 Chapter 1 Introduction 8 Chapter 2 Getting Started 10 1 VoxPro Main................................................................................................................................... Window 11 2 VoxPro Control................................................................................................................................... Panel 12 3 User Accounts................................................................................................................................... 13 4 Passwords ................................................................................................................................... 14 Chapter 3 Recording 16 1 Create New ...................................................................................................................................Recording 16 2 Create Empty................................................................................................................................... Recording 18 3 Create Recording..................................................................................................................................
    [Show full text]
  • Codec Is a Portmanteau of Either
    What is a Codec? Codec is a portmanteau of either "Compressor-Decompressor" or "Coder-Decoder," which describes a device or program capable of performing transformations on a data stream or signal. Codecs encode a stream or signal for transmission, storage or encryption and decode it for viewing or editing. Codecs are often used in videoconferencing and streaming media solutions. A video codec converts analog video signals from a video camera into digital signals for transmission. It then converts the digital signals back to analog for display. An audio codec converts analog audio signals from a microphone into digital signals for transmission. It then converts the digital signals back to analog for playing. The raw encoded form of audio and video data is often called essence, to distinguish it from the metadata information that together make up the information content of the stream and any "wrapper" data that is then added to aid access to or improve the robustness of the stream. Most codecs are lossy, in order to get a reasonably small file size. There are lossless codecs as well, but for most purposes the almost imperceptible increase in quality is not worth the considerable increase in data size. The main exception is if the data will undergo more processing in the future, in which case the repeated lossy encoding would damage the eventual quality too much. Many multimedia data streams need to contain both audio and video data, and often some form of metadata that permits synchronization of the audio and video. Each of these three streams may be handled by different programs, processes, or hardware; but for the multimedia data stream to be useful in stored or transmitted form, they must be encapsulated together in a container format.
    [Show full text]
  • Download the Inspector Product Sheet (Pdf)
    INSPECTOR Because your lab only has so many people... SSIMPLUS VOD Monitor Inspector is the only video quality measurement software with the algorithm trusted by Hollywood to determine the best possible configuration for R&D groups, engineers and architects who set up VOD encoding and processing workflows or make purchasing recommendations. Video professionals can evaluate more encoders and transcoders with the fastest and most comprehensive solution in the business. From the start of your workflow to delivering to consumer endpoints, VOD Monitor Inspector is here to help ensure every step along the way works flawlessly. Easy-to-use tools provide: A/B testing for encoding configurations and purchasing decisions Sandbox environment for encoder or transcoder output troubleshooting “The SSIMPLUS score developed Creation of custom templates to identify best practices for specific content libraries by SSIMWAVE represents a Configurable automation to save time and eliminate manual QA/QC generational breakthrough in Side-by-side visual inspector to subjectively assess degradations the video industry.” Perceptual quality maps that provide pixel level graphic visualization –The Television Academy of content impairments Allows you to optimize network performance and improve quality Our Emmy Award-winning SSIMPLUS™ score mimics the accuracy of 100,000 human eyes. Know the score when it YOU CAN HOW OUR SEE THE SOFTWARE SEES DEGRADATION THE DEGRADATION comes to video quality NARROW IT DOWN TO THE The SSIMPLUS score is the most accurate measurement PIXEL LEVEL representing how end-viewers perceive video quality. Our score can tell exactly where video quality degrades. 18 34 59 72 87 10 20 30 40 50 60 70 80 90 100 BAD POOR FAIR GOOD EXCELLENT Helping your workflow, work SSIMPLUS VOD Monitor Inspector helps ensure your video infrastructure is not negatively impacting content anywhere in your workflow.
    [Show full text]
  • 20130218 Technical White Paper.Bw.Jp
    Sending, Storing & Sharing Video With latakoo © Copyright latakoo. All rights reserved. Revised 11/12/2012 Table of contents Table of contents ................................................................................................... 1 1. Introduction ...................................................................................................... 2 2. Sending video & files with latakoo ...................................................................... 3 2.1 The latakoo app ............................................................................................ 3 2.2 Compression & upload ................................................................................. 3 2.3 latakoo minutes ............................................................................................ 4 3. The latakoo web interface .................................................................................. 4 3.1 Web interface requirements .......................................................................... 4 3.2 Logging into the dashboard .......................................................................... 4 3.3 Streaming video previews ............................................................................. 4 3.4 Downloading video ...................................................................................... 5 3.5 latakoo Pilot ................................................................................................. 5 3.6 Search and tagging ......................................................................................
    [Show full text]
  • Input Formats & Codecs
    Input Formats & Codecs Pivotshare offers upload support to over 99.9% of codecs and container formats. Please note that video container formats are independent codec support. Input Video Container Formats (Independent of codec) 3GP/3GP2 ASF (Windows Media) AVI DNxHD (SMPTE VC-3) DV video Flash Video Matroska MOV (Quicktime) MP4 MPEG-2 TS, MPEG-2 PS, MPEG-1 Ogg PCM VOB (Video Object) WebM Many more... Unsupported Video Codecs Apple Intermediate ProRes 4444 (ProRes 422 Supported) HDV 720p60 Go2Meeting3 (G2M3) Go2Meeting4 (G2M4) ER AAC LD (Error Resiliant, Low-Delay variant of AAC) REDCODE Supported Video Codecs 3ivx 4X Movie Alaris VideoGramPiX Alparysoft lossless codec American Laser Games MM Video AMV Video Apple QuickDraw ASUS V1 ASUS V2 ATI VCR-2 ATI VCR1 Auravision AURA Auravision Aura 2 Autodesk Animator Flic video Autodesk RLE Avid Meridien Uncompressed AVImszh AVIzlib AVS (Audio Video Standard) video Beam Software VB Bethesda VID video Bink video Blackmagic 10-bit Broadway MPEG Capture Codec Brooktree 411 codec Brute Force & Ignorance CamStudio Camtasia Screen Codec Canopus HQ Codec Canopus Lossless Codec CD Graphics video Chinese AVS video (AVS1-P2, JiZhun profile) Cinepak Cirrus Logic AccuPak Creative Labs Video Blaster Webcam Creative YUV (CYUV) Delphine Software International CIN video Deluxe Paint Animation DivX ;-) (MPEG-4) DNxHD (VC3) DV (Digital Video) Feeble Files/ScummVM DXA FFmpeg video codec #1 Flash Screen Video Flash Video (FLV) / Sorenson Spark / Sorenson H.263 Forward Uncompressed Video Codec fox motion video FRAPS:
    [Show full text]
  • Video - Dive Into HTML5
    Video - Dive Into HTML5 You are here: Home ‣ Dive Into HTML5 ‣ Video on the Web ❧ Diving In nyone who has visited YouTube.com in the past four years knows that you can embed video in a web page. But prior to HTML5, there was no standards- based way to do this. Virtually all the video you’ve ever watched “on the web” has been funneled through a third-party plugin — maybe QuickTime, maybe RealPlayer, maybe Flash. (YouTube uses Flash.) These plugins integrate with your browser well enough that you may not even be aware that you’re using them. That is, until you try to watch a video on a platform that doesn’t support that plugin. HTML5 defines a standard way to embed video in a web page, using a <video> element. Support for the <video> element is still evolving, which is a polite way of saying it doesn’t work yet. At least, it doesn’t work everywhere. But don’t despair! There are alternatives and fallbacks and options galore. <video> element support IE Firefox Safari Chrome Opera iPhone Android 9.0+ 3.5+ 3.0+ 3.0+ 10.5+ 1.0+ 2.0+ But support for the <video> element itself is really only a small part of the story. Before we can talk about HTML5 video, you first need to understand a little about video itself. (If you know about video already, you can skip ahead to What Works on the Web.) ❧ http://diveintohtml5.org/video.html (1 of 50) [6/8/2011 6:36:23 PM] Video - Dive Into HTML5 Video Containers You may think of video files as “AVI files” or “MP4 files.” In reality, “AVI” and “MP4″ are just container formats.
    [Show full text]
  • Supported Codecs and Formats Codecs
    Supported Codecs and Formats Codecs: D..... = Decoding supported .E.... = Encoding supported ..V... = Video codec ..A... = Audio codec ..S... = Subtitle codec ...I.. = Intra frame-only codec ....L. = Lossy compression .....S = Lossless compression ------- D.VI.. 012v Uncompressed 4:2:2 10-bit D.V.L. 4xm 4X Movie D.VI.S 8bps QuickTime 8BPS video .EVIL. a64_multi Multicolor charset for Commodore 64 (encoders: a64multi ) .EVIL. a64_multi5 Multicolor charset for Commodore 64, extended with 5th color (colram) (encoders: a64multi5 ) D.V..S aasc Autodesk RLE D.VIL. aic Apple Intermediate Codec DEVIL. amv AMV Video D.V.L. anm Deluxe Paint Animation D.V.L. ansi ASCII/ANSI art DEVIL. asv1 ASUS V1 DEVIL. asv2 ASUS V2 D.VIL. aura Auravision AURA D.VIL. aura2 Auravision Aura 2 D.V... avrn Avid AVI Codec DEVI.. avrp Avid 1:1 10-bit RGB Packer D.V.L. avs AVS (Audio Video Standard) video DEVI.. avui Avid Meridien Uncompressed DEVI.. ayuv Uncompressed packed MS 4:4:4:4 D.V.L. bethsoftvid Bethesda VID video D.V.L. bfi Brute Force & Ignorance D.V.L. binkvideo Bink video D.VI.. bintext Binary text DEVI.S bmp BMP (Windows and OS/2 bitmap) D.V..S bmv_video Discworld II BMV video D.VI.S brender_pix BRender PIX image D.V.L. c93 Interplay C93 D.V.L. cavs Chinese AVS (Audio Video Standard) (AVS1-P2, JiZhun profile) D.V.L. cdgraphics CD Graphics video D.VIL. cdxl Commodore CDXL video D.V.L. cinepak Cinepak DEVIL. cljr Cirrus Logic AccuPak D.VI.S cllc Canopus Lossless Codec D.V.L.
    [Show full text]
  • Video Encoding with Open Source Tools
    Video Encoding with Open Source Tools Steffen Bauer, 26/1/2010 LUG Linux User Group Frankfurt am Main Overview Basic concepts of video compression Video formats and codecs How to do it with Open Source and Linux 1. Basic concepts of video compression Characteristics of video streams Framerate Number of still pictures per unit of time of video; up to 120 frames/s for professional equipment. PAL video specifies 25 frames/s. Interlacing / Progressive Video Interlaced: Lines of one frame are drawn alternatively in two half-frames Progressive: All lines of one frame are drawn in sequence Resolution Size of a video image (measured in pixels for digital video) 768/720×576 for PAL resolution Up to 1920×1080p for HDTV resolution Aspect Ratio Dimensions of the video screen; ratio between width and height. Pixels used in digital video can have non-square aspect ratios! Usual ratios are 4:3 (traditional TV) and 16:9 (anamorphic widescreen) Why video encoding? Example: 52 seconds of DVD PAL movie (16:9, 720x576p, 25 fps, progressive scan) Compression Video codec Raw Size factor Comment 1300 single frames, MotionTarga, Raw frames 1.1 GB - uncompressed HUFFYUV 459 MB 2.2 / 55% Lossless compression MJPEG 60 MB 20 / 95% Motion JPEG; lossy; intraframe only lavc MPEG-2 24 MB 50 / 98% Standard DVD quality X.264 MPEG-4 5.3 MB 200 / 99.5% High efficient video codec AVC Basic principles of multimedia encoding Video compression Lossy compression Lossless (irreversible; (reversible; using shortcomings statistical encoding) in human perception) Intraframe encoding
    [Show full text]
  • Steve Jobs: Mystery Patent Pool to Attack Ogg Theora Email Roasts 'Open Source' Codecs by Cade Metz in San Francisco Posted in Software, 30Th April 2010 20:11 GMT
    Steve Jobs: mystery patent pool to attack Ogg Theora Email roasts 'open source' codecs By Cade Metz in San Francisco Posted in Software, 30th April 2010 20:11 GMT Steve Jobs has apparently claimed that a patent pool is being assembled to "go after" Ogg Theora, the open source video codec used by the latest browsers from Google, Mozilla, and Opera. The claim was made in an email the Apple CEO appears to have sent in response to an open source software advocate who questioned Apple's attachment to the patented H.264 codec. On Thursday, Hugo Roy, an activist with the Free Software Foundation Europe, sent an email to Steve Jobs in response to the Apple CEO's much-discussed open letter on Adobe Flash. The note praised Jobs for his letter - in which he reiterated the company's commitment to HTML5 and other "open standards" - but Roy also made a point of telling Jobs that such talk of open standards should not include H.264. "May I remind you that H.264 is not an open standard?" Roy wrote in a missive also posted [http://blogs.fsfe.org/hugo/2010/04/open-letter-to-steve-jobs/comment-page- 1/#comment-72 ] to the web. "This video codec is covered by patents...This is why Mozilla Firefox and Opera have not adopted this video codec for their HTML5 implementation, and decided to choose Theora as a sustainable and open alternative." Roy sent his note to what is believed to be Steve Jobs' email address - [email protected] - and three hours later, he received a reply.
    [Show full text]