Neural Rate Control for Video Encoding Using Imitation Learning

Total Page:16

File Type:pdf, Size:1020Kb

Neural Rate Control for Video Encoding Using Imitation Learning Neural Rate Control for Video Encoding using Imitation Learning Hongzi Mao†? Chenjie Gu† Miaosen Wang† Angie Chen† Nevena Lazic† Nir Levine† Derek Pang* Rene Claus* Marisabel Hechtman* Ching-Han Chiang* Cheng Chen* Jingning Han* †DeepMind *Google ?MIT CSAIL Abstract module manages the distribution of available bandwidth, usually constrained by the network condition. It directly In modern video encoders, rate control is a critical com- determines how many bits to spend to encode each video ponent and has been heavily engineered. It decides how frame, by assigning a quantization parameter (QP). The many bits to spend to encode each frame, in order to opti- goal of rate control is to maximize the encoding efficiency mize the rate-distortion trade-off over all video frames. This (often measured by the Bjontegaard delta rate) as well as is a challenging constrained planning problem because of maintaining the bitrate under a user-specified target bitrate. the complex dependency among decisions for different video Rate control is a constrained planning problem and can frames and the bitrate constraint defined at the end of the be formulated as a Partially Observable Markov Decision episode. Process. For a particular frame, the QP assignment decision We formulate the rate control problem as a Partially depends on (1) the frame’s spatial and temporal complexity, Observable Markov Decision Process (POMDP), and ap- (2) the previously encoded frames this frame refers to, and ply imitation learning to learn a neural rate control pol- (3) the complexity of future frames. In addition, the bitrate icy. We demonstrate that by learning from optimal video constraint imposes an episodic constraint on the planning encoding trajectories obtained through evolution strategies, problem. The inter-dependency of encoding decisions and our learned policy achieves better encoding efficiency and the episodic constraint make it challenging to engineer op- has minimal constraint violation. In addition to imitat- timal rate control algorithms. ing the optimal actions, we find that additional auxiliary To evaluate the potential of improving the rate control losses, data augmentation/refinement and inference-time policy, we conducted an experiment to measure how much policy improvements are critical for learning a good rate we may increase the Bjontegaard delta rate (BD-rate) com- control policy. We evaluate the learned policy against the pared to the rate control policy in libvpx, a widely adopted rate control policy in libvpx, a widely adopted open source open source VP9 codec for adaptive video streaming ser- VP9 codec library, in the two-pass variable bitrate (VBR) vices, such as YouTube and Netflix. We collected 1600 mode. We show that over a diverse set of real-world videos, videos spanning diverse content types and spatial/temporal our learned policy achieves 8.5% median bitrate reduction complexities. For every video, we applied an evolution without sacrificing video quality. strategy algorithm which maximizes the peak signal-to- arXiv:2012.05339v1 [cs.LG] 9 Dec 2020 noise ratio (PSNR) under libvpx’s two-pass variable-bitrate (VBR) mode. We observe an average 13% improvement 1. Introduction in BD-rate, indicating a big gap between the libvpx’s rate control policy and an optimal rate control policy. In recent years, video traffic keeps growing and consti- tutes over 75% of the Internet traffic [5]. Efficient video In this paper, we use imitation learning to train a neu- encoding is key to improving video quality and alleviating ral rate control policy and demonstrate its effectiveness in backbone Internet traffic pressure for video tasks such as improving encoding efficiency. We first generate a teacher video on demand (VOD) and live streaming decoding. dataset by searching for optimal QP sequences for individ- ual videos. We then train a neural network by imitating In modern video encoders, rate control is a critical com- these optimal QP sequences. However, imitating the opti- ponent that directly modulates the trade-off between rate mal QP sequences by itself fails to learn a good rate control (video size) and distortion (video quality). The rate control policy. In order for the policy to generalize to many videos ?Work done during the internship at DeepMind. Correspondence to and to satisfy the bitrate constraint, we design additional Chenjie Gu <[email protected]>. auxiliary losses and customized hindsight experience replay 1 to train the policy. During inference, we use a truncation VP9 has multiple rate control modes. In this paper, we trick, a simple technique to prune spurious action values focus on the “two-pass, variable bitrate (VBR)” mode [27] and improve encoding efficiency. Finally, we augment the (See Supplementary MaterialsA for more details). Math- learned policy with feedback control, in order to precisely ematically, VBR corresponds to a constrained optimization hit the target bitrate, and therefore minimize constraint vio- problem (for a particular video): lations. maximize PSNR(π) Although the method is developed with the goal of solv- π ing the rate control problem, it is general and is applicable ; (1) subject to bitrate(π) ≤ bitrate to other constrained planning problems and imitation learn- target ing methods. For example, the feedback control augmen- where π is a policy that decides the QP for each video frame tation defines a policy improvement operator for a class of during the second-pass encoding. PSNR (Peak Signal-to- constrained planning problems, and may be used in rein- Noise Ratio) measures the video quality (a.k.a., distortion) forcement learning. and is proportional to log-sum of mean squared error of all We evaluate and compare our learned neural rate con- show frames (excluding hidden alternate reference frames). trol policy against the default rate control policy in lib- Bitrate is the sum of bits of all frames divided by the du- vpx’s two-pass VBR mode. When evaluated on 200 real- ration of the video. Note that PSNR in Equation (1) can world videos never seen during training, our learned policy be replaced with other distortion metrics such as SSIM [28] achieves a 8.5% median reduction in bitrate while maintain- and VMAF [20]. ing the same video quality. Metrics. Bjontegaard delta rate (BD-rate) [3] is a key In summary, our contributions include: metric to compare rate control policies. Given the rate- 1. We formulate the rate control problem as a Partially distortion (RD) (bitrate-PSNR in our case) curve of two Observable Markov Decision Process (POMDP) (§2); policies, BD-rate computes the average of bitrate difference in percentage across the overlapped PSNR range, and there- 2. We develop an imitation learning method, including fore, measures the average bitrate reduction. teacher dataset generation, auxiliary losses, model ar- When we have only a single point on the RD-curve of chitecture and a customized hindsight experience re- one policy, BD-rate reduces to projected bitrate difference, play method, for learning a rate control policy (§3); which is the difference between the bitrate of one policy 3. We develop a policy improvement operator that aug- and the interpolated bitrate of the other policy at the same ments a learned policy with feedback control, and PSNR, as illustrated in Figure 1b. Similarly, we define pro- show that it significantly reduces the constraint viola- jected PSNR difference as the difference between the PSNR tion (§3.5); of one policy and the interpolated PSNR of the other policy at the same bitrate. 4. We show that our learned rate control policy achieves 8.5% median reduction in bitrate while maintaining the 2.2. Partially Observable Markov Decision Process same video quality (measured in PSNR) (§4). (POMDP) formulation We formulate the rate control problem Equation (1) as 2. Problem formulation and challenges a POMDP. Each step of the POMDP corresponds to the encoding of one video frame. At each step, the action is 2.1. Rate control in video encoding the QP, an integer in [0; 255], for the corresponding video Without loss of generality, we focus on improving rate frame. The state of the MDP corresponds to libvpx’s in- control in libvpx, a widely adopted open source implemen- ternal state, which is updated as new frames are encoded. tation of VP9. The same methodology applies to other mod- The state transition is deterministic and is determined by ern video encoders. As illustrated in Figure 1a, the rate how libvpx encodes a frame. We expose a subset of lib- controller makes encoding decisions of video frames se- vpx’s internal state as the observation, including (1) first- quentially. The rate controller regulates the trade-off be- pass statistics (e.g., noise energy, motion vector statistics) tween rate and distortion by assigning a quantization pa- of all video frames, which provides a rough estimate of the rameter (QP) to each frame. QP is an integer in the range of frame complexity of all frames, and helps the rate controller [0; 255]. QP is monotonically mapped to a quantization step to plan for future frames, (2) features of the frame to be en- size, which is used to digitalize prediction residue for en- coded (e.g., frame type and frame index) and (3) features tropy coding. Smaller quantization step sizes lead to smaller of previously encoded frames (e.g., bits/size of the encoded quantization error but also higher bits usage. Smaller quan- frame and mean-squared error between the encoded frame tization error means smaller reconstruction error, measured and the raw frame). Supplementary MaterialsB enumerates by mean squared error (MSE). all the observations we used in this work. Note that because 2 QP0 QP1 QP2 QP3 QP4 Encoder Encoder Encoder Encoder Encoder Bits0 / MSE0 Bits1 / MSE1 Bits2 / MSE2 Bits3 / MSE3 Bits4 / MSE4 (a) Rate control via quantization parameters (QPs). (b) Projected PSNR and bitrate. Figure 1: Illustrative example of the rate control problem and the metrics for encoding efficiency.
Recommended publications
  • The Interplay of Compile-Time and Run-Time Options for Performance Prediction Luc Lesoil, Mathieu Acher, Xhevahire Tërnava, Arnaud Blouin, Jean-Marc Jézéquel
    The Interplay of Compile-time and Run-time Options for Performance Prediction Luc Lesoil, Mathieu Acher, Xhevahire Tërnava, Arnaud Blouin, Jean-Marc Jézéquel To cite this version: Luc Lesoil, Mathieu Acher, Xhevahire Tërnava, Arnaud Blouin, Jean-Marc Jézéquel. The Interplay of Compile-time and Run-time Options for Performance Prediction. SPLC 2021 - 25th ACM Inter- national Systems and Software Product Line Conference - Volume A, Sep 2021, Leicester, United Kingdom. pp.1-12, 10.1145/3461001.3471149. hal-03286127 HAL Id: hal-03286127 https://hal.archives-ouvertes.fr/hal-03286127 Submitted on 15 Jul 2021 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. The Interplay of Compile-time and Run-time Options for Performance Prediction Luc Lesoil, Mathieu Acher, Xhevahire Tërnava, Arnaud Blouin, Jean-Marc Jézéquel Univ Rennes, INSA Rennes, CNRS, Inria, IRISA Rennes, France [email protected] ABSTRACT Both compile-time and run-time options can be configured to reach Many software projects are configurable through compile-time op- specific functional and performance goals. tions (e.g., using ./configure) and also through run-time options (e.g., Existing studies consider either compile-time or run-time op- command-line parameters, fed to the software at execution time).
    [Show full text]
  • Screen Capture Tools to Record Online Tutorials This Document Is Made to Explain How to Use Ffmpeg and Quicktime to Record Mini Tutorials on Your Own Computer
    Screen capture tools to record online tutorials This document is made to explain how to use ffmpeg and QuickTime to record mini tutorials on your own computer. FFmpeg is a cross-platform tool available for Windows, Linux and Mac. Installation and use process depends on your operating system. This info is taken from (Bellard 2016). Quicktime Player is natively installed on most of Mac computers. This tutorial focuses on Linux and Mac. Table of content 1. Introduction.......................................................................................................................................1 2. Linux.................................................................................................................................................1 2.1. FFmpeg......................................................................................................................................1 2.1.1. installation for Linux..........................................................................................................1 2.1.1.1. Add necessary components........................................................................................1 2.1.2. Screen recording with FFmpeg..........................................................................................2 2.1.2.1. List devices to know which one to record..................................................................2 2.1.2.2. Record screen and audio from your computer...........................................................3 2.2. Kazam........................................................................................................................................4
    [Show full text]
  • Challenges in Relaying Video Back to Mission Control
    Challenges in Relaying Video Back to Mission Control CONTENTS 1 Introduction ..................................................................................................................................................................................... 2 2 Encoding ........................................................................................................................................................................................... 3 3 Time is of the Essence .............................................................................................................................................................. 3 4 The Reality ....................................................................................................................................................................................... 5 5 The Tradeoff .................................................................................................................................................................................... 5 6 Alleviations and Implementation ......................................................................................................................................... 6 7 Conclusion ....................................................................................................................................................................................... 7 White Paper Revision 2 July 2020 By Christopher Fadeley Using a customizable hardware-accelerated encoder is essential to delivering the high
    [Show full text]
  • Hikvision H.264+ Encoding Technology
    WHITE PAPER Hikvision H.264+ Encoding Technology Encoding Improvement / Higher Transmission / Efficiency Storage Savings 2 Contents 1. Introduction .............................................................................................. 3 2. Background ............................................................................................... 3 3. Key Technologies .................................................................................... 4 3.1 Predictive Encoding ........................................................................ 4 3.2 Noise Suppression.......................................................................... 8 3.3 Long-Term Bitrate Control........................................................... 9 4. Applications ............................................................................................ 11 5. Conclusion............................................................................................... 11 Hikvision H.264+ Encoding Technology 3 1. INTRODUCTION As the global market leader in video surveillance products, Hikvision Digital Technology Co., Ltd., continues to strive for enhancement of its products through application of the latest in technology. H.264+ Advanced Video Coding (AVC) optimizes compression beyond the current H.264 standard. Through the combination of intelligent analysis technology with predictive encoding, noise suppression, and long-term bitrate control, Hikvision is meeting the demand for higher resolution at reduced bandwidths. Our customers will benefit
    [Show full text]
  • Arxiv:2007.15943V1 [Cs.SE] 31 Jul 2020
    MUZZ: Thread-aware Grey-box Fuzzing for Effective Bug Hunting in Multithreaded Programs Hongxu Chen§† Shengjian Guo‡ Yinxing Xue§∗ Yulei Sui¶ Cen Zhang† Yuekang Li† Haijun Wang# Yang Liu† †Nanyang Technological University ‡Baidu Security ¶University of Technology Sydney §University of Science and Technology of China #Ant Financial Services Group Abstract software performance. A typical computing paradigm of mul- tithreaded programs is to accept a set of inputs, distribute Grey-box fuzz testing has revealed thousands of vulner- computing jobs to threads, and orchestrate their progress ac- abilities in real-world software owing to its lightweight cordingly. Compared to sequential programs, however, multi- instrumentation, fast coverage feedback, and dynamic adjust- threaded programs are more prone to severe software faults. ing strategies. However, directly applying grey-box fuzzing On the one hand, the non-deterministic thread-interleavings to input-dependent multithreaded programs can be extremely give rise to concurrency-bugs like data-races, deadlocks, inefficient. In practice, multithreading-relevant bugs are usu- etc [32]. These bugs may cause the program to end up with ab- ally buried in the sophisticated program flows. Meanwhile, normal results or unexpected hangs. On the other hand, bugs existing grey-box fuzzing techniques do not stress thread- that appear under specific inputs and interleavings may lead interleavings that affect execution states in multithreaded pro- to concurrency-vulnerabilities [5, 30], resulting in memory grams. Therefore, mainstream grey-box fuzzers cannot ade- corruptions, information leakage, etc. quately test problematic segments in multithreaded software, although they might obtain high code coverage statistics. There exist a line of works on detecting bugs and vulner- To this end, we propose MUZZ, a new grey-box fuzzing abilities in multithreaded programs.
    [Show full text]
  • Efficient Multi-Codec Support for OTT Services: HEVC/H.265 And/Or AV1?
    Efficient Multi-Codec Support for OTT Services: HEVC/H.265 and/or AV1? Christian Timmerer†,‡, Martin Smole‡, and Christopher Mueller‡ ‡Bitmovin Inc., †Alpen-Adria-Universität San Francisco, CA, USA and Klagenfurt, Austria, EU ‡{firstname.lastname}@bitmovin.com, †{firstname.lastname}@itec.aau.at Abstract – The success of HTTP adaptive streaming is un- multiple versions (e.g., different resolutions and bitrates) and disputed and technical standards begin to converge to com- each version is divided into predefined pieces of a few sec- mon formats reducing market fragmentation. However, other onds (typically 2-10s). A client first receives a manifest de- obstacles appear in form of multiple video codecs to be sup- scribing the available content on a server, and then, the client ported in the future, which calls for an efficient multi-codec requests pieces based on its context (e.g., observed available support for over-the-top services. In this paper, we review the bandwidth, buffer status, decoding capabilities). Thus, it is state of the art of HTTP adaptive streaming formats with re- able to adapt the media presentation in a dynamic, adaptive spect to new services and video codecs from a deployment way. perspective. Our findings reveal that multi-codec support is The existing different formats use slightly different ter- inevitable for a successful deployment of today's and future minology. Adopting DASH terminology, the versions are re- services and applications. ferred to as representations and pieces are called segments, which we will use henceforth. The major differences between INTRODUCTION these formats are shown in Table 1. We note a strong differ- entiation in the manifest format and it is expected that both Today's over-the-top (OTT) services account for more than MPEG's media presentation description (MPD) and HLS's 70 percent of the internet traffic and this number is expected playlist (m3u8) will coexist at least for some time.
    [Show full text]
  • User Manual 19HFL5014W Contents
    User Manual 19HFL5014W Contents 1 TV Tour 3 13 Help and Support 119 1.1 Professional Mode 3 13.1 Troubleshooting 119 13.2 Online Help 120 2 Setting Up 4 13.3 Support and Repair 120 2.1 Read Safety 4 2.2 TV Stand and Wall Mounting 4 14 Safety and Care 122 2.3 Tips on Placement 4 14.1 Safety 122 2.4 Power Cable 4 14.2 Screen Care 123 2.5 Antenna Cable 4 14.3 Radiation Exposure Statement 123 3 Arm mounting 6 15 Terms of Use 124 3.1 Handle 6 15.1 Terms of Use - TV 124 3.2 Arm mounting 6 16 Copyrights 125 4 Keys on TV 7 16.1 HDMI 125 16.2 Dolby Audio 125 5 Switching On and Off 8 16.3 DTS-HD (italics) 125 5.1 On or Standby 8 16.4 Wi-Fi Alliance 125 16.5 Kensington 125 6 Specifications 9 16.6 Other Trademarks 125 6.1 Environmental 9 6.2 Operating System 9 17 Disclaimer regarding services and/or software offered by third parties 126 6.3 Display Type 9 6.4 Display Input Resolution 9 Index 127 6.5 Connectivity 9 6.6 Dimensions and Weights 10 6.7 Sound 10 7 Connect Devices 11 7.1 Connect Devices 11 7.2 Receiver - Set-Top Box 12 7.3 Blu-ray Disc Player 12 7.4 Headphones 12 7.5 Game Console 13 7.6 USB Flash Drive 13 7.7 Computer 13 8 Videos, Photos and Music 15 8.1 From a USB Connection 15 8.2 Play your Videos 15 8.3 View your Photos 15 8.4 Play your Music 16 9 Games 18 9.1 Play a Game 18 10 Professional Menu App 19 10.1 About the Professional Menu App 19 10.2 Open the Professional Menu App 19 10.3 TV Channels 19 10.4 Games 19 10.5 Professional Settings 20 10.6 Google Account 20 11 Android TV Home Screen 22 11.1 About the Android TV Home Screen 22 11.2 Open the Android TV Home Screen 22 11.3 Android TV Settings 22 11.4 Connect your Android TV 25 11.5 Channels 27 11.6 Channel Installation 27 11.7 Internet 29 11.8 Software 29 12 Open Source Software 31 12.1 Open Source License 31 2 1 TV Tour 1.1 Professional Mode What you can do In Professional Mode ON, you can have access to a large number of expert settings that enable advanced control of the TV’s state or to add additional functions.
    [Show full text]
  • Ffmpeg Codecs Documentation Table of Contents
    FFmpeg Codecs Documentation Table of Contents 1 Description 2 Codec Options 3 Decoders 4 Video Decoders 4.1 hevc 4.2 rawvideo 4.2.1 Options 5 Audio Decoders 5.1 ac3 5.1.1 AC-3 Decoder Options 5.2 flac 5.2.1 FLAC Decoder options 5.3 ffwavesynth 5.4 libcelt 5.5 libgsm 5.6 libilbc 5.6.1 Options 5.7 libopencore-amrnb 5.8 libopencore-amrwb 5.9 libopus 6 Subtitles Decoders 6.1 dvbsub 6.1.1 Options 6.2 dvdsub 6.2.1 Options 6.3 libzvbi-teletext 6.3.1 Options 7 Encoders 8 Audio Encoders 8.1 aac 8.1.1 Options 8.2 ac3 and ac3_fixed 8.2.1 AC-3 Metadata 8.2.1.1 Metadata Control Options 8.2.1.2 Downmix Levels 8.2.1.3 Audio Production Information 8.2.1.4 Other Metadata Options 8.2.2 Extended Bitstream Information 8.2.2.1 Extended Bitstream Information - Part 1 8.2.2.2 Extended Bitstream Information - Part 2 8.2.3 Other AC-3 Encoding Options 8.2.4 Floating-Point-Only AC-3 Encoding Options 8.3 flac 8.3.1 Options 8.4 opus 8.4.1 Options 8.5 libfdk_aac 8.5.1 Options 8.5.2 Examples 8.6 libmp3lame 8.6.1 Options 8.7 libopencore-amrnb 8.7.1 Options 8.8 libopus 8.8.1 Option Mapping 8.9 libshine 8.9.1 Options 8.10 libtwolame 8.10.1 Options 8.11 libvo-amrwbenc 8.11.1 Options 8.12 libvorbis 8.12.1 Options 8.13 libwavpack 8.13.1 Options 8.14 mjpeg 8.14.1 Options 8.15 wavpack 8.15.1 Options 8.15.1.1 Shared options 8.15.1.2 Private options 9 Video Encoders 9.1 Hap 9.1.1 Options 9.2 jpeg2000 9.2.1 Options 9.3 libkvazaar 9.3.1 Options 9.4 libopenh264 9.4.1 Options 9.5 libtheora 9.5.1 Options 9.5.2 Examples 9.6 libvpx 9.6.1 Options 9.7 libwebp 9.7.1 Pixel Format 9.7.2 Options 9.8 libx264, libx264rgb 9.8.1 Supported Pixel Formats 9.8.2 Options 9.9 libx265 9.9.1 Options 9.10 libxvid 9.10.1 Options 9.11 mpeg2 9.11.1 Options 9.12 png 9.12.1 Private options 9.13 ProRes 9.13.1 Private Options for prores-ks 9.13.2 Speed considerations 9.14 QSV encoders 9.15 snow 9.15.1 Options 9.16 vc2 9.16.1 Options 10 Subtitles Encoders 10.1 dvdsub 10.1.1 Options 11 See Also 12 Authors 1 Description# TOC This document describes the codecs (decoders and encoders) provided by the libavcodec library.
    [Show full text]
  • A Packetization and Variable Bitrate Interframe Compression Scheme for Vector Quantizer-Based Distributed Speech Recognition
    A Packetization and Variable Bitrate Interframe Compression Scheme For Vector Quantizer-Based Distributed Speech Recognition Bengt J. Borgstrom¨ and Abeer Alwan Department of Electrical Engineering, University of California, Los Angeles [email protected], [email protected] Abstract For example, the ETSI standard [2] packetization scheme sim- ply concatenates 2 adjacent 44-bit source coded frames, and We propose a novel packetization and variable bitrate com- transmits the resulting signal along with header information. pression scheme for DSR source coding, based on the Group The proposed algorithm performs further compression using of Pictures concept from video coding. The proposed algorithm interframe correlation. The high time correlation present in simultaneously packetizes and further compresses source coded speech has previously been studied for speech coding [3] and features using the high interframe correlation of speech, and is DSR coding [4]. compatible with a variety of VQ-based DSR source coders. The The proposed algorithm allows for lossless compression of algorithm approximates vector quantizers as Markov Chains, the quantized speech features. However, the algorithm is also and empirically trains the corresponding probability parame- robust to various degrees of lossy compression, either through ters. Feature frames are then compressed as I-frames, P-frames, VQ pruning or frame puncturing. VQ pruning refers to ex- or B-frames, using Huffman tables. The proposed scheme can clusion of low probability VQ codebook labels prior to Huff- perform lossless compression, but is also robust to lossy com- man coding, and thus excludes longer Huffman codewords, pression through VQ pruning or frame puncturing. To illustrate and frame puncturing refers to the non-transmission of certain its effectiveness, we applied the proposed algorithm to the ETSI frames, which drastically reduces the final bitrate.
    [Show full text]
  • Is the Linux Desktop Less Secure Than Windows 10? Or How Super Mario Music Can Own Your System
    IS THE LINUX DESKTOP LESS SECURE THAN WINDOWS 10? OR HOW SUPER MARIO MUSIC CAN OWN YOUR SYSTEM. Hanno Böck https://hboeck.de 1 This was too easy . It should not be possible to find a serious memory corruption vulnerability in the default Linux desktop attack surface with just a few minutes of looking. Although it’ s hard to say it, this is not the kind of situation that occurs with a latest Windows 10 default install. Is it possible that Linux desktop security has rotted? (Chris Evans) 2 NINTENDO SOUND FILES (1) Exploit against Gstreamer in Ubuntu 12.04 (LTS). Thumbnail parser. 3 NINTENDO SOUND FILES (2) NSF players are mini-emulators - the attacker can execute code in an emulator. Easier to bypass modern exploit mitigation techniques. 4 FIX The fix is to delete the affected NSF gstreamer plugin. No problem: Ubuntu shipped two different NSF player plugins. 5 FLIC EXPLOIT 6 AUTOMATIC DOWNLOADS Some browsers automatically download files to ~/Downloads. Any webpage can create files on your filesystem. (Chrome/Chromium, Epiphany, ... - not Linux specific) 7 TRACKER GNOME Desktop search tool automatically indexes all new files in a user's home - including ~/Downloads. 8 REACTION FROM TRACKER DEVELOPER Furthermore, the GStreamer guys were extremely fast in fixing it. You could claim that other libraries used for metadata extraction are just as insecure, but that'd really be bugs in these libraries to fix. (Carlos Garnacho) 9 TRACKER PARSERS (1) Gstreamer, ffmpeg, flac, totem-pl-parser, tiff, libvorbis, taglib, libpng, libexif, giflib, libjpeg-turbo, libosinfo, poppler, libxml2, exempi, libgxps, ghostscript, libitpcdata 10 TRACKER PARSERS (2) If you can exploit any of them you can exploit many Linux desktop users from the web without user interaction.
    [Show full text]
  • AVC to the Max: How to Configure Encoder
    Contents Company overview …. ………………………………………………………………… 3 Introduction…………………………………………………………………………… 4 What is AVC….………………………………………………………………………… 6 Making sense of profiles, levels, and bitrate………………………………………... 7 Group of pictures and its structure..………………………………………………… 11 Macroblocks: partitioning and prediction modes….………………………………. 14 Eliminating spatial redundancy……………………………………………………… 15 Eliminating temporal redundancy……...……………………………………………. 17 Adaptive quantization……...………………………………………………………… 24 Deblocking filtering….….…………………………………………………………….. 26 Entropy encoding…………………………………….……………………………….. 2 8 Conclusion…………………………………………………………………………….. 29 Contact details..………………………………………………………………………. 30 2 www.elecard.com Company overview Elecard company, founded in 1988, is a leading provider of software products for encoding, decoding, processing, monitoring and analysis of video and audio data in 9700 companies various formats. Elecard is a vendor of professional software products and software development kits (SDKs); products for in - depth high - quality analysis and monitoring of the media content; countries 1 50 solutions for IPTV and OTT projects, digital TV broadcasting and video streaming; transcoding servers. Elecard is based in the United States, Russia, and China with 20M users headquarters located in Tomsk, Russia. Elecard products are highly appreciated and widely used by the leaders of IT industry such as Intel, Cisco, Netflix, Huawei, Blackmagic Design, etc. For more information, please visit www.elecard.com. 3 www.elecard.com Introduction Video compression is the key step in video processing. Compression allows broadcasters and premium TV providers to deliver their content to their audience. Many video compression standards currently exist in TV broadcasting. Each standard has different properties, some of which are better suited to traditional live TV while others are more suited to video on demand (VoD). Two basic standards can be identified in the history of video compression: • MPEG-2, a legacy codec used for SD video and early digital broadcasting.
    [Show full text]
  • Adaptive Bitrate Streaming Over Cellular Networks: Rate Adaptation and Data Savings Strategies
    Adaptive Bitrate Streaming Over Cellular Networks: Rate Adaptation and Data Savings Strategies Yanyuan Qin, Ph.D. University of Connecticut, 2021 ABSTRACT Adaptive bitrate streaming (ABR) has become the de facto technique for video streaming over the Internet. Despite a flurry of techniques, achieving high quality ABR streaming over cellular networks remains a tremendous challenge. First, the design of an ABR scheme needs to balance conflicting Quality of Experience (QoE) metrics such as video quality, quality changes, stalls and startup performance, which is even harder under highly dynamic bandwidth in cellular network. Second, streaming providers have been moving towards using Variable Bitrate (VBR) encodings for the video content, which introduces new challenges for ABR streaming, whose nature and implications are little understood. Third, mobile video streaming consumes a lot of data. Although many video and network providers currently offer data saving options, the existing practices are suboptimal in QoE and resource usage. Last, when the audio and video arXiv:2104.01104v2 [cs.NI] 14 May 2021 tracks are stored separately, video and audio rate adaptation needs to be dynamically coordinated to achieve good overall streaming experience, which presents interesting challenges while, somewhat surprisingly, has received little attention by the research community. In this dissertation, we tackle each of the above four challenges. Firstly, we design a framework called PIA (PID-control based ABR streaming) that strategically leverages PID control concepts and novel approaches to account for the various requirements of ABR streaming. The evaluation results demonstrate that PIA outperforms state-of-the-art schemes in providing high average bitrate with signif- icantly lower bitrate changes and stalls, while incurring very small runtime overhead.
    [Show full text]