Low Delay MPEG2 Video Encoding N\ P

Total Page:16

File Type:pdf, Size:1020Kb

Low Delay MPEG2 Video Encoding N\ P Low Delay MPEG2 Video Encoding by Tri D. Tran Submitted to the Department of Electrical Engineering and Computer Science in Partial Fulfillment of the Requirements for the Degrees of Bachelor of Science in Electrical Science and Engineering and Master of Engineering in Electrical Engineering and Computer Science at the Massachusetts Institute of Technology February 3, 1997 © Copyright 1997 Tri D. Tran. All Rights Reserved. The author hereby grants to M.I.T. permission to reproduce and distribute publicly paper and electronic copies of this thesis and to grant others the right to do so. A uthor ............................. ............................... ............... Departmlene f E-., , Enginring and Computer Science 1 0- February 3, 1997 Certified by .. ...J Andrew B. Lippman Lec rer,4Associate Director, MIT Media Laboratory 0'l Thesis Supervisor Accepted by .... N\ P .........n D F. R. Morgenthaler Chairman, Depar ent Committee on Graduate Theses Low Delay MPEG2 Video Encoding by Tri D. Tran Submitted to the Department of Electrical Engineering and Computer Science February 3, 1997 In Partial Fulfillment of the Requirements for the Degrees of Bachelor of Science in Electrical Science and Engineering and Master of Engineering in Electrical Engineering and Computer Science ABSTRACT High-quality and low-delay MPEG2 video coding can be achieved by avoiding the use of intra (I) and bidirectional prediction (B) pictures. Such coding requires intra macroblocks refreshing techniques for channel error propagation resilience and for compliance with the accuracy requirements of the MPEG2 standard. This project gives an overview of the MPEG technology, describes some of these low delay coding techniques and presents software simulation results of their performance in terms of image quality and their robustness to transmission channel errors. Trade-offs and comparisons among various coding strategies are analyzed. Based on the test sequence "Susie," it is shown that the simple low delay coding algorithm gives an overall visual quality acceptable to general viewers even at relatively low bit rate, desirable delay, and under severe transmission error environments. Thesis Supervisor: Andrew B. Lippman Title: Lecturer, Associate Director, MIT Media Laboratory To my parents and to Maria Acknowledgments This thesis project is carried out at the IBM Thomas J. Watson Research Center located in Yorktown Heights, New York. It lasts from the beginning of June 1996 to the end of Janu- ary 1997. I started writing the thesis document itself during early November and I owe many people, in and outside of work, for the completion of it. Please forgive me if I forget to list anyone. Firstly, I would like to thank Cesar Gonzales at IBM Research for giving me the opportunity to work with his group on this particular project and guiding me through it. Without his help, I would not have a thesis to work on in the first place. I also would like to thank my MIT thesis advisor Andy Lippman for accepting me as his advisee, giving me advice and proofreading my thesis. I cannot forget to thank my two mentors at IBM Research--Lurng-Kuo Liu and Peter Westerink. Lurng-Kuo has helped me through with many concepts about the MPEG tech- nology in general and he also guided me through my low delay coding project. I normally meet with him on Monday mornings to discuss my progress and what to expect next. On the other hand, Peter has helped me tremendously with the MPEG software codec. Ques- tions about coding techniques, data interpretation, and many other topics are answered. I often just run to either Lurng-Kuo's or Peter's office whenever I have a new question. In short, I owe much of the thesis completion to these two persons. Other people in the group who have helped me through with my project are Rajesh Rajagopalan and Lascoe Allman. Rajesh has helped me proofread my technical paper and also pointed out helpful hints about my project. Lascoe readily mounts and dismounts any CDs and directories for me whenever asked. Other members of the department are always interested in knowing my progress. Outside of work, I would like to thank Barbara Lynds for giving me the greatest hospi- tality throughout the fall and winter months. She has always been the person I can come up and talk to about my daily events, or about anything at all. She even fixes me food and medicine whenever I don't feel well. Maria Su has been the love of my life, way before the project even started. She has always been loving and giving me support through whatever I decide to do. Please forgive me for those times when my work seems to take over our lives. You know better than any- one else that it does not. :-) I love you with all my heart! My parents and my brother have always been supporting me throughout. My brother Trac has always given me advice about my work in general. My parents, being so far away in Vietnam, have given me all the emotional support and unsurpassed love any par- ents can give to a child they haven't seen for more than a decade. Hopefully, the day of our family reunion isn't so far away... Table of Contents Chapter 1 Introduction -------------------------------- 7 Chapter 2 Overview of the MPEG Technology- --------- --- - 10 The MPEG1 Standard................................... ............... 10 Overview of MPEG 1 Compression Algorithm......................... 11 Reducing Spatial Redundancy ............................................... 11 Reducing Temporal Redundancy .................................... 13 BidirectionalTemporal Prediction................... 14 The MPEG1 Bitstream Structures............................. ....... 16 Video Sequence Header....................................................... 18 Group of Pictures (GOP)Header.................... 19 Picture Header........................ ............... 21 Slice Header............................................... 22 Macroblock Header ...................................... ......... 22 Block ........................................................ 23 A Typical MPEG 1 Encoder ...................................... ...... 23 A Typical MPEG 1 Decoder ..................................... ....... 25 Overall Performance of MPEG 1 ........................................ 26 The MPEG2 Standard .............................. ................................. 26 The MPEG2 Bit-Stream Syntax............................... ....... 28 Exploiting Spatial Redundancy ...................................... ..... 29 Exploiting Temporal Redundancy............................. ....... 29 Frame Predictionfor Frame Pictures................................. 30 Field Predictionfor Field Pictures..................................... 30 Field Predictionfor Frame Pictures................................... 30 Dual Primefor P Pictures........................................... 31 16x8 MC for Field Pictures............................. ........ 31 Typical Codec Architecture......................... ..... ............ 31 MPEG2 Scalability Extensions ........................................... 32 SNR Scalability..................................... .. ............. 32 Data Partitioning........................................ ........... 33 Spatial Scalability....................................... ........... 33 Temporal Scalability ............................................... 33 Regarding MPEG and Low Delay Coding ................................. 33 Chapter 3 Low Delay Coding Design and Implementation - - - - 36 The H.261 and H.263 Videoconferencing Standards............. 36 Design of the Low Delay Coding Algorithm.......................... 37 Refreshing Strategy Using Intra Slices and Columns ................ 37 Reducing Error Propagation............................ ............ 42 Compliance With MPEG2 Standard .................................... 44 Implementation of the Low Delay Coding Algorithm .............. 44 Relevant Files of the MPEG2 Encoder ..................................... 46 Implementation of Intra Slices and Columns............................ 49 Implementation of Constrained Motion Estimation.................. 55 An Exam ple......................................................................... 55 Chapter 4 Simulation Results ---------------------------- 57 Buffer Fullness Behavior ............................. .............................. 58 Video Quality Without Errors.................... ........ 64 Transmission Error Robustness .......... ........................ 67 The Error Injector.............................................. 67 Experimental Results ....................................... ............ 69 Chapter 5 Further Discussion of Simulations - --- ----- ---- - - 75 Buffer Fullness Occupancy .......................................................... 75 Video Quality ............................................. 75 Error Resilience ......................................... ................................... 76 Further Discussions ..................................... ..... ............. 80 Chapter 6 Conclusions --------------------------------- 82 Appendix A Modifications of the MPEG2 Software Encoder - -- - 84 Appendix B The Random Bit Error Injector- - -- --- --- - -- - 115 References ----------------------------------------- 118 1 Introduction Started in 1988, the Moving Picture Experts Group (MPEG) committee set a goal to stan- dardize a video coding strategy for video storage and retrieval for multimedia applications. Not only did the committee succeed with the standardization of MPEG
Recommended publications
  • Improving Mobile IP Handover Latency on End-To-End TCP in UMTS/WCDMA Networks
    Improving Mobile IP Handover Latency on End-to-End TCP in UMTS/WCDMA Networks Chee Kong LAU A thesis submitted in fulfilment of the requirements for the degree of Master of Engineering (Research) in Electrical Engineering School of Electrical Engineering and Telecommunications The University of New South Wales March, 2006 CERTIFICATE OF ORIGINALITY I hereby declare that this submission is my own work and to the best of my knowledge it contains no materials previously published or written by another person, nor material which to a substantial extent has been accepted for the award of any other degree or diploma at UNSW or any other educational institution, except where due acknowledgement is made in the thesis. Any contribution made to the research by others, with whom I have worked at UNSW or elsewhere, is explicitly acknowledged in the thesis. I also declare that the intellectual content of this thesis is the product of my own work, except to the extent that assistance from others in the project's design and conception or in style, presentation and linguistic expression is acknowledged. (Signed)_________________________________ i Dedication To: My girl-friend (Jeslyn) for her love and caring, my parents for their endurance, and to my friends and colleagues for their understanding; because while doing this thesis project I did not manage to accompany them. And Lord Buddha for His great wisdom and compassion, and showing me the actual path to liberation. For this, I take refuge in the Buddha, the Dharma, and the Sangha. ii Acknowledgement There are too many people for their dedications and contributions to make this thesis work a reality; without whom some of the contents of this thesis would not have been realised.
    [Show full text]
  • OEM Cobranet Module Design Guide
    CDK-8 OEM CobraNet Module Design Guide Date 8/29/2008 Revision 1.1 © Attero Tech, LLC 1315 Directors Row, Suite 107, Ft Wayne, IN 46808 Phone 260-496-9668 • Fax 260-496-9879 620-00001-01 CDK-8 Design Guide Contents 1 – Overview .....................................................................................................................................................................................................................................2 1.1 – Notes on Modules .................................................................................................................................................... 2 2 – Digital Audio Interface Connectivity...........................................................................................................................................................................3 2.1 – Pin Descriptions ....................................................................................................................................................... 3 2.1.1 - Audio clocks ..................................................................................................................................................... 3 2.1.2 – Digital audio..................................................................................................................................................... 3 2.1.3 – Serial bridge ..................................................................................................................................................... 4 2.1.4 – Control............................................................................................................................................................
    [Show full text]
  • Public Address System Network Design Considerations
    AtlasIED APPLICATION NOTE Public Address System Network Design Considerations Background AtlasIED provides network based Public Address Systems (PAS) that are deployed on a wide variety of networks at end user facilities worldwide. As such, a primary factor, directly impacting the reliability of the PAS, is a properly configured, reliable, well-performing network on which the PAS resides/functions. AtlasIED relies solely upon the end user’s network owner/manager for the design, provision, configuration and maintenance of the network, in a manner that enables proper PAS functionability/functionality. Should the network on which the PAS resides be improperly designed, configured, maintained, malfunctions or undergoes changes or modifications, impacts to the reliability, functionality or stability of the PAS can be expected, resulting in system anomalies that are outside the control of AtlasIED. In such instances, AtlasIED can be a resource to, and support the end user’s network owner/manager in diagnosing the problems and restoring the PAS to a fully functioning and reliable state. However, for network related issues, AtlasIED would look to the end user to recover the costs associated with such activities. While AtlasIED should not be expected to actually design a facility’s network, nor make formal recommendations on specific network equipment to use, this application note provides factors to consider – best practices – when designing a network for public address equipment, along with some wisdom and possible pitfalls that have been gleaned from past experiences in deploying large scale systems. This application note is divided into the following sections: n Local Network – The network that typically hosts one announcement controller and its peripherals.
    [Show full text]
  • Aperformance Analysis for Umts Packet Switched
    International Journal of Next-Generation Networks (IJNGN), Vol.2, No.1, M arch 2010 A PERFORMANCE ANALYSIS FOR UMTS PACKET SWITCHED NETWORK BASED ON MULTIVARIATE KPIS Ye Ouyang and M. Hosein Fallah, Ph.D., P.E. Howe School of Technology Management Stevens Institute of Technology, Hoboken, NJ, USA 07030 [email protected] [email protected] ABSTRACT Mobile data services are penetrating mobile markets rapidly. The mobile industry relies heavily on data service to replace the traditional voice services with the evolution of the wireless technology and market. A reliable packet service network is critical to the mobile operators to maintain their core competence in data service market. Furthermore, mobile operators need to develop effective operational models to manage the varying mix of voice, data and video traffic on a single network. Application of statistical models could prove to be an effective approach. This paper first introduces the architecture of Universal Mobile Telecommunications System (UMTS) packet switched (PS) network and then applies multivariate statistical analysis to Key Performance Indicators (KPI) monitored from network entities in UMTS PS network to guide the long term capacity planning for the network. The approach proposed in this paper could be helpful to mobile operators in operating and maintaining their 3G packet switched networks for the long run. KEYWORDS UMTS; Packet Switch; Multidimensional Scaling; Network Operations, Correspondence Analysis; GGSN; SGSN; Correlation Analysis. 1. INTRODUCTION Packet switched domain of 3G UMTS network serves all data related services for the mobile subscribers. Nowadays people have a certain expectation for their experience of mobile data services that the mobile wireless environment has not fully met since the speed at which they can access their packet switching services has been limited.
    [Show full text]
  • Action-Sound Latency: Are Our Tools Fast Enough?
    Action-Sound Latency: Are Our Tools Fast Enough? Andrew P. McPherson Robert H. Jack Giulio Moro Centre for Digital Music Centre for Digital Music Centre for Digital Music School of EECS School of EECS School of EECS Queen Mary U. of London Queen Mary U. of London Queen Mary U. of London [email protected] [email protected] [email protected] ABSTRACT accepted guidelines for latency and jitter (variation in la- The importance of low and consistent latency in interactive tency). music systems is well-established. So how do commonly- Where previous papers have focused on latency in a sin- used tools for creating digital musical instruments and other gle component (e.g. audio throughput latency [21, 20] and tangible interfaces perform in terms of latency from user ac- roundtrip network latency [17]), this paper measures the tion to sound output? This paper examines several common latency of complete platforms commonly employed to cre- configurations where a microcontroller (e.g. Arduino) or ate digital musical instruments. Within these platforms we wireless device communicates with computer-based sound test the most reductive possible setup: producing an audio generator (e.g. Max/MSP, Pd). We find that, perhaps \click" in response to a change on a single digital or ana- surprisingly, almost none of the tested configurations meet log pin. This provides a minimum latency measurement for generally-accepted guidelines for latency and jitter. To ad- any tool created on that platform; naturally, the designer's dress this limitation, the paper presents a new embedded application may introduce additional latency through filter platform, Bela, which is capable of complex audio and sen- group delay, block-based processing or other buffering.
    [Show full text]
  • 5G Ultra-Reliable Low-Latency Communication Implementation Challenges and Operational Issues with Iot Devices
    Review 5G Ultra-Reliable Low-Latency Communication Implementation Challenges and Operational Issues with IoT Devices Murtaza Ahmed Siddiqi 1, Heejung Yu 2,* and Jingon Joung 3 1 Department of Information and Communication Engineering, Yeungnam University, Gyeongsan 38541, Korea 2 Department of Electronics and Information Engineering, Korea University, Sejong 30019, Korea 3 School of Electrical and Electronics Engineering, Chung-Ang University, Seoul 06974, Korea * Correspondence: [email protected]; Tel.: +82-44-860-1352 Received: 31 July 2019; Accepted: 29 August 2019; Published: 2 September 2019 Abstract: To meet the diverse industrial and market demands, the International Telecommunication Union (ITU) has classified the fifth-generation (5G) into ultra-reliable low latency communications (URLLC), enhanced mobile broadband (eMBB), and massive machine-type communications (mMTC). Researchers conducted studies to achieve the implementation of the mentioned distributions efficiently, within the available spectrum. This paper aims to highlight the importance of URLLC in accordance with the approaching era of technology and industry requirements. While highlighting a few implementation issues of URLLC, concerns for the Internet of things (IoT) devices that depend on the low latency and reliable communications of URLLC are also addressed. In this paper, the recent progress of 3rd Generation Partnership Project (3GPP) standardization and the implementation of URLLC are included. Finally, the research areas that are open for further investigation in URLLC implementation are highlighted, and efficient implementation of URLLC is discussed. Keywords: 5G; URLLC; 3GPP; 5G NR; LTE; IoT 1. Introduction With the everyday increase in data traffic requirements ranging from mission-critical to massive machine connectivity, the anticipation of the fifth-generation (5G) is growing at an exponential rate.
    [Show full text]
  • Encoding, Fast and Slow: Low-Latency Video Processing Using Thousands of Tiny Threads Sadjad Fouladi, Riad S
    Encoding, Fast and Slow: Low-Latency Video Processing Using Thousands of Tiny Threads Sadjad Fouladi, Riad S. Wahby, and Brennan Shacklett, Stanford University; Karthikeyan Vasuki Balasubramaniam, University of California, San Diego; William Zeng, Stanford University; Rahul Bhalerao, University of California, San Diego; Anirudh Sivaraman, Massachusetts Institute of Technology; George Porter, University of California, San Diego; Keith Winstein, Stanford University https://www.usenix.org/conference/nsdi17/technical-sessions/presentation/fouladi This paper is included in the Proceedings of the 14th USENIX Symposium on Networked Systems Design and Implementation (NSDI ’17). March 27–29, 2017 • Boston, MA, USA ISBN 978-1-931971-37-9 Open access to the Proceedings of the 14th USENIX Symposium on Networked Systems Design and Implementation is sponsored by USENIX. Encoding, Fast and Slow: Low-Latency Video Processing Using Thousands of Tiny Threads Sadjad Fouladi , Riad S. Wahby , Brennan Shacklett , Karthikeyan Vasuki Balasubramaniam , William Zeng , Rahul Bhalerao , Anirudh Sivaraman , George Porter , Keith Winstein Stanford University , University of California San Diego , Massachusetts Institute of Technology Abstract sion relies on temporal correlations among nearby frames. Splitting the video across independent threads prevents We describe ExCamera, a system that can edit, transform, exploiting correlations that cross the split, harming com- and encode a video, including 4K and VR material, with pression efficiency. As a result, video-processing systems low latency. The system makes two major contributions. generally use only coarse-grained parallelism—e.g., one First, we designed a framework to run general-purpose thread per video, or per multi-second chunk of a video— parallel computations on a commercial “cloud function” frustrating efforts to process any particular video quickly.
    [Show full text]
  • Audio Networking Special 2017
    NETWORKING SPECIAL GEAR Products Spotlight: Riedel MediorNet MultiViewer Audio networking equipment – what’s new. Extending the capabilities of hardware through the use of software apps has been an ongoing Lawo launches mc²96 Console theme at Riedel, starting with the company’s app-driven SmartPanel, introduced two years ago. ‘A fundamental benefit of a decentralized signal network is the ability to put signal inputs and outputs where they are needed rather than at a large, monolithic router that requires additional cabling,’ said Dr. Lars Lawo has launched its Höhmann, Product new flagship audio mixing Manager at Riedel console, the fully IP-based Communications. ‘These mc²96 Grand Production benefits apply to the Console at NAB 2017. MediorNet MultiViewer as The new console has been specifically designed to provide well, since the MultiViewer optimal performance in IP video production environments hardware can be placed anywhere while leveraging the network for sources. In addition, integrating the MultiViewer into the through native support for all relevant standards — SMPTE 2110, MediorNet ecosystem removes an extra layer of gear and complexity.’ AES67, RAVENNA and DANTE. The Lawo mc²96 console, available Each single MediorNet MultiViewer engine can access any MediorNet input signal and process up to 18 signals. These in frame sizes with 24 to 200 faders with the same quality Lawo’s signals can be placed flexibly onto four physical screens or routed to any destination within the MediorNet system and output mc²90 series was known for, is designed as Lawo’s most visual at alternative locations. The MultiViewer device provides local signal inputs and outputs to offer further connectivity options, broadcast console ever.
    [Show full text]
  • 15. Voice Over IP: Speech Transmission Over Packet Networks Voicej
    307 15. Voice over IP: Speech Transmission over Packet Networks VoiceJ. Skoglund, E. Kozica, over J. Linden, R. Hagen, W. B. Kleijn IP 15.2.3 Typical Network Characteristics ...... 312 Part C The emergence of packet networks for both data 15.2.4 Quality-of-Service Techniques ....... 313 and voice traffic has introduced new challenges for speech transmission designs that differ signif- 15.3 Outline of a VoIP System........................ 313 icantly from those encountered and handled in 15.3.1 Echo Cancelation.......................... 314 15 traditional circuit-switched telephone networks, 15.3.2 Speech Codec............................... 315 such as the public switched telephone network 15.3.3 Jitter Buffer ................................. 315 (PSTN). In this chapter, we present the many as- 15.3.4 Packet Loss Recovery .................... 316 pects that affect speech quality in a voice over IP 15.3.5 Joint Design of Jitter Buffer and Packet Loss Concealment ........ 316 (VoIP) conversation. We also present design tech- 15.3.6 Auxiliary Speech niques for coding systems that aim to overcome Processing Components ................ 316 the deficiencies of the packet channel. By properly 15.3.7 Measuring the Quality utilizing speech codecs tailored for packet net- ofaVoIPSystem........................... 317 works, VoIP can in fact produce a quality higher than that possible with PSTN. 15.4 Robust Encoding .................................. 317 15.4.1 Forward Error Correction ............... 317 15.4.2 Multiple Description Coding........... 320 15.1 Voice Communication............................ 307 15.1.1 Limitations of PSTN....................... 307 15.5 Packet Loss Concealment ....................... 326 15.1.2 The Promise of VoIP ...................... 308 15.5.1 Nonparametric Concealment.......... 326 15.5.2 Parametric Concealment ..............
    [Show full text]
  • Low Latency Audio Processing
    Low Latency Audio Processing Yonghao Wang School of Electronic Engineering and Computer Science Queen Mary University of London 2017 Submitted to University of London in partial fulfilment of the requirements for the degree of Doctor of Philosophy Queen Mary University of London 2017 1 Statement of Originality I, Yonghao Wang, confirm that the research included within this thesis is my own work or that where it has been carried out in collaboration with, or supported by others, that this is duly acknowledged below and my contribution indicated. Previously published material is also acknowledged below. I attest that I have exercised reasonable care to ensure that the work is original, and does not to the best of my knowledge break any UK law, infringe any third party’s copyright or other Intellectual Property Right, or contain any confidential material. I accept that the College has the right to use plagiarism detection software to check the electronic version of the thesis. I confirm that this thesis has not been previously submitted for the award of a degree by this or any other university. The copyright of this thesis rests with the author and no quotation from it or information derived from it may be published without the prior written consent of the author. Signature: Date: 28/November/2017 2 Abstract Latency in the live audio processing chain has become a concern for audio engineers and system designers because significant delays can be perceived and may affect synchroni- sation of signals, limit interactivity, degrade sound quality and cause acoustic feedback. In recent years, latency problems have become more severe since audio processing has become digitised, high-resolution ADCs and DACs are used, complex processing is performed, and data communication networks are used for audio signal transmission in conjunction with other traffic types.
    [Show full text]
  • Trends in Mobile Communications Outline of Presentation
    ITU TRAINING ON SPECTRUM MANAGEMENT FOR TERRESTRIAL SERVICES VICTORIA, REPUBLIC OF SEYCHELLES, 5 - 9OCTOBER, 2015 Trends in Mobile communications Outline of presentation • Evolution of mobile communications • Overview of evolving standards – 3G – UMTS HSDPA/HSUPA – HSPA+ – LTE and LTE-Advanced • Drivers of mobile broadband beyond 2020 • Technology trends in mobile broadband History of mobile communications ITU activities on standardization Frequency bands RR provisions (bandwidth) in MHz identifying the band for IMT 450-470 (20) 5.286AA 694/698-960 (366) 5.312A, 5.313A, 5.316B, 5.317A Frequency bands 1 710-2 025 (315) 5.384A, 5.388 identified for IMT by ITU 2 110-2 200 (90) 5.388 2300-2400 (100) 5.384A 2500-2690 (190) 5.384A 3400-3600 (200) 5.430A, 5.432A5.432B, 5.433A IMT-2000 (M.1457) IMT-Advanced (M.2012) a) IMT-2000 CDMA Direct Spread a) LTE-Advanced (3GPP) b) IMT-2000 CDMA Multi-Carrier b) WirelessMAN-Advanced (IEEE) c) IMT-2000 CDMA TDD IMT standards d) IMT-2000 TDMA Single-Carrier recommended by ITU e) IMT-2000 FDMA/TDMA f) IMT-2000 OFDMA TDD WMAN Wireless LAN in 2.4 GHz and 5 GHz bands (See Rec. ITU-R M.1450, ETSI EN300 328, IEEE 802.11) Giga bit WLAN (60 GHz bands) (Rec. ITU-R M.2003 (ISO/IEC 13156, ETSI EN302 567) MBWA (IEEE 802.20) 3G – beginning of mobile broadband (MBB) • The current mobile broadband begins with 3G • 3G requirements were defined in ITU standard IMT-2000 • General requirements: – Increased channel bandwidth – Packet switching – Efficient Radio Access • Rates: up to 2 MBS (indoor), 384 kbps (city), 144 kbps
    [Show full text]
  • Salsify: Low-Latency Network Video Through Tighter Integration Between a Video Codec and a Transport Protocol
    Salsify: Low-Latency Network Video Through Tighter Integration Between a Video Codec and a Transport Protocol Sadjad Fouladi Stanford University Joint work with: John Emmons, Emre Orbay, Catherine Wu, Riad S. Wahby, Keith Winstein ! snr.stanford.edu What is real-time video? Sender network Receiver 2 Real-time video latency target: tens of milliseconds the amount of time between when something happens you see it and 3 Low latency is required to maintain the interactivity of the application Interaction Sender network Receiver 4 Cloud Video Gaming Low-Latency Video Stream Remote Server Teleoperation of Robots Remote Surgery Video Conferencing WebRTC (Google Chrome) slow reaction to network variations ⇒ stalls and glitches Enter Salsify • Salsify is a new architecture for real-time Internet video. • Salsify tightly integrates a video-aware transport protocol, with a functional video codec, allowing it to respond quickly to changing network conditions. 10 Conventional design: two control loops at arm’s length video transport codec protocol 11 The narrow interface between codec and transport new target bit rate video transport codec protocol compressed frames 12 Decades of research and development on these components MPEG-1 Sprout NADA VC-1 MPEG-2 video GCC transportBBR LEDBAT H.265 H.264codec H.263 CDGprotocolRemyCC VP8 VP9 AV1 FBRA 13 “Let’s improve the transport” 1000 (kbps) Skype t 100 throughpu 0 10 20 30 40 50 60 5000 1000 500 (ms) y 100 dela 50 0 10 20 30 40 50 60 time (s) K. Winstein, A. Sivaraman, H. Balakrishnan, “Stochastic Forecasts Achieve High Throughput and Low Delay over Cellular Networks,” NSDI’13 14 “Let’s improve the transport” 1000 Sprout (kbps) t 100 throughpu improving the0 network10 20 component30 40 didn’t50 save60 the day… 5000 1000 500 (ms) y 100 dela 50 0 10 20 30 40 50 60 time (s) K.
    [Show full text]