Content-Aware Internet Video Delivery
Total Page:16
File Type:pdf, Size:1020Kb
Neural Adaptive Content-aware Internet Video Delivery Hyunho Yeo, Youngmok Jung, Jaehong Kim, Jinwoo Shin, Dongsu Han 2 Observation on Current Video Ecosystem Adaptive streaming has been widely deployed (a primary tool for improving user QoE) Video server Client Quality Bandwidth Quality High ABR Medium Low Time Time Time 3 Traditional Approaches Optimizing ABR algorithms Pensieve [SIGCOMM 17], MPC [SIGCOMM 15] Choosing better servers, CDNs Content Multihoming [SIGCOMM 12], VDN [SIGCOMM 15] Leveraging centralized control plan Video Control Plane [SIGCOMM 12], Pythease [NSDI 17] Goal: Find how to best utilize the network resource 4 Limitation of Current Video Delivery Video quality heavily depends on available bandwidth Video server Client Network Low Quality Congestion Directly affect 5 Limitation of Current Video Delivery Client computing power is scarcely utilized other than for decoding Mobile GPU Desktop GPU 500 Apple A10X 12 Apple A9X Apple A10 GTX 1080Ti 375 Apple A8X GTX 1080 9 GTX 980Ti Apple A7 GTX 780Ti 250 Apple A6X 6 GTX 780 (TFLOPs) (GFLOPs) 125 Apple A5 Apple A9 3 GTX 480 Apple A4 Apple A8 GTX 680 GTX 980 Apple A6 GTX 580 Computing Computing power Computing Computing power 0 0 2009 2012 2015 2018 2009 2012 2015 2018 Year Year 6 Observation on Current Video Ecosystem Standard codecs efficiently reduce redundancy inside GOP Group of Pictures (GOP) Video Standard codecs (H.26x, VPx, AV1) Compressed : Intra-frame coding : Inter-frame coding I-frame P-frames I-frame 7 Observation on Current Video Ecosystem Standard codecs efficiently reduce redundancy inside GOP Group of Pictures (GOP): 2—10 seconds Seamless switching Video GOP Video Standard codecsQuality (H.26x, VPx, AV1) Time Compressed : Intra-frame[Adaptive coding streaming] : Inter-frame coding I-frame P-frames I-frame 8 Limitation of Current Video Delivery Group of Pictures (GOP) : 2—10 seconds Video Time I-frame I-frame I-frame I-frame Redundancy (large timescales) Standard codecs lack any mechanisms for exploiting redundancy that occurs at large timescales 9 Key Observations on Deep Neural Network 1. Utilizes computing resource to enhance video quality Low quality DNN High quality Computing device 2. Trained and operate in large timescales (video) GOP GOP GOP DNN 10 Key Observations on Deep Neural Network 1. Utilizes computing resource to enhance video quality Low resolution Super-resolution DNN High resolution Computing device 2. Trained and operate in large timescales (video) GOP GOP GOP DNN 11 Key Observations on Deep Neural Network 1. Utilizes computing resource to enhance video quality LowLow resolution quality Super-resolutionDNN DNN HighHigh resolution quality Can we overcome theComputing current devicelimitations via DNN? 2. Trained and operate in large timescales (video or episodes) How muchGOP QoE improvementGOP can we achieve?GOP DNN 12 NAS: DNN-based Video Delivery Existing Approach NAS (Pensieve – SIGCOMM 17) 13 NAS: DNN-based Video Delivery Apply super-resolution DNN on top of bitrate adaptation : Client computation Super-resolution Bandwidth 240p 1080p 360p 1080p 480p 1080p 1080p 14 NAS: Design Scope 1. Content: Video on demand (VOD) Example 2. Computing device: Desktop-class GPUs Example GTX 1050 Ti (Entry-level) Titan Xp (High-end) Price $139 $1,200 15 NAS: Two Initial Challenges 1. DNN accuracy is unreliable for new content Guarantee performance New content Generic super-resolution1,2,3 ↓Quality SSIM = 0.86 SSIM = 0.84 2. Client must process DNN at real-time, but computing power varies across space and time, GPU x5 slower! GPU Require adaptation to computing power Client A: Entry-level GPU Client B: High-end GPU 1: SRCNN-ECCV14, 2:VDSR-CVPR 16, 3:EDSR-CVPRW 17 16 Key Design (1): Content-aware DNN Challenge: Providing reliable DNN quality 1. New video admission 2. Generates a content-aware DNN per-video Video 1 Super-resolution DNN 1 Video 2 Video server Super-resolution DNN 2 Content-aware DNN delivers the reliable training accuracy instead of the unpredictable testing accuracy. 17 Training a content-aware super-resolution 1. Prepares training data 2. Updates the DNN parameters Updates parameters High-resolution (1080p) frames Output Target Input DNN Low-resolutions (240p—720p) 18 Implication on Video Encoding Redundancy Redundancy GOP GOP GOP GOP Standard codecs Content-aware DNN 19 Key Design (2): Multiple Quality DNNs Challenge: Enabling real-time super-resolution on heterogeneous clients Downloads all? Several MBs Video server Client Quality: Low High Size: Small (93KB) Large (2,143KB) Compute: Low High 1. Provides multiple quality DNN options 20 Key Design (2): Multiple Quality DNNs Challenge: Enabling real-time super-resolution on heterogeneous clients Manifest file 2. Delivers DNN decription Video server (#Layer, #Channel) Client 53 fps 52 fps Quality: Low High 38 fps 21 fps Size: Small (93KB) Large (2,143KB) Computing device Selected (GTX 1080) Compute: Low High Mock DNNs 1. Provides multiple quality DNN options 3. Test-runs and selects the highest-quality running at real-time 21 NAS: Two Additional Challenges NAS streams video with a content-aware DNN, but … 1. Takes long time to download and utilize a DNN Incremental benefit Example Ultra-high (2,145KB) 360p video (400Kbps) 1 x 21 seconds x 2. A DNN competes bandwidth with video Integrate with ABR Example Video server Client 22 Key Design (3): Scalable DNN Challenge: Takes a long time to utilize a DNN Video server Client st nd th : Required : Optional 1 chunk 2 chunk 5 chunk No DNN Input Output Layer Layer Layer Layer Layer Layer additional by-passing paths Time 1. Implement a scalable DNN 2. Download/Apply a partial DNN (+ divide into similar-size chunks) 23 Key Design (4): Integrated ABR Challenge: How to decide when to download a DNN • Extends a reinforcement-learning based ABR (Pensieve [SIGCOMM17] ) QoE metric = bitrate - rebuffering – smoothness Reward 푟푡 Pensieve agent State 푠푡 Action 푎푡 Bandwidth 240p Environment Bitrate Playback buffer 1080p Goal: Maximize the total QoE over an entire video 24 Key Design (4): Integrated ABR Challenge: How to decide when to download a DNN • Extends a reinforcement-learning based ABR (Pensieve [SIGCOMM17] ) QoE metric = DNN(bitrate) - rebuffering – smoothness Reward 푟푡 State 푠 NAS agent 푡 Action 푎 Bandwidth 푡 240p Environment Bitrate Playback buffer 1080p # Remaining DNN DNN Goal: Maximize the total QoE reflecting DNN-based quality enhancement Putting All Together: Implementation Video DNN Server NAS Player (dash.js) ∆1.7K LOC (8.8%) DNN Processor (PyTorch) Integrated ABR 6.3K LOC 5.5K LOC 26 Evaluation 1) How much benefit does NAS deliver? 2) What are the cost and benefit of NAS ? 3) Does NAS effectively adapt to heterogeneous clients? 27 NAS vs. Existing Video Delivery : QoE • 17.8 hours real-world network traces: collected from 3G network and broadband (average bandwidth: 1.31Mbps) • 27 YouTube videos: 5-24 minutes, encoded at {400, 800, 1200, 2400, 4800}kbps • Computing device: NVIDIA Titan Xp, DNN quality: Ultra-high • Video player: Chromium browser, Video server: Apache server 28 Existing Approach NAS (Pensieve – SIGCOMMDemo: 17) NAS Highlight 29 NAS vs. Existing Video Delivery : QoE • 17.8 hours real-world network traces: collected from 3G network and broadband (average bandwidth: 1.31Mbps) • 27 YouTube videos: 5-24 minutes, encoded at {400, 800, 1200, 2400, 4800}kbps • Computing device: NVIDIA Titan Xp, DNN quality: Ultra-high • Video player: Chromium browser, Video server: Apache server 1 better BOLA 0.5 RobustMPC CDF CDF Pensieve NAS 0 -0.5 0.5 1.5 2.5 Average QoE NAS improves user QoE by 43.08% compared to Pensieve and 92.28% compared to BOLA using same amount of bandwidth. 30 NAS vs. Existing Video Delivery : Cost Pensieve CDN NAS Pensieve 2.5 = 0.085$/GB 2 NAS CDN 1.5 Training cost = 0.085$/GB 1 better : ↓17.13% bandwidth cost ($) Total 0.5 for same quality 0 10 mins 1.4$/hour 0 10 20 30 40 50 60 = 0.23$/minute of video Total viewing time (hour) When the total viewing reaches 30 hours (per minute of video), NAS CDN recoups the initial training cost. 31 Heterogeneous Clients Each GPU processes at real-time BOLA RobustMPC Pensieve (> 30fps for all resolutions) Low Medium High Ultra-high DNN quality GPU model (Price) better Low GTX 1050 Ti ($139) Medium GTX 1060 ($249) High GTX 1070 Ti ($449) CDF GTX 1080 ($559) Ultra-high GTX 1080 Ti ($669) Titan Xp ($1,200) Average QoE NAS adapts to heterogeneous devices, and a device with higher computing power receives greater benefit. 32 Conclusion Large timescale redundancy Short [Standard codecs] [Content-aware DNN] • NAS presents a new type of QoE maximization & encoding via DNN • NAS accommodates four key designs: Content-aware DNN, Multiple quality DNNs, Scalable DNN, Integrated ABR. • NAS can improve user QoE or reduce the video delivery cost for CDN..