Measurement-Driven Algorithm and System Design for Wireless and Datacenter Networks

Measurement-Driven Algorithm and System Design for Wireless and Datacenter Networks

Measurement-Driven Algorithm and System Design for Wireless and Datacenter Networks Varun Gupta Submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy in the Graduate School of Arts and Sciences COLUMBIA UNIVERSITY 2017 c 2017 Varun Gupta All Rights Reserved ABSTRACT Measurement-Driven Algorithm and System Design for Wireless and Datacenter Networks Varun Gupta The growing number of mobile devices and data-intensive applications pose unique chal- lenges for wireless access networks as well as datacenter networks that enable modern cloud- based services. With the enormous increase in volume and complexity of traffic from ap- plications such as video streaming and cloud computing, the interconnection networks have become a major performance bottleneck. In this thesis, we study algorithms and architec- tures spanning several layers of the networking protocol stack that enable and accelerate novel applications and that are easily deployable and scalable. The design of these algo- rithms and architectures is motivated by measurements and observations in real world or experimental testbeds. In the first part of this thesis, we address the challenge of wireless content delivery in crowded areas. We present the AMuSe system, whose objective is to enable scalable and adaptive WiFi multicast. AMuSe is based on accurate receiver feedback and incurs a small control overhead. This feedback information can be used by the multicast sender to optimize multicast service quality, e.g., by dynamically adjusting transmission bitrate. Specifically, we develop an algorithm for dynamic selection of a subset of the multicast receivers as feedback nodes which periodically send information about the channel quality to the multicast sender. Further, we describe the Multicast Dynamic Rate Adaptation (MuDRA) algorithm that utilizes AMuSe's feedback to optimally tune the physical layer multicast rate. MuDRA balances fast adaptation to channel conditions and stability, which is essential for multimedia applications. We implemented the AMuSe system on the ORBIT testbed and evaluated its perfor- mance in large groups with approximately 200 WiFi nodes. Our extensive experiments demonstrate that AMuSe can provide accurate feedback in a dense multicast environment. It outperforms several alternatives even in the case of external interference and changing net- work conditions. Further, our experimental evaluation of MuDRA on the ORBIT testbed shows that MuDRA outperforms other schemes and supports high throughput multicast flows to hundreds of nodes while meeting quality requirements. As an example application, MuDRA can support multiple high quality video streams, where 90% of the nodes report excellent or very good video quality. Next, we specifically focus on ensuring high Quality of Experience (QoE) for video streaming over WiFi multicast. We formulate the problem of joint adaptation of multicast transmission rate and video rate for ensuring high video QoE as a utility maximization problem and propose an online control algorithm called DYVR which is based on Lyapunov optimization techniques. We evaluated the performance of DYVR through analysis, sim- ulations, and experiments using a testbed composed of Android devices and off the shelf APs. Our evaluation shows that DYVR can ensure high video rates while guaranteeing a low but acceptable number of segment losses, buffer underflows, and video rate switches. We leverage the lessons learnt from AMuSe for WiFi to address the performance is- sues with LTE evolved Multimedia Broadcast/Multicast Service (eMBMS). We present the Dynamic Monitoring (DyMo) system which provides low-overhead and real-time feedback about eMBMS performance. DyMo employs eMBMS for broadcasting instructions which indicate the reporting rates as a function of the observed Quality of Service (QoS) for each UE. This simple feedback mechanism collects very limited QoS reports which can be used for network optimization. We evaluated the performance of DyMo analytically and via simulations. DyMo infers the optimal eMBMS settings with extremely low overhead, while meeting strict QoS requirements under different UE mobility patterns and presence of network component failures. In the second part of the thesis, we study datacenter networks which are key enablers of the end-user applications such as video streaming and storage. Datacenter applications such as distributed file systems, one-to-many virtual machine migrations, and large-scale data processing involve bulk multicast flows. We propose a hardware and software system for enabling physical layer optical multicast in datacenter networks using passive optical splitters. We built a prototype and developed a simulation environment to evaluate the performance of the system for bulk multicasting. Our evaluation shows that the optical multicast architecture can achieve higher throughput and lower latency than IP multicast and peer-to-peer multicast schemes with lower switching energy consumption. Finally, we study the problem of congestion control in datacenter networks. Quantized Congestion Control (QCN), a switch-supported standard, utilizes direct multi-bit feedback from the network for hardware rate limiting. Although QCN has been shown to be fast- reacting and effective, being a Layer-2 technology limits its adoption in IP-routed Layer 3 datacenters. We address several design challenges to overcome QCN feedback's Layer- 2 limitation and use it to design window-based congestion control (QCN-CC) and load balancing (QCN-LB) schemes. Our extensive simulations, based on real world workloads, demonstrate the advantages of explicit, multi-bit congestion feedback, especially in a typical environment where intra-datacenter traffic with short Round Trip Times (RTT: tens of µs) run in conjunction with web-facing traffic with long RTTs (tens of milliseconds). Table of Contents List of Figures iv List of Tables xiii 1 Introduction 1 1.1 Background . .2 1.2 Contributions . .6 1.3 Contributions to Literature . 12 I Adaptive Multicast Services 14 2 LIGHT-WEIGHT FEEDBACK FOR WIRELESS MULTICAST 15 2.1 Introduction . 15 2.2 Related work . 21 2.3 Network Setting . 24 2.4 Objective . 25 2.5 The AMuSe Feedback Mechanism . 26 2.6 Experimental Evaluation of Testbed Environment . 32 2.7 Feedback Node Selection . 40 2.A Proof of Proposition 1 . 48 3 MULTICAST DYNAMIC RATE ADAPTATION 49 3.1 Introduction . 49 3.2 Related Work . 53 i 3.3 Testbed and Key Observations . 54 3.4 Network Model and Objective . 55 3.5 Multicast Rate Adaptation . 57 3.6 Reporting Interval Duration . 66 3.7 Experimental Evaluation . 68 3.8 Demonstration Application . 76 4 OPTIMIZING VIDEO QoE FOR MULTICAST STREAMING 79 4.1 Introduction . 79 4.2 Related Work . 83 4.3 Model and Problem Formulation . 84 4.4 Online Transmission and Video Rate Adaptation . 89 4.5 Numerical Evaluations . 93 4.6 Implementation and Experimental Evaluation . 97 5 DYNAMIC MONITORING OF LARGE SCALE LTE-eMBMS 107 5.1 Introduction . 107 5.2 Related Work . 113 5.3 Model and Objective . 115 5.4 The DyMo System . 116 5.5 Algorithms for SNR Threshold Estimation . 119 5.6 Performance Evaluation . 125 5.7 Conclusion . 138 II Datacenter Networks 142 6 OPTICAL MULTICAST FOR DATACENTER NETWORKS 143 6.1 Introduction . 144 6.2 Architecture and implementation . 148 6.3 Control plane evaluation . 155 6.4 System evaluation . 159 ii 6.5 Paxos with optical multicast . 168 6.6 Optical incast . 169 7 QCN BASED DATACENTER CONGESTION CONTROL 172 7.1 Introduction . 172 7.2 Related Work . 175 7.3 Background . 176 7.4 Design . 179 7.5 Evaluation . 187 III Conclusions 199 IV Bibliography 204 iii List of Figures 1.1 An overview of the wireless and datacenter networks . .2 1.2 (a) A block diagram of the contributions to adaptive wireless multicast for both WiFi and cellular networks: a light-weight feedback mechanism, mul- ticast dynamic rate adaptation, loss recovery and Forward Error Correction (FEC), and video rate adaptation, (b) A heatmap of the average Packet Delivery Ratio (PDR) values for 200 nodes receiving multicast data from a single Access Point in the ORBIT testbed. .7 2.1 The AMuSe feedback mechanism (highlighted in red) as a part of the overall AMuSe system. 16 2.2 Feedback node selection by AMuSe. A node with the poorest channel quality in every neighborhood is selected as a Feedback node. Each feedback node periodically sends updates about the service quality to the Access Point. 17 2.3 Unreliable packet delivery by the LBP and the Pseudo-Broadcast approach. 23 2.4 State diagram of the AMuSe FB node selection algorithm at each node. All nodes initialize in the VOLUNTEER state. 27 2.5 An example of a wireless network a single AP and 4 receivers. All 3 re- quirements described in Section 2.5 for an accurate feedback selection are important for this example. 30 iv 2.6 Link Quality (LQ) and Packet Delivery Ratio (PDR) heatmaps at the AP for D = 6 meters with transmission bitrate of 12 Mbps and noise level of -70 dBm and -35 dBm. The FB nodes are highlighted with a thick border in red in the LQ heatmap and in blue in the PDR heatmap. Empty locations represent nodes that did not produce LQ or PDR reports and they are excluded from our experiments. Nodes with P DR = 0 are active nodes that reported LQ values but were unable to decode packets. These nodes are excluded from the FB node selection process. Note that the minimum threshold below which a node does not become an FB node is configurable............... 32 2.7 Experimental results for testing hypothesis H1 and verifying the presence of abnormal nodes. 36 2.8 Experimental results for testing hypotheses H2{H3: (a) LQ STD: varying TXAP without noise, cluster size = 3m, (b) PDR STD: varying TXAP with- out noise, cluster size = 3m, (c) LQ STD: varying TXAP without noise, cluster size = 6m, (d) PDR STD: varying TXAP without noise, cluster size = 6m, (e) LQ STD: varying noise, TXAP = 12 Mbps, cluster size = 3m, and (f) PDR STD: varying noise, TXAP = 12 Mbps, cluster size = 3m.....

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    248 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us