Network Traffic Measurements, Applications to Internet Services
Total Page:16
File Type:pdf, Size:1020Kb
Politecnico di Torino Porto Institutional Repository [Doctoral thesis] Network Traffic Measurements, Applications to Internet Services and Security Original Citation: Bocchi, Enrico (2017). Network Traffic Measurements, Applications to Internet Services and Security. PhD thesis Availability: This version is available at : http://porto.polito.it/2665103/ since: February 2017 Published version: DOI:10.6092/polito/porto/2665103 Terms of use: This article is made available under terms and conditions applicable to Open Access Policy Article ("Public - All rights reserved") , as described at http://porto.polito.it/terms_and_conditions. html Porto, the institutional repository of the Politecnico di Torino, is provided by the University Library and the IT-Services. The aim is to enable open access to all the world. Please share with us how this access benefits you. Your story matters. (Article begins on next page) POLITECNICO DI TORINO TÉLÉCOM PARISTECH Ph.D. Course in • Electronics and Communications Engineering ScuDo Doctoral School • Computer Science École Doctorale Informatique, Télécommunication et Électronique (EDITE) de Paris Network Traffic Measurements Applications to Internet Services and Security Enrico Bocchi Thesis Supervisors Prof. Marco Mellia – Politecnico di Torino Prof. Dario Rossi – Télécom ParisTech Examination Committee Prof. Guillaume Urvoy-Keller – Université Nice Sophia Antipolis Prof. Joerg Widmer – IMDEA Networks Institute Prof. Lin Chen – Université Paris-Sud Prof. Antonio Pescapè – Università degli Studi di Napoli Federico II Prof. Sébastien Tixeuil – Université Pierre et Marie Curie, Paris 6 Doctoral Dissertation Doctoral Program in Electronics and Communications Engineering (29thcycle) Network Traffic Measurements Applications to Internet Services and Security By Enrico Bocchi ****** Supervisor(s): Prof. Marco Mellia, Supervisor – Politecnico di Torino Prof. Dario Rossi, Supervisor – Télécom ParisTech Doctoral Examination Committee: Prof. Guillaume Urvoy-Keller, Referee, Université Nice Sophia Antipolis Prof. Joerg Widmer, Referee, IMDEA Networks Institute Prof. Lin Chen, Examiner, Université Paris-Sud Prof. Antonio Pescapè, Examiner, Università degli Studi di Napoli Federico II Prof. Sébastien Tixeuil, Examiner, Université Pierre et Marie Curie – Paris 6 Politecnico di Torino 2017 2016-ENST-00xx EDITE - ED 130 Doctorat ParisTech THÈSE pour obtenir le grade de docteur délivré par TELECOM ParisTech Spécialité « Informatique et Réseaux » présentée et soutenue publiquement par Enrico BOCCHI le 3 Février 2017 Mesures du trafic réseau Applications aux Services Internet et à la Sécurité Directeur de thèse : Marco MELLIA – Politecnico di Torino, Italie Directeur de thèse : Dario ROSSI – Télécom ParisTech, France T Jury H M. Guillaume URVOY-KELLER, Professeur, Université Nice Sophia Antipolis Rapporteur È M. Joerg WIDMER, Professeur, IMDEA Networks Institute Rapporteur M. Lin CHEN, Maître de conférences, Université Paris-Sud Examinateur S M. Antonio PESCAPÈ, Professeur, Università degli Studi di Napoli Federico II Examinateur M. Sébastien TIXEUIL, Professeur, Université Pierre et Marie Curie – Paris 6 Examinateur E TELECOM ParisTech école de l’Institut Mines-Télécom - membre de ParisTech 46 rue Barrault 75013 Paris - (+33) 1 45 81 77 77 - www.telecom-paristech.fr Declaration I hereby declare that, the contents and organization of this dissertation constitute my own original work and does not compromise in any way the rights of third parties, including those relating to the security of personal data. Enrico Bocchi 2017 * This dissertation is presented in partial fulfillment of the requirements for Ph.D. degree in the Graduate School of Politecnico di Torino (ScuDo). Acknowledgments In the last three years, the “Acknowledgments” section of papers has usually been filled with a bunch of codes, acronyms, and project names. Indeed, if there is one thing on which my two supervisors agreed, that undoubtedly is, citing, “Never forget about the ones that are paying your ticket”. Besides this master tuition, the list of other memorable moments would take as many pages as the thesis itself. I sincerely thank Marco and Dario: I was smart enough to spark the rivalry between you two and put myself between a rock and a hard place. Apparently, we sorted it out and it has been a true pleasure to work with you. In my case, being a Ph.D. student also meant spend some time abroad working with other fellows. I was lucky enough in California to meet the great Cyber Innovation Team: Ignacio, Ale, Saby, Misko, Gaspar, Han, Ruben, and, above all, SJ. I was even luckier shortly after when at LINCS with Michele, Danilo, Andrea, Christian, Leonardo, Yixi, Luca, and the nightly Parisian crew. It was the best of times... I still and will always miss Paris, les promenades, les restaurants, les cafés, les fêtes, and the people I met there. Living abroad has the sad side-effect of having to say goodbye to friends, parents, and relatives in your hometown. Just a little later, you realize how they manage to be closer than ever, especially when you are far away. A special thanks to my dad Gianfranco and my aunt Eliana for supporting me so much in what I do. I would have never been able to reach this far without your help. Thank you, Dado, for your non-sense phone calls at crazy times trying to convince me of your latest ideas, unexpectedly evolving towards the legal side in recent times. And thank you, Sara, for having kept on believing in the two of us despite the distance and the entire weeks without seeing each other. We have been stronger than all of this, and I hope it is now the time to share the daily pieces of joy. At last, thanks to the guys with whom I shared when in Torino, Hassan, Danilo, Luca, Ali, Martino, Andrea, and on the upper-floor Stefano and Idilio. We started with ourbest intentions, but the office looks much better as a choir room. Along with this line, thanks to any bit of rock music from the late sixties to the whole seventies. You have a redeeming effect on me. Those were the days... There was no Internet ;-) Cos’è l’Internet? Ma... Niente... Internet è quando vai sul sito col www e il computer. Nell’Internet c’è il Facebook, il Myspace, il FarmVille... Dipende un po’ da quanto schiacci e come schiacci [il mouse, n.d.r.]: Se schiacci normalmente viene l’emmepitré. Se schiacci un pochino più intensamente viene il filmato, il videoclips. Se invece schiacci fortissimo viene la donna nuda. Rocco Tanica – Elio e le Storie Tese Teatro Massimo di Pescara, 28 marzo 2012 XII Table of contents List of figures XXI List of tables XXV Abstract 1 Résumé 3 1 Introduction5 1.1 Topics and Research Questions........................7 1.1.1 Benchmarking Personal Cloud Storage Services...........8 1.1.2 Network Security Threats.......................8 1.1.3 Web Quality of Experience.......................9 1.2 Approach.................................... 10 1.2.1 Passive Traffic Measurements..................... 11 1.2.2 Active Traffic Measurements...................... 15 1.2.3 Analytics – Deep Packet Inspection.................. 16 1.2.4 Analytics – Machine Learning..................... 17 1.2.5 Application Domains.......................... 21 1.3 Thesis Organization............................... 21 I Personal Cloud Storage 25 2 Cloud Storage Services 27 2.1 Introduction................................... 27 XIV Table of contents 2.2 Related Work.................................. 28 2.3 Contributions and Work Organization..................... 29 3 Personal Cloud Storage – Usage Patterns Characterization 31 3.1 Methodology and Dataset........................... 31 3.1.1 Dataset Description........................... 33 3.2 Characterization of Cloud Storage Usage................... 33 3.2.1 Connection Frequency......................... 33 3.2.2 Data Volumes.............................. 35 3.2.3 Usage Scenarios............................. 37 3.3 Performance................................... 37 3.3.1 Performance of Storage Clients.................... 39 3.3.2 Performance per Terminal Type.................... 41 3.4 Usage per Client Terminal........................... 41 3.4.1 Connection Frequency and Volume.................. 42 3.4.2 Workloads................................ 43 3.5 Conclusions................................... 44 4 Personal Cloud Storage – Benchmarks and Comparison 45 4.1 Measurement Methodology........................... 46 4.1.1 Goals................................... 46 4.1.2 Testbed and Tools........................... 46 4.1.3 Client Capabilities and Storage Protocols............... 47 4.1.4 Data Center Locations......................... 48 4.1.5 Benchmarking Performance...................... 49 4.1.6 Storage Services Under Test...................... 49 4.2 Client Capabilities............................... 50 4.2.1 Bundling................................. 51 4.2.2 Chunking................................ 52 4.2.3 Compression............................... 54 Table of contents XV 4.2.4 Client-Side Deduplication....................... 55 4.2.5 Delta Encoding............................. 56 4.2.6 P2P Synchronization.......................... 57 4.2.7 Client Capabilities in Brief....................... 58 4.3 System Design.................................. 58 4.3.1 Protocols and Overhead........................ 59 4.3.2 Data Center Topology......................... 60 4.3.3 System Design in Brief......................... 61 4.4 Benchmarks................................... 61 4.4.1 Workloads...............................