Large-Scale Trustworthy Distributed Collaboration Claudia-Lavinia Ignat
Total Page:16
File Type:pdf, Size:1020Kb
Large-scale trustworthy distributed collaboration Claudia-Lavinia Ignat To cite this version: Claudia-Lavinia Ignat. Large-scale trustworthy distributed collaboration. Distributed, Parallel, and Cluster Computing [cs.DC]. Université de Lorraine, 2021. tel-03229173 HAL Id: tel-03229173 https://hal.inria.fr/tel-03229173 Submitted on 18 May 2021 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Ecole´ doctorale IAEM Lorraine Large-scale trustworthy distributed collaboration THESE` pr´esent´eeet soutenue publiquement le 23 avril 2021 pour l'obtention d'une Habilitation de l'Universit´ede Lorraine (mention informatique) par Claudia-Lavinia Ignat Composition du jury Rapporteurs : M. Prasun DEWAN Professeur, University of North Carolina at Chapel Hill Mme Val´erieISSARNY Directrice de Recherche, Inria M. Fran¸coisTAIANI Professeur des Universit´es,Universit´ede Rennes I Examinateurs : Mme Sihem AMER-YAHIA Directrice de Recherche CNRS M. Fran¸coisCHAROY Professeur des Universit´es,Universit´ede Lorraine Mme Isabelle CHRISMENT Professeur des Universit´es,Universit´ede Lorraine M. Fabien GANDON Directeur de Recherche Inria M. Pascal MOLLI Professeur des Universit´es,Universit´ede Nantes Laboratoire Lorrain de Recherche en Informatique et ses Applications | UMR 7503 Mis en page avec la classe thesul. Acknowledgments To begin with, I would like to thank jury members for having accepted my invitation in this difficult period of pandemics. Many thanks to Prasun Dewan, Valérie Issarny and François Taiani for their encouraging reports on my work and their thorough comments and questions during my habilitation defense. Also I would like to thank Sihem Amer-Yahia, François Charoy, Isabelle Chrisment, Fabien Gandon, Pascal Molli for their very interesting and constructive questions during the defense. Special thanks to Isabelle Chrisment for having accepted to be the jury president and for all the administrative work. I would like to express my gratitude to François Charoy as a scientific guarantor for my habilitation but also for being a great team leader and for his support, confidence and freedom he gave me with respect to my research work. The work I report in this habilitation manuscript is the result of collaboration with several people. I would like to thank all my co-authors, specially Valerie Shalin (Wright State University) and Weihai Yu (UiT The Arctic University of Norway) for the fruitful ideas and discussions we had together. I also thank my PhD students Hien Thi Thu Truong, Quang Vinh Dang, Hoang Long Nguyen and Hoai Le Nguyen and all the other students that I supervised for having worked with me on the different parts of my research project. I would like to thank all permanent members of team Coast (Khalid Benali, François Charoy, Claude Godart, Gérald Oster, Olivier Perrin and the others) and all non permanent members (Luc André, Victorien Elvinger, Quentin Laporte-Chabasse, Stéphane Martin, Matthieu Nico- las and all the others that belonged to Ecoo/Score/Coast) for all scientific and non scientific discussions that made a very pleasant working environment. Many thanks to Claude Godart and Pascal Molli that welcomed me in their team at my arrival in France. I would like to thank Moira Norrie, my PhD supervisor, that played an important role in my professional life, for having inspired me with the beauties of a research career. Many special thanks to my husband and colleague Gérald Oster for his valuable support and encouragements during all these years and for the scientific discussions that sometimes expand even in the free time. I thank him as well as to our girls Julie, Natalie and Alexandra for all the love and happiness they bring to my life. Last but not least I would like to thank to my parents and my brother who gave me all their support and love even from far away. I specially dedicate this thesis to my father for his constant interest and concern on the evolution of my career and for his encouragements over the last years to write this manuscript. i ii to my father iii iv Contents 1 Motivation ..................................... 1 2 Research Work ................................... 3 2.1 Collaborative data management ..................... 4 2.2 Trustworthy collaboration ........................ 6 3 My journey ..................................... 6 4 Implication in research projects .......................... 9 5 Organisation of the manuscript .......................... 12 I Collaborative Data Management 13 Chapter 1 Literature Review 17 1.1 Optimistic replication ............................... 17 1.1.1 Last writer wins .............................. 17 1.1.2 Serialisation ................................ 17 1.1.3 Multi-versioning .............................. 18 1.1.4 Operational transformation ........................ 18 1.1.5 CRDT algorithms ............................. 24 1.1.6 Consistency maintenance for complex data ............... 25 1.2 Awareness and conflicts management ...................... 27 1.2.1 Conflicts prevention ............................ 27 1.2.2 Conflicts management ........................... 28 Chapter 2 Operational transformation for complex data 31 2.1 Document model .................................. 32 2.2 Operations ..................................... 34 2.3 Merging algorithm and transformation functions ................ 36 2.4 Complexity discussion ............................... 37 v Contents Chapter 3 Evaluation of optimistic replication approaches 39 3.1 Theoretical complexity of main optimistic replication algorithms ....... 40 3.2 Evaluation of optimistic replication algorithms with real traces of collaboration 41 3.3 Measurement of collaborative editors performance: a user perspective .... 43 3.4 Effect of delay on group performance: an experimental design ........ 46 Chapter 4 CRDTs for ordered sequences 51 4.1 Identifiers ...................................... 53 4.2 Insertion operation ................................. 53 4.3 Deletion operation ................................. 54 4.4 Correctness ..................................... 55 4.5 Implementation .................................. 56 4.5.1 Naive representation (LogootSplitNaive) .............. 56 4.5.2 String-based representation (LogootSplitString) .......... 57 4.5.3 Tree-based representation (LogootSplitAVL) ............ 57 4.5.4 Evaluation ................................. 59 Chapter 5 Conflicts prevention and resolution 61 5.1 Conflicts prevention ................................ 62 5.1.1 Localisation of concurrent changes .................... 62 5.1.2 Trade-off between awareness and privacy ................ 66 5.2 Conflicts management and their resolution (a case study) ........... 68 5.2.1 Integration rate and conflict rate ..................... 68 5.2.2 Conflict types ............................... 69 5.2.3 Conflict resolution ............................. 69 5.2.4 Adjacent-line conflicts ........................... 69 II Trustworthy Collaboration 73 Chapter 1 Literature Review 77 1.1 Contract-based models .............................. 77 1.2 Access control ................................... 78 vi 1.3 Usage Control ................................... 79 1.4 Log auditing .................................... 80 1.5 Trust measurement and trust game ....................... 80 1.6 Log Authentication ................................ 81 Chapter 2 Trust-based collaboration 85 2.1 The C-PPC model ................................. 86 2.1.1 Events and logs .............................. 86 2.1.2 Contracts .................................. 86 2.1.3 Log management and consistency maintenance during collaboration . 91 2.1.4 Log Auditing and Trust Assessment ................... 95 2.2 Experimental evaluation of the C-PPC model and auditing mechanism ... 100 2.2.1 Experiment 1 - Misbehavior detection .................. 100 2.2.2 Experiment 2 - Overhead estimation ................... 102 Chapter 3 Authentication of Logs in Multi-synchronous Collaboration 103 3.1 Description of the Collaboration model ..................... 103 3.2 Security properties to be ensured ......................... 104 3.3 Authenticators structure and construction .................... 106 3.4 Example of authenticators construction ..................... 108 3.5 Authenticators-based log verification ....................... 109 3.6 Space and time complexity for authenticator construction and log verification 110 3.7 Experimental evaluation .............................. 110 Chapter 4 Measuring trust score among users in trust game 113 4.1 Requirements .................................... 113 4.2 Current trust .................................... 113 4.3 Aggregated trust .................................. 114 4.4 Expected trust ................................... 115 4.5 Dealing with fluctuating behaviour ........................ 116 4.6 Evaluation of the trust metric .......................... 117 4.6.1 Simulations ................................. 117 4.6.2 Evaluation with real data ......................... 119 vii Contents Chapter 5 Influence of trust score on user behavior