Project-Team ASAP

IN PARTNERSHIP WITH: Institut national des sciences appliquées de Rennes Université Rennes 1 Activity Report 2014 Project-Team ASAP As Scalable As Possible: foundations of large scale dynamic distributed systems IN COLLABORATION WITH: Institut de recherche en informatique et systèmes aléatoires (IRISA) RESEARCH CENTER Rennes - Bretagne-Atlantique THEME Distributed Systems and middleware Table of contents 1. Members :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: 1 2. Overall Objectives :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: 2 2.1. General objectives2 2.1.1. Scalability2 2.1.2. Personalization2 2.1.3. Uncertainty2 2.1.4. Malicious behaviors and privacy2 2.2. Structure of the team3 2.2.1. Objective 1: Decentralized personalization3 2.2.2. Objective 2: Personalization: Cloud computing meets p2p3 2.2.3. Objective 3: Privacy-aware decentralized computations4 2.2.4. Objective 4: Information dissemination over social networks4 2.2.5. Objective 5: Computability and efficiency of distributed systems5 3. Research Program :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: 6 3.1. Distributed computing6 3.2. Theory of distributed systems6 3.3. Peer-to-peer overlay networks6 3.4. Epidemic protocols7 3.5. Malicious process behaviors7 3.6. Online social networks and recommender systems7 4. Application Domains ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::8 5. New Software and Platforms :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: 8 5.1. MediEgo: A recommendation solution for webmasters8 5.2. AllYours-P2P: A distributed news recommender (former WhatsUp)8 5.3. HyRec: A hybrid recommender system9 5.4. GossipLib: A library for gossip-based applications9 5.5. YALPS: A library for p2p applications 10 5.6. HEAP: Heterogeneity-aware gossip protocol 10 5.7. Brow2Brow: Browser-to-browser serverless toolboxes 10 5.8. WebGC: Web-based Gossip Communication 11 6. New Results ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: 11 6.1. Highlights of the Year 11 6.2. Models and abstractions for distributed systems 11 6.2.1. Signature-free asynchronous Byzantine consensus 11 6.2.2. Randomized mutual exclusion with constant amortized RMR complexity on the DSM 12 6.2.3. Reliable shared memory abstraction on top of asynchronous Byzantine message-passing systems 12 6.2.4. Distributed Universality 12 6.2.5. Computing in the presence of concurrent solo executions 13 6.2.6. A simple broadcast algorithm for recurrent dynamic systems 13 6.2.7. Fisheye consistency: Keeping data in synch in a georeplicated world 13 6.3. Large-scale and user-centric distributed systems 14 6.3.1. Archiving cold data in warehouses with clustered network coding 14 6.3.2. WebGC: Browser-based gossiping 14 6.3.3. Large-scale graph processing in datacenters with bandwidth guarantees 14 6.3.4. Scaling KNN computation over large graphs on a PC 15 6.3.5. Privacy-preserving distributed collaborative filtering 15 6.3.6. Behave: Behavioral cache for web content 15 6.3.7. HyRec: Leveraging browsers for scalable recommenders 15 2 Activity Report INRIA 2014 6.3.8. Landmark-based similarity for p2p collaborative filtering 16 6.3.9. Adaptation for the masses: Towards decentralized adaptation in large-scale p2p recommenders 16 6.3.10. Tight bounds for rumor spreading with vertex expansion 16 6.3.11. Greedy routing in small-world networks with power-law degrees 17 6.3.12. Randomized rumor spreading in dynamic graphs 17 6.3.13. Privacy-preserving dissemination in social networks and microblogs 17 6.3.14. Adaptive streaming 18 6.3.15. Predictive capabilities of social and interest affinity for recommendations 18 6.3.16. Polystyrene: The decentralized data shape that never dies 18 6.3.17. Link-prediction for very large scale graphs using distributed graph engines 19 6.3.18. GOSSIPKIT: A unified component framework for gossip 19 6.3.19. Towards a new model for cyber foraging 19 7. Bilateral Contracts and Grants with Industry ::::::::::::::::::::::::::::::::::::::::::::: 19 7.1. Technicolor 19 7.2. Orange Labs 20 7.3. Web Alter-Egos Google Focused Award 20 8. Partnerships and Cooperations ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: 20 8.1. National initiatives 20 8.1.1. LABEX CominLabs 20 8.1.2. ANR project SocioPlug 20 8.1.3. DeSceNt CominLabs 21 8.1.4. ANR Blanc project Displexity 21 8.2. European initiatives 21 8.2.1. FP7 & H2020 projects 21 8.2.2. Collaborations with major European organizations 21 8.3. International Initiatives 21 8.3.1. Inria associate teams 21 8.3.2. Inria international partners 22 8.4. International Research Visitors 22 8.4.1. Visits of international scientists 22 8.4.2. Internships 22 8.4.3. Visits to international teams 22 9. Dissemination ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: 23 9.1. Promoting scientific activities 23 9.2. Invited talks 25 9.3. Teaching, supervision, juries 25 9.3.1. Teaching 25 9.3.2. Supervision 26 9.3.3. Juries 26 10. Bibliography :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::27 Project-Team ASAP Keywords: Distributed System, Social Networks, Web Personalization, Theory Of Distributed Computing, Privacy, Big Data, Peer-to-peer Creation of the Project-Team: 2007 July 01. 1. Members Research Scientists Anne-Marie Kermarrec [Team leader, Inria, Senior Researcher, HdR] Davide Frey [Inria, Researcher] George Giakkoupis [Inria, Researcher] Faculty Members Marin Bertier [INSA Rennes, Associate Professor] Michel Raynal [Univ. Rennes I, Professor, HdR] François Taïani [Univ. Rennes I, Professor, HdR] Engineers Yuri Barssi [Univ. Rennes I, until Jun 2014, granted by FP7 EIT ICT LABS GA project] Heverson Borba Ribeiro [Inria, until Jun 2014, granted by FP7 EIT ICT LABS GA project] Antoine Boutet [Inria, until Jan 2014, granted by FP7 EIT ICT LABS GA project] Raziel Carvajal Gomez [Inria, granted by Brow2Brow ADT] Jacques Falcou [Inria, until May 2014, granted by FP7 EIT ICT LABS GA project] Arnaud Jégou [Inria, granted by FP7 EIT ICT LABS GA project] Xavier Lucas [Inria, from Oct 2014] Jean-François Verdonck [Inria, granted by FP7 EIT ICT LABS GA project] PhD Students Fabien André [Technicolor, granted by CIFRE] Resmi Ariyattu Chandrasekharannair [Univ. Rennes I] Simon Bouget [Normale Sup Rennes, from Sep 2014] Stéphane Delbruel [Univ. Rennes I] Ali Gouta [Orange Labs, granted by CIFRE] Nupur Mittal [Inria, granted by Google Focused Award Alter-Egos] Javier Olivares [Univ. Federico Santa Maria Valparaiso, external grant] Antoine Rault [Inria, granted by Conseil Régional de Bretagne] Pierre-Louis Roman [Univ. Rennes I, from Oct 2014] Julien Stainer [Univ. Rennes I] Post-Doctoral Fellows Zohir Bouzid [Univ. Rennes I, from Jun 2014] Nitin Chiluka [Univ. Rennes I] Juan Manuel Tirado Martin [Inria] Visiting Scientists Yahya Benkaouz [Inria, until Feb 2014] Maryam Helmi Khomeirani [Inria, from Aug 2014 until Sep 2014] Frederik Mallmann-Trenn [Inria, until Jul 2014] Diogo Saraiva Lima [Univ. Rennes I, from Jul 2014 until Aug 2014] Administrative Assistant Cécile Bouton [Inria, AER] Others 2 Activity Report INRIA 2014 Naman Goel [Inria, Internship, from Feb 2014 until Apr 2014, granted by Google Focused Award Alter-Egos] Mathieu Pasquet [Univ. Rennes I, Internship, from Feb 2014 until Jun 2014] Martin Sansoucy [Univ. Rennes I, Internship, from May 2014 until Aug 2014] 2. Overall Objectives 2.1. General objectives The ASAP project-team focuses its research on a number of aspects in the design of large-scale distributed systems. Our work, ranging from theory to implementation, aims to satisfy the requirements of large-scale distributed platforms, namely scalability, and dealing with uncertainty and malicious behaviors. The recent evolutions that the Internet has undergone yield new challenges in the context of distributed systems, namely the explosion of social networking, the prevalence of notification over search, the privacy requirements and the exponential growth of user-generated data introducing more dynamics than ever. 2.1.1. Scalability The past decade has been dominated by a major shift in scalability requirements of distributed systems and applications mainly due to the exponential growth of network technologies (Internet, wireless technology, sensor devices, etc.). Where distributed systems used to be composed of up to a hundred of machines, they now involve thousands to millions of computing entities scattered all over the world and dealing with a huge amount of data. In addition, participating entities are highly dynamic, volatile or mobile. Conventional distributed algorithms designed in the context of local area networks do not scale to such extreme configurations. The ASAP project aims to tackle these scalability issues with novel distributed protocols for large-scale dynamic environments. 2.1.2. Personalization The need for scalability is also reflected in the huge amounts of data generated by Web 2.0 applications. Their fundamental promise, achieving personalization, is limited by the enormous computing capacity they require to deliver effective services like storage, search, or recommendation. Only a few companies can afford the cost of the immense cloud platforms required to process users’ personal data and even they are forced to use off-line and cluster-based algorithms that operate on quasi-static data. This is not acceptable when building, for example, a large-scale news recommendation platform that must match a multitude of

Project-Team ASAP

42 Paxos Made Moderately Complex

A General Characterization of Indulgence

Velos: One-Sided Paxos for RDMA Applications

Distributed Computing Column 36 Distributed Computing: 2009 Edition

State Machine Replication Is More Expensive Than Consensus

The Impact of RDMA on Agreement

Byzantizing Paxos by Refinement

Table of Contents

The Information Structure of Indulgent Consensus

Epidemic Protocols: from Large Scale to Big Data Davide Frey

Project-Team ASAP

State Machine Replication Is More Expensive Than Consensus