Chapter 10 Peer-To-Peer Systems

Total Page:16

File Type:pdf, Size:1020Kb

Chapter 10 Peer-To-Peer Systems CSF661 – Distributed Systems 分散式系統 Chapter 10 Peer-to-Peer Systems 吳俊興 國立高雄大學 資訊工程學系 Chapter 10 Peer-to-Peer Systems 10.1 Introduction 10.2 Napster and its legacy 10.3 Peer-to-peer middleware 10.4 Routing overlays 10.5 Overlay case studies: Pastry, Tapestry 10.6 Application case studies – Squirrel web cache – OceanStore file store – Ivy file system 10.7 Summary 2 10.1 Introduction Paradigm Shift of Computing System Models 1980~ 1990~ 2000~ Terminal-Mainframe Client-Server Peer-to-Peer (Super-computing) (Micro-computing (Macro-computing) /Personal Computer) RS-232 Dialup/10M Ethernet ADSL/100M+ Ethernet VT100/DOS Windows 31/95 Linux/Windows XP 3 Client-server Model Clients and servers each with distinct roles Request Clients Server U11 Service U12 S U13 U21 The server and the network U22 become the bottlenecks and points of failure U32 U31 •DDoS •Flash Crowd 4 Peer-to-peer Model “ Peer-to-Peer (P2P) is a way of structuring distributed applications such that the individual nodes have symmetric roles. Rather than being divided into clients and servers each with quite distinct roles, in P2P applications a node may act as both a client and a server.” Excerpt from the Charter of Peer-to-Peer Research Group, IETF/IRTF, June 24, 2003 http://www.irtf.org/charters/p2prg.html Peers play similar roles No distinction of responsibilities 5 Content Distribution Networks U11 U12 CR1 S U13 CRa CRb U21 Overlay Network CR2 U22 CR3 Hosting + Hierarchical Proxies Content Router + DNS Request Routing CR or Peer Node U32 U31 (Akamai, CacheFlow, etc.) Name: lb1.www.ms.akadns.net Addresses: 207.46.20.60, 207.46.18.30, 207.46.19.30, 207.46.19.60, 207.46.20.30 Aliases: www.microsoft.com, toggle.www.ms.akadns.net, g.www.ms.akadns.net Name: www.yahoo.akadns.net Addresses: 66.94.230.33, 66.94.230.34, 66.94.230.35, 66.94.230.39, 66.94.230.40, … Aliases: www.yahoo.com Name: e96.g.akamaiedge.net Address: 202.177.217.122 Aliases: www.gio.gov.tw, www.gio.gov.tw.edgekey.net Name: a1289.g.akamai.net Addresses: 203.133.9.9, 203.133.9.11 Aliases: www.whitehouse.gov, www.whitehouse.gov.edgesuite.net 6 Characteristics of P2P Systems • Ensures that each user contributes resources to the system • All the nodes have the same functional capabilities and responsibilities • Their correct operation does not depend on the existence of any centrally-administered systems • They can be designed to offer a limited degree of anonymity to the providers and users of resources • A key issue: placement of data across many hosts – efficiency – load balance – availability 7 Generations • Early services – DNS, Netnews/Usenet – Xerox Grapevine name/mail service – Lamport’s part-time parliament algorithm – Bayou replicated storage system – classless inter-domain IP routing algorithm • 1st generation – centralized search – Napster • 2nd generation – decentralized, unstructured – Freenet, Gnutella, Kazaa, BitTorrent • 3rd generation – decentralized, structured – P2P middleware: Pastry, Tapestry, CAN, Chord, Kademlia 8 Example of Centralized P2P Systems: Napster • Announced in January 1999 by Shawn Fanning for sharing MP3 files and pulled plug in July 2001 • Centralized server for search, Directory direct file transfer among peer nodes • Proprietary client-server protocol P P and client-client protocol • Relying on the user to choose a ‘best’ source • Disruptive, proof of concepts • IPR and firewall issues 9 Example of Decentralized P2P Systems: Gnutella • Open source • 3/14/2000: Released by NullSoft/AOL, almost 2 2 2 2 immediately withdrawn, and 1 became open source 1 Client 3 2 • Message flooding: serverless, C S decentralized search by 1 1 Server 3 message broadcast, direct file transfer using HTTP • Limited-scope query Servent (=Serv er + Cli ent) 10 Figure 10.12: Key elements in the Gnutella protocol 11 Example of Unstructured P2P Systems: Freenet • Ian Clarke, Scotland, 2000 • Distributed depth-first search, Exhaustive search • File hash key, lexicographically closest 5 match 6 • Store-and-forward file transfer C S 3 1 2 • Anonymity 4 • Open source 12 Example of Hybrid P2P Systems: FastTrack / KaZaA • Proprietary software developed by FastTrack in Amsterdam and licensed to many companies • Summer 2001, Sharman networks, founded in Vanuatu, acquires FastTrack Supernodes in overlay mesh • Hierarchical supernodes (Ultra- C peers) • Dedicated authentication server and supernode list server S • From user’s perspective, it’s like Google. • Encrypted files and control data transported using HTTP • Parallel download • Automatically switch to new server 13 Example of Structured P2P Systems: Chord 0 15 1 • Frans Kaashoek, et. al., MIT, 14 2 2001 • IRIS: Infrastructure for Resilient 13 3 Internet Systems, 2003 12 4 • Distributed Hashing Table • Scalable lookup service 11 5 • Hyper-cubic structure 10 6 9 8 7 14 Figure 10.11: Structured versus unstructured peer-to-peer systems Benefits from P2P •Theory – Dynamic discovery of information – Better utilization of bandwidth, processor, storage, and other resources – Each user contributes resources to network • Practice examples – Sharing browser cache over 100Mbps lines – Disk mirroring using spare capacity – Deep search beyond the web 16 Figure 10.1 Distinctions between IP and overlay routing for P2P applications IP Application-level routing overlay Scale IPv4 is limited to 232 addressable nodes. The Peer-to-peer systems can address more objects. IPv6 name space is much more generous The GUID name space is very large and flat (2128), but addresses in both versions are (>2128), allowing it to be much more fully hierarchically structured and much of the space occupied. is pre-allocated according to administrative requirements. Load balancing Loads on routers are determined by network Object locations can be randomized and hence topology and associated traffic patterns. traffic patterns are divorced from the network topology. Network dynamics IP routing tables are updated asynchronously on Routing tables can be updated synchronously or (addition/deletion of a best-efforts basis with time constants on the asynchronously with fractions of a second objects/nodes) order of 1 hour. delays. Fault tolerance Redundancy is designed into the IP network by Routes and object references can be replicated its managers, ensuring tolerance of a single n-fold, ensuring tolerance of n failures of nodes router or network connectivity failure. n-fold or connections. replication is costly. Target identification Each IP address maps to exactly one target Messages can be routed to the nearest replica of node. a target object. Security and anonymity Addressing is only secure when all nodes are Security can be achieved even in environments trusted. Anonymity for the owners of addresses with limited trust. A limited degree of is not achievable. anonymity can be provided. 17 Distributed Computation • Only a small portion of the CPU cycles of most computers is utilized. Most computers are idle for the greatest portion of the day, and many of the ones in use spend the majority of their time waiting for input or a response. • A number of projects have attempted to use these idle CPU cycles. The best known is the SETI@home project, but other projects including code breaking have used idle CPU cycles on distributed machines. • SETI@home project: a scientific experiment that uses Internet-connected computers to analyze radio telescope data in the Search for Extraterrestrial Intelligence (SETI) http://setiathome.ssl.berkeley.edu/ 18 Dangers and Attacks on P2P • Poisoning (files with contents different to description) • Polluting (inserting bad packets into the files) • Defection (users use the service without sharing) • Insertion of viruses (attached to other files) • Malware (originally attached to the files) • Denial of Service (slow down or stop the network traffic) • Filtering (some networks don’t allow P2P traffic) • Identity attacks (tracking down users and disturbing them) • Spam (sending unsolicited information) 19 10.2 Napster and its legacy • The first large scale peer-to-peer network was Napster, set up in 1999 to share digital music files over the Internet. While Napster maintained centralized (and replicated) indices, the music files were created and made available by individuals, usually with music copied from CDs to computer files. Music content owners sued Napster for copyright violations and succeeded in shutting down the service. Figure 10.2 documents the process of requesting a music file from Napster. 20 Figure 10.2 Napster: peer-to-peer file sharing peers Napster server Napster server Index 1. File location Index request 3. File request 2. List of peers offering the file 5. Index update 4. File delivered 21 Napster: Lessons Learned • Napster created a network of millions of people, with thousands of files being transferred at the same time. • There were quality issues. While Napster displayed link speeds to allow users to choose faster downloads, the fidelity of recordings varied widely. • Since Napster users were parasites of the recording companies, there was some central control over selection of music. One benefit was that music files did not need updates. • There was no guarantee of availability for a particular item of music. 22 10.3 Middleware for Peer-to-Peer • A key problem in Peer-to-Peer applications is to provide a way for clients to access data resources efficiently. – Similar needs in client/server technology led to solutions like NFS. – However, NFS relies on pre-configuration and is not scalable enough for peer-to-peer. • Peer clients need to locate and communicate with any available resource, even though resources may be widely distributed and configuration may be dynamic, constantly adding and removing resources and connections. 23 Non-Functional Requirements for Peer-to-Peer Middleware • Global Scalability • Load Balancing • Local Optimization • Adjusting to dynamic host availability • Security of data • Anonymity, deniability, and resistance to censorship (in some applications) 24 10.4 Routing Overlays •A routing overlay is a distributed algorithm for a middleware layer responsible for routing requests from any client to a host that holds the object to which the request is addressed.
Recommended publications
  • A Content-Addressable Network for Similarity Search in Metric Spaces Fabrizio Falchi
    University of Pisa Masaryk University Faculty of Informatics Department of Department of Information Engineering Information Technologies Doctoral Course in Doctoral Course in Information Engineering Informatics Ph.D. Thesis A Content-Addressable Network for Similarity Search in Metric Spaces Fabrizio Falchi Supervisor Supervisor Supervisor Lanfranco Lopriore Fausto Rabitti Pavel Zezula 2007 Abstract Because of the ongoing digital data explosion, more advanced search paradigms than the traditional exact match are needed for content- based retrieval in huge and ever growing collections of data pro- duced in application areas such as multimedia, molecular biology, marketing, computer-aided design and purchasing assistance. As the variety of data types is fast going towards creating a database utilized by people, the computer systems must be able to model hu- man fundamental reasoning paradigms, which are naturally based on similarity. The ability to perceive similarities is crucial for recog- nition, classification, and learning, and it plays an important role in scientific discovery and creativity. Recently, the mathematical notion of metric space has become a useful abstraction of similarity and many similarity search indexes have been developed. In this thesis, we accept the metric space similarity paradigm and concentrate on the scalability issues. By exploiting computer networks and applying the Peer-to-Peer communication paradigms, we build a structured network of computers able to process similar- ity queries in parallel. Since no centralized entities are used, such architectures are fully scalable. Specifically, we propose a Peer- to-Peer system for similarity search in metric spaces called Met- ric Content-Addressable Network (MCAN) which is an extension of the well known Content-Addressable Network (CAN) used for hash lookup.
    [Show full text]
  • Securing Structured Overlays Against Identity Attacks
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, VOL. 20, NO. 10, OCTOBER 2009 1487 Securing Structured Overlays against Identity Attacks Krishna P.N. Puttaswamy, Haitao Zheng, and Ben Y. Zhao Abstract—Structured overlay networks can greatly simplify data storage and management for a variety of distributed applications. Despite their attractive features, these overlays remain vulnerable to the Identity attack, where malicious nodes assume control of application components by intercepting and hijacking key-based routing requests. Attackers can assume arbitrary application roles such as storage node for a given file, or return falsified contents of an online shopper’s shopping cart. In this paper, we define a generalized form of the Identity attack, and propose a lightweight detection and tracking system that protects applications by redirecting traffic away from attackers. We describe how this attack can be amplified by a Sybil or Eclipse attack, and analyze the costs of performing such an attack. Finally, we present measurements of a deployed overlay that show our techniques to be significantly more lightweight than prior techniques, and highly effective at detecting and avoiding both single node and colluding attacks under a variety of conditions. Index Terms—Security, routing protocols, distributed systems, overlay networks. Ç 1INTRODUCTION S the demand for Internet and Web-based services multihop lookup mechanism called key-based routing (KBR) Acontinues to grow, so does the scale of the computing [8]. KBR maps a given key to
    [Show full text]
  • Kademlia on the Open Internet
    Kademlia on the Open Internet How to Achieve Sub-Second Lookups in a Multimillion-Node DHT Overlay RAUL JIMENEZ Licentiate Thesis Stockholm, Sweden 2011 TRITA-ICT/ECS AVH 11:10 KTH School of Information and ISSN 1653-6363 Communication Technology ISRN KTH/ICT/ECS/AVH-11/10-SE SE-164 40 Stockholm ISBN 978-91-7501-153-0 SWEDEN Akademisk avhandling som med tillstånd av Kungl Tekniska högskolan framlägges till offentlig granskning för avläggande av Communication Systems fredag den 9 december 2011 klockan 10.00 i C2, Electrum, Kungl Tekniska högskolan, Forum, Isafjordsgatan 39, Kista. © Raul Jimenez, December 2011 This work is licensed under a Creative Commons Attribution 2.5 Sweden License. http://creativecommons.org/licenses/by/2.5/se/deed.en Tryck: Universitetsservice US AB iii Abstract Distributed hash tables (DHTs) have gained much attention from the research community in the last years. Formal analysis and evaluations on simulators and small-scale deployments have shown good scalability and per- formance. In stark contrast, performance measurements in large-scale DHT overlays on the Internet have yielded disappointing results, with lookup latencies mea- sured in seconds. Others have attempted to improve lookup performance with very limited success, their lowest median lookup latency at over one second and a long tail of high-latency lookups. In this thesis, the goal is to to enable large-scale DHT-based latency- sensitive applications on the Internet. In particular, we improve lookup la- tency in Mainline DHT, the largest DHT overlay on the open Internet, to identify and address practical issues on an existing system.
    [Show full text]
  • Effective Manycast Messaging for Kademlia Network
    Effective Manycast Messaging for Kademlia Network Lubos Matl Tomas Cerny Michael J. Donahoo Czech Technical University Czech Technical University Baylor University Charles square 13, 121 35 Charles square 13, 121 35 Waco, TX, US Prague 2, CZ Prague 2, CZ [email protected] [email protected] [email protected] ABSTRACT General Terms Peer-to-peer (P2P) communication plays an ever-expanding Measurement, Reliability, Performance, Experimentation role in critical applications with rapidly growing user bases. In addition to well-known P2P systems for data sharing Keywords (e.g., BitTorrent), P2P provides the core mechanisms in VoIP (e.g., Skype), distributed currency (e.g., BitCoin), etc. Distributed systems, performance, communication, P2P, There are many communication commonalities in P2P ap- DHT, key-based search plications; consequently, we can factor these communication primitives into overlay services. Such services greatly sim- 1. INTRODUCTION plify P2P application development and even allow P2P in- P2P networking is a widely-known architectural pattern frastructures to host multiple applications, instead of each for distributed systems. P2P has been applied to a variety of having its own network. Note well that such services must be applications, including file sharing (BitTorrent [10]), video both self-scaling and robust to meet the needs of large, ad- or audio streaming (SplitStream [5], Coolstream [22]), par- hoc user networks. One such service is Distributed Hash Ta- allel computation, online payment systems (Bitcoin [16]), ble (DHT) providing a dictionary-like location service, useful or voice-over-IP services (Skype). All these applications in many types of P2P applications. Building on the DHT demonstrate that the architecture is important and useful primitives for search and store, we can add even more power- for multiple domains.
    [Show full text]
  • Distributed Systems – Lectures
    377 P2P Systems 11.1 Introduction & overview 11 Peer-to-Peer Systems1 11.1 Introduction & overview Client-server Model • Clients and servers each with distinct roles • The server and the network become the bottlenecks and points of failure Peer-to-peer Model “Peer-to-Peer (P2P) is a way of structuring distributed applications such that the individual nodes have symmetric roles. Rather than being divided into clients and servers each with quite distinct roles, in P2P applications a node may act as both a client and a server.” (Excerpt from the Charter of Peer-to-Peer Research Group, IETF/IRTF, June 24, 2003 http://www.irtf.org/charters/p2prg.html) 1Textbook Chapter 10 378 P2P Systems 11.1 Introduction & overview • Peers play similar roles • No distinction of responsibilities Key characteristics of P2P Systems: • Ensures that each user contributes resources to the system • All the nodes have the same functional capabilities and responsibilities • Their correct operation does not depend on the existence of any centrally- administered systems • They can be designed to offer a limited degree of anonymity to the providers and users of resources • A key issue: placement of data across many hosts – efficiency 379 P2P Systems 11.1 Introduction & overview – load balancing – availability Generations • Early services – DNS, Netnews/Usenet – Xerox Grapevine name/mail service – Lamport’s part-time parliament algorithm for distributed consesnus – Bayou replicated storage system – classless inter-domain IP routing algorithm • 1st generation – centralized
    [Show full text]
  • Tapestry: a Resilient Global-Scale Overlay for Service Deployment Ben Y
    IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, VOL. 22, NO. 1, JANUARY 2004 41 Tapestry: A Resilient Global-Scale Overlay for Service Deployment Ben Y. Zhao, Ling Huang, Jeremy Stribling, Sean C. Rhea, Anthony D. Joseph, Member, IEEE, and John D. Kubiatowicz, Member, IEEE Abstract—We present Tapestry, a peer-to-peer overlay routing delivery to mobile or replicated endpoints in the presence of infrastructure offering efficient, scalable, location-independent instability in the underlying infrastructure. As a result, a DOLR routing of messages directly to nearby copies of an object or network provides a simple platform on which to implement service using only localized resources. Tapestry supports a generic decentralized object location and routing applications program- distributed applications—developers can ignore the dynamics ming interface using a self-repairing, soft-state-based routing of the network except as an optimization. Already, Tapestry has layer. This paper presents the Tapestry architecture, algorithms, enabled the deployment of global-scale storage applications and implementation. It explores the behavior of a Tapestry such as OceanStore [4] and multicast distribution systems such deployment on PlanetLab, a global testbed of approximately 100 as Bayeux [5]; we return to this in Section VI. machines. Experimental results show that Tapestry exhibits stable Tapestry is a peer-to-peer (P2P) overlay network that pro- behavior and performance as an overlay, despite the instability of the underlying network layers. Several widely distributed vides high-performance, scalable, and location-independent applications have been implemented on Tapestry, illustrating its routing of messages to close-by endpoints, using only localized utility as a deployment infrastructure.
    [Show full text]
  • Topology-Aware Routing in Structured Peer-To-Peer Overlay Networks
    Topology-aware routing in structured peer-to-peer overlay networks ¡ ¢ Miguel Castro Peter Druschel Y. Charlie Hu Antony Rowstron Microsoft Research, 7 J J Thomson Close, Cambridge, CB3 0FB, UK. ¡ Rice University, 6100 Main Street, MS-132, Houston, TX 77005, USA. ¢ Purdue University, 1285 EE Building, West Lafayette, IN 47907, USA. Abstract tant distinction lies in the approach they take to consider- ing and exploiting proximity in the underlying Internet. Structured peer-to-peer (p2p) overlay networks like Chord in its original design, for instance, does not con- CAN, Chord, Pastry and Tapestry offer a novel platform sider network proximity at all. As a result, its protocol for a variety of scalable and decentralized distributed ap- for maintaining the overlay network is very light-weight, plications. They provide efficient and fault-tolerant rout- but messages may travel arbitrarily long distances in the ing, object location and load balancing within a self- Internet in each routing hop. organizing overlay network. One important aspect of In a version of CAN, each node measures its network these systems is how they exploit network proximity in delay to a set of landmark nodes, in an effort to deter- the underlying Internet. We present a study of topology- mine its relative position in the Internet and to construct aware routing approaches in p2p overlays, identify prox- an Internet topology-aware overlay. Tapestry and Pastry imity neigbor selection as the most promising technique, construct a topology-aware overlay by choosing nearby and present an improved design in Pastry. Results ob- nodes for inclusion in their routing tables.
    [Show full text]
  • Master Thesis
    Hochschule für Angewandte Wissenschaften Hamburg Hamburg University of Applied Sciences Master Thesis Tran, Thanh Minh Tu Mobile P2P Audio Network for the iPhone and iPod Touch Fakultät Technik und Informatik Faculty of Engineering and Computer Science Studiendepartment Informations-und Department of Information and Elektrotechnik Electrical Engineering Tran, Thanh Minh Tu Mobile P2P Audio Network for the iPhone and iPod Touch Master thesis based on the examination and study regulations for the Master of Engineering degree programme Information Engineering at the Department of Information and Electrical Engineering of the Faculty of Engineering and Computer Science of the University of Applied Sciences Hamburg Supervising examiner: Prof. Dr. Thomas C. Schmidt Second examiner: Prof. Dr. Hans-Jürgen Hotop Day of delivery December 7th, 2009 Tran, Thanh Minh Tu Thema der Master Thesis Mobile P2P Audio Network for the iPhone and iPod Touch Stichworte Peer-to-Peer Systems, Overlay Network, Application Layer Multicast Systems, Audio Live Streaming Technologies, iPhone Audio Processing Zusammenfassung Gruppenkommunikation ermöglicht eine große Anzahl von Anwendungen, z.B. Sprach- und Videokonferenzen, Radio- und TV-Broadcasts, verteilte Gruppenspiele etc., und wird aus diesem Grund sowohl in kommerziellen Entwicklungen wie Forschungsprojekten häufig zum Untersuchungsgegenstand. Während IP Multicast die Verkehrslasten im Netzwerk minimiert, ist eine globale Verbreitung bisher hinter den Erwartungen zurückgeblieben. Application Layer Multicast bietet hier eine alternative Lösungsmöglichkeit zur Content- Verteilung, da es ausschliesslich auf der Basis einer Peer-to-Peer Overlay-Schicht arbeitet. Viele Kommunikations- und Unterhaltungsdienste (z.B. Skype, Zattoo) verwenden ALM in der Realisierung. Mit den gestiegenen Hardware- und Netzwerkressourcen der kleinen elektronischen Geräte wie SmartPhones und PDAs ist es nun möglich, ALM Lösungen auch auf Handhelds zu realisieren.
    [Show full text]
  • A Content-Addressable Network for Similarity Search in Metric Spaces
    UNIVERSITY OF PISA MASARYK UNIVERSITY FACULTY OF INFORMATICS ü & DEPARTMENT OF DEPARTMENT OF INFORMATION ENGINEERING INFORMATION TECHNOLOGIES DOCTORAL COURSE IN DOCTORAL COURSE IN INFORMATION ENGINEERING INFORMATICS PH.D. THESIS A Content-Addressable Network for Similarity Search in Metric Spaces Fabrizio Falchi SUPERVISOR SUPERVISOR SUPERVISOR Lanfranco Lopriore Fausto Rabitti Pavel Zezula 2007 Abstract Because of the ongoing digital data explosion, more advanced search paradigms than the traditional exact match are needed for content- based retrieval in huge and ever growing collections of data pro­ duced in application such as multimedia, molecular biology, marketing, computer-aided design and purchasing assistance. As the variety of data types is fast going towards creating a database utilized by people, the computer systems must be able to model hu­ man fundamental reasoning paradigms, which are naturally based on similarity. The ability to perceive similarities is crucial for recog­ nition, classification, and learning, and it plays an important role in scientific discovery and creativity. Recently, the mathematical notion of metric space has become a useful abstraction of similarity and many similarity search indexes have been developed. In this thesis, we accept the metric space similarity paradigm and concentrate on the scalability issues. By exploiting computer networks and applying the Peer-to-Peer communication paradigms, we build a structured network of computers able to process similar­ ity queries in parallel. Since no centralized entities are used, such architectures are fully scalable. Specifically, we propose a Peer- to-Peer system for similarity search in metric spaces called Met­ ric Content-Addressable Network (MCAN) which is an extension of the well known Content-Addressable Network (CAN) used for hash lookup.
    [Show full text]
  • Distributed Systems: Concepts and Design Slides for Chapter 10: Peer
    Slides for Chapter 10: Peer-to-Peer Systems From Coulouris, Dollimore, Kindberg and Blair Distributed Systems: Concepts and Design Edition 5, © Addison-Wesley 2012 Overview of Chapter • Introduction • Napster and its legacy • Peer-to-peer middleware • Routing overlays • Overlay case studies: Pastry, Tapestry • Application case studies: Squirrel, OceanStore, Ivy 2 Introduction Goal: • Provide fully decentralized and self-organizing, dynamically balancing the storage and processing loads as nodes join and leave Characteristics of peer-to-peer systems: • Each user contributes resources (files, computing cycles, etc.) • Each node has similar functional capability • No centrally administered systems • Provide anonymity to providers and users of resources • Require algorithms for data placement, and for workload balances so that nodes do not suffer undue overhead 3 Introduction (cont.) Three generations of peer-to-peer systems: • Napster (music exchange) • Freenet, Gnutella, Kazaa, BitTorrent (file sharing) • Pastry, Tapestry, CAN, Chord, Kademlia (peer-to-peer middleware • Resources identified by GUIDs (Globally Unique Identifiers) using secure hashing • Suitable for storing immutable objects (music, video) • Overlay (application-level) routing used instead of IP routing 4 Figure 10.1: Distinctions between IP and overlay routing for peer-to- peer applications IP Application-level routing overlay Scale IPv4 is limited to 232 addressablen odes. The Peer-to-peer systems can addressm ore objects. IPv6 name space is much more generous The GUID name space is very largeand flat (2128), but addresses in both versions are (>2128), allowing it to be much morefully hierarchically structured nad much of the space occupied. is pre-allocated according to administrative requirements. Load balancing Loads on routers are determined by network Object locations can be randomized and hence topologya nd associated traffic patterns.
    [Show full text]
  • Performance Evaluation of Structured Peer-To-Peer Overlays for Use on Mobile Networks
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Stirling Online Research Repository Performance Evaluation of Structured Peer-to-Peer Overlays for Use on Mobile Networks Farida Chowdhury, Jamie Furness, Mario Kolberg Computing Science and Mathematics Universty of Stirling, Stirling, Scotland {fch,jrf,mko}@cs.stir.ac.uk Abstract: Distributed Hash Table (DHT) based Peer-to-Peer (P2P) overlays have been widely researched and deployed in many applications such as file sharing, IP telephony, content distribution and media streaming applications. However, their deployment has largely been restricted to fixed, wired networks. This is due to the fact that supporting P2P overlays on wireless networks such as the public mobile data network is more challenging due to constraints in terms of data transmissions on cellular networks, limited battery power of the handsets and increased levels of node churn. However, the proliferation of smartphones makes the use of P2P applications on mobile handsets very desirable. In this paper, we have analysed and evaluated the performance and efficiency of five popular DHT based structured P2P overlays (Chord, Pastry, Kademlia, Broose and EpiChord) under conditions as commonly experienced in public mobile data networks. Our results show that the conditions in mobile networks, including a high churn rate and the relatively low bandwidth availability is best matched by Kademlia and EpiChord. These overlays exhibit a high lookup success ratio and low hop count while consuming a moderate amount of bandwidth. These characteristics make these two overlays suitable candidates for use in mobile networks. 1. INTRODUCTION Peer-to-Peer (P2P) communication networks have been very popular over the last decade due to their capability to facilitate information storage and retrieval among a potentially very large number of nodes while avoiding the use of central servers.
    [Show full text]
  • A Survey and Comparison of Peer-To-Peer Overlay Network Schemes Eng Keong Lua, Jon Crowcroft, Marcelo Pias, Ravi Sharma and Steven Lim
    ¢ £ £ ¦ ¡ ¡ ¤ ¥ § ¨ © ¢ £ 3 3 < 3 + + " § § © ¤ © ! ! : : , - $ * 2 9 / - ! % 0 £ ¤ © ¦ ¦ £ ¥ ¥ ¥ ¤ © 1 ¦ 8 ; + + + ¥ ¨ ¥ ¨ ! % , ! . : ! , , 2 9 £ © © ¥ ¨ / - ! % 0 £ ¢ £ ¤ © ¤ ¨ £ ¦ ¤ © § ¤ ¦ ¦ 7 " ¥ ¨ ¥ ¨ ¥ ¨ ¥ ¨ ! - % ! . ) 5 / " ¢ £ £ ¢ £ ¦ - * - 6 ! ( ! * - ! % 0 ¥ ¢ £ ¢ £ ¦ ¥ © ¨ ¤ £ ¦ § ¨ £ ¦ 1 3 4 § ¤ ¥ § ¨ § § ¡ ! % - $ ! . 2 0 5 / " - * - 6 ! ( ! * - ! % 0 © § ¤ © £ © ¤ § is that the P2P overlay . ¦ & " § ! $ % # ' + + " $ ( ( * , - $ * . ) / - ! % 0 . DHT-based systems have a key Structured Unstructured and Structured The technical meaning of network topology is tightlynot controlled at and random content peerssubsequent are but placed queries at more specifiedtems efficient. locations Such use that Structured will thein make P2P Distributed which sys- Hash data object Tabledeterministically, (or (DHT) at value) location as the informationto a peers is the substrate, with placed data identifiers object’s corresponding unique Fig. 1. An Abstract P2P Overlay Network Architecture communications framework. Thelayer Network describes Communications thechines connected network over the characteristics Internetbased or of devices small wireless desktop that ordynamic are sensor- ma- nature connected of inparadigm. peers an The poses ad-hoc Overlay challenges manner. Nodesmanagement in The Management of communication layer peers,routing covers which algorithms the include for optimization. discovery The Featureslayer of Management peers deals and withaggregated the resource availability
    [Show full text]