Totalrecall: System Support for Automated Availability Management

Total Page:16

File Type:pdf, Size:1020Kb

Totalrecall: System Support for Automated Availability Management Total Recall: System Support for Automated Availability Management Ranjita Bhagwan, Kiran Tati, Yu-Chung Cheng, Stefan Savage, and Geoffrey M. Voelker Department of Computer Science and Engineering University of California, San Diego Abstract configure the system to compensate appropriately. While Availability is a storage system property that is both this approach may be acceptable when failures are con- highly desired and yet minimally engineered. While sistently rare, such as for the individual drives in a disk many systems provide mechanisms to improve availabil- array (and even here the management burden may be ob- ity – such as redundancy and failure recovery – how to jectionable [23]), it quickly breaks down in large-scale best configure these mechanisms is typically left to the distributed systems where hosts are transiently inaccessi- system manager. Unfortunately, few individuals have ble and individual failures are common. the skills to properly manage the trade-offs involved, let Peer-to-peer systems are particularly fragile in this re- alone the time to adapt these decisions to changing con- spect as their constituent parts are in a continual state of ditions. Instead, most systems are configured statically flux. Over short time scales (1-3 days), individual hosts and with only a cursory understanding of how the config- in such systems exhibit highly transient availability as uration will impact overall performance or availability. their users join and leave the system at will – frequently While this issue can be problematic even for individual following a rough diurnal pattern. In fact, the majority of storage arrays, it becomes increasingly important as sys- hosts in existing peer-to-peer systems are inaccessible at tems are distributed – and absolutely critical for the wide- any given time, although most are available over longer area peer-to-peer storage infrastructures being explored. time scales [4, 19]. Over still longer periods, many of This paper describes the motivation, architecture and these hosts leave the system permanently, as most peer- implementation for a new peer-to-peer storage system, to-peer systems experience high levels of churn in their called TotalRecall, that automates the task of availability overall membership. In such systems, we contend that management. In particular, the TotalRecall system auto- availability management must be provided by the system matically measures and estimates the availability of its itself, which can monitor the availability of the underly- constituent host components, predicts their future avail- ing host population and adaptively determine the appro- ability based on past behavior, calculates the appropriate priate resources and mechanisms required to provide a redundancy mechanisms and repair policies, and delivers specified level of availability. user-specified availability while maximizing efficiency. This paper describes the architecture, design and im- plementation of a new peer-to-peer storage system, called TotalRecall, that automatically manages availability in a 1 Introduction dynamically changing environment. By adapting the de- gree of redundancy and frequency of repair to the distri- Availability is a storage system property that is highly de- bution of failures, TotalRecall guarantees user-specified sired in principle, yet poorly understood in practice. How levels of availability while minimizing the overhead much availability is necessary, over what period of time needed to provide these guarantees. We rely on three key and at what granularity? How likely are failures now and approaches in providing these services: in the future and how much redundancy is needed to tol- erate them? When should repair actions be initiated and Availability Prediction. The system continuously how should they be implemented? These are all ques- monitors the current availability of its constituent tions that govern the availability of a storage system, but hosts. This measured data is used to construct pre- they are rarely analyzed in depth or used to influence the dictions, at multiple time-scales, about the future dynamic behavior of a system. availability of individual hosts and groups of hosts. Instead, system designers typically implement a static set of redundancy and repair mechanisms simply param- Redundancy Management. Short time-scale predic- eterized by resource consumption (e.g., number of repli- tions are then used to derive precise redundancy re- cas). Determining how to configure the mechanisms and quirements for tolerating transient disconnectivity. what level of availability they will provide if employed is The system selects the most efficient redundancy left for the user to discover. Moreover, if the underlying mechanism based on workload behavior and system environment changes, it is again left to the user to re- policy directives. Dynamic Repair. Long time-scale predictions cou- The AutoRAID system provided two implementations of pled with information about current availability storage redundancy – mirroring and RAID5 – and dy- drive system repair actions. The repair policy is dy- namically assigned data between them to optimize the namically selected as a function of these predictions, performance of the current workload. While this sys- target availability and system workload. tem did not directly provide users with explicit con- trol over availability it did significantly reduce the TotalRecall is implemented in C++ using a modified management burden associated with configuring these version of the DHash peer-to-peer object location ser- high-availability storage devices. A later HP project, vice [8]. The system implements a variety of redun- AFRAID, did allow user-specified availability in a disk dancy mechanisms (including replication and online cod- array environment, mapping availability requests into ing), availability predictors and repair policies. However, variable consistency management operations [20]. more significantly, the system provides interfaces that al- low new mechanisms and policies to describe their be- In the enterprise context, several researchers have re- havior in a unified manner – so the system can decide cently proposed systems to automate storage manage- how and when to best use them. ment tasks. Keeton and Wilkes have described a sys- The remainder of this paper describes the motivation, tem designed to automate data protection decisions that is architecture and design of the TotalRecall system. The similar in motivation to our own, but they focus on longer following section motivates the problem of availability time scales since the expected failure distribution in the management and describes key related work. In Section 3 enterprise is far less extreme than in the peer-to-peer en- we discuss our availability architecture – the mechanisms vironment [10]. The WiND system is being designed to and policies used to ensure that user availability require- automate many storage management tasks in a cluster en- ments are met. Sections 4 describes the design of the vironment, but is largely focused on improving system TotalRecall Storage System and its implementation. Sec- performance [2]. Finally, the PASIS project is exploring tion 5 describes the TotalRecall File System, a NFS file system support to automatically make trade-offs between service implemented on the core storage system. In Sec- different redundancy mechanisms when building a dis- tion 6, we quantitatively evaluate the effectiveness of our tributed file system [24]. system and compare it against existing approaches. Fi- Not surprisingly, perhaps the closest work to our own nally, Section 7 concludes. arises from the peer-to-peer (P2P) systems community. Due to the administrative heterogeneity and poor host availability found in the P2P environment, almost all 2 Motivation and Related Work P2P systems provide some mechanism for ensuring data The implicit guarantee provided by all storage systems availability in the presence of failures. For example, the is that data, once stored, may be recalled at some future CFS system relies on a static replication factor coupled point. Providing this guarantee has been the subject of with an active repair policy, as does Microsoft’s FAR- countless research efforts over the last twenty years, and SITE system (although FARSITE calculates the replica- has produced a wide range of technologies ranging from tion factor as a function of total storage and and is more RAID to robotic tape robots. However, while the effi- careful about replica placement) [1,8,9]. The Oceanstore ciency of these techniques has improved over time and system uses a combination of block-level erasure cod- while the cost of storage itself has dropped dramatically, ing for long term durability and simple replication to tol- the complexity of managing this storage and its availabil- erate transient failures [12, 22]. Finally, a recent paper ity has continued to increase. In fact, a recent survey by Blake and Rodrigues argues that the cost of dynamic analysis of cluster-based services suggests that the op- membership makes cooperative storage infeasible in the erational cost of preparing for and recovering from fail- transiently available peer-to-peer environments [5]. This ures easily dominates the capital expense of the individ- finding is correct under certain assumptions, but is not ual hardware systems [17]. This disparity will only con- critical in the environments we have measured and the tinue to increase as hardware costs are able to reflect ad- system we
Recommended publications
  • Foxx Street Gossip Zip File Download Avtok V.1.0
    foxx street gossip zip file download AvtoK v.1.0. AvtoK - multi autoclicker. Software for recording and playback of action with a keyboard and mouse, which can be repeated in autoclicker. Despite the apparent simplicity of the main window is quite powerful in its class. Scope: In games and sites that require routine activities. To automate tedious and repetitive actions on the computer. By setting the control points can analyze the image on the screen and organize complex actions. Features. Read more. Rate: License: Freeware Category: Automation Tools Developer: Ocean games Downloads: 319 Size: 1.63 Mb Price: Free. To free download a trial version of AvtoK, click here To visit developer homepage of AvtoK, click here. Screen Shot. Click on a thumbnail for the larger image. System Requirements. AvtoK requires Win2000, Win7 x32, Win7 x64, Win98, WinServer, WinVista, WinVista x64, WinXP. Buzzer v.1.1.0.0. A massive red button that plays a sound of your choice from the config menu allowing you to have a game show style buzzer. Cockerel sound Smelly bottom sound Joke drum roll sound French tart sound Door bell sound Horny horn sound Bloke who’s trapped his figure in the car boot sound Wolf whistle sound Some nifty piece of. Read more. Rate: Release Date: 09/13/2012 License: Freeware Category: Recreation Developer: FluxedDev.com Downloads: 26 Size: 6 Mb Price: Free. To free download a trial version of Buzzer, click here To visit developer homepage of Buzzer, click here. Screen Shot. Click on a thumbnail for the larger image. System Requirements.
    [Show full text]
  • The Edonkey File-Sharing Network
    The eDonkey File-Sharing Network Oliver Heckmann, Axel Bock, Andreas Mauthe, Ralf Steinmetz Multimedia Kommunikation (KOM) Technische Universitat¨ Darmstadt Merckstr. 25, 64293 Darmstadt (heckmann, bock, mauthe, steinmetz)@kom.tu-darmstadt.de Abstract: The eDonkey 2000 file-sharing network is one of the most successful peer- to-peer file-sharing applications, especially in Germany. The network itself is a hybrid peer-to-peer network with client applications running on the end-system that are con- nected to a distributed network of dedicated servers. In this paper we describe the eDonkey protocol and measurement results on network/transport layer and application layer that were made with the client software and with an open-source eDonkey server we extended for these measurements. 1 Motivation and Introduction Most of the traffic in the network of access and backbone Internet service providers (ISPs) is generated by peer-to-peer (P2P) file-sharing applications [San03]. These applications are typically bandwidth greedy and generate more long-lived TCP flows than the WWW traffic that was dominating the Internet traffic before the P2P applications. To understand the influence of these applications and the characteristics of the traffic they produce and their impact on network design, capacity expansion, traffic engineering and shaping, it is important to empirically analyse the dominant file-sharing applications. The eDonkey file-sharing protocol is one of these file-sharing protocols. It is imple- mented by the original eDonkey2000 client [eDonkey] and additionally by some open- source clients like mldonkey [mlDonkey] and eMule [eMule]. According to [San03] it is with 52% of the generated file-sharing traffic the most successful P2P file-sharing net- work in Germany, even more successful than the FastTrack protocol used by the P2P client KaZaa [KaZaa] that comes to 44% of the traffic.
    [Show full text]
  • Copyright Infringement (DMCA)
    Copyright Infringement (DMCA) Why it is important to understand the DMCA: Kent State University (KSU) is receiving more and more copyright infringement notices every semester, risking the loss of ‘safe harbor’ status. Resident students, KSU faculty, staff, and student employees, and the University itself could be at risk of costly litigation, expensive fines, damage to reputation, and possible jail time. What you need to know about KSU’s role: ● KSU is not a policing organization. ● KSU does not actively monitor computing behavior. ● KSU reacts to infringement notices generated by agents of the copyright holders ● KSU expends a significant amount of time/money protecting the identity of students by maintaining ‘safe harbor’ status KSU is ‘the good guy’ – we’re focused on educating our community What you need to know if your student receives a copyright infringement notice: They were identified as using P2P software and illegally sharing copyrighted material (whether downloading to their computer or allowing others to download from their computer). They will be informed of the notice via email, and their network connection to external resources (sites outside of KSU) will be disabled to maintain ‘safe harbor’ status until such time as they have complied with the University’s requirements under the DMCA. They can face sanctions ranging from blocked connectivity to dismissal from the University. Definition of terms: DMCA: The Digital Millennium Copyright Act of 1998 was signed into law in the United States to protect the intellectual property rights of copyright holders of electronic media (music, movies, software, games, etc.). The DMCA allows KSU to operate as an OSP.
    [Show full text]
  • Download-Emule-Kad-Server-List.Pdf
    Download Emule Kad Server List Download Emule Kad Server List 1 / 3 2 / 3 web site page displaying list of all active servers on the eDonkey/eMule p2p network. ... ping test update servers list at client start download list in eMule.. 0.50a installed on my computer. I can connect to eD2K network easily but I can't connect to Kad network. I have tried to download from http://www.nodes-dat.com/ but the first button " Add to eMule (from Nodes Server)" did't work and the other two worked but the problem still remains.. Bezpieczna lista serwerów emule do pobrania. Pobierz listę zawsze aktualną. Download server.met & serverlist for eMule.. eMule now connects to both the eDonkey network and the Kad network. ... eMule will use clients it knows already from the ed2k servers to get connected to Kad .... The servers merely help hold the network together. Meanwhile, Kad is a network that is also connectable via eMule. Unlike the ED2K network, ... You can use the easy to use installer or you can download the binaries. The difference is that the .... nodes.dat nodes for emule kademlia net server edonkey overnet. ... von IP/Port im Kad-Fenster, oder. - per Download aus dem Internent, z.B. nodes.dat.. Connecting to servers hasn't been working for a long time. ... started them again (there is free drive space on the download drive) but can't get a Kad connection. ... Block4: ipfilter.dat, nodes.dat, server.met (emule-security.org). Dodaj do #eMule te 2 pliki : Do serwerów, czyli eD2k --- http://www.server-list.info/ Do Kad ---..
    [Show full text]
  • 00079-141173.Pdf (5.08
    CHRIS JAY HOOFNAGLE Adjunct Full Professor School of Information School of Law Faculty Director Berkeley Center for Law & Technology August 22, 2017 University of California, Berkeley VIA THE WEB Berkeley, CA Tel: 5 Federal Trade Commission https://hoofnagle.berkeley.edu Office of the Secretary 600 Pennsylvania Avenue NW. Suite CC–5610 (Annex B) Washington, DC 20580 Re: Comment of Chris Hoofnagle on Controlling the Assault of Non-Solicited Pornography and Marketing Act (CAN–SPAM Rule, 16 CFR part 316, Project No. R711010) Dear Mr. Brown, Thank you for soliciting public comment on the CAN–SPAM Rule. My comments below focus on the need for the CAN–SPAM Rule, the costs that spam imposes on consumers and the economy, the prospect that technical interventions on intermediaries can be effective, that spam senders strategically use transaction costs to deter recipients from opting out, that senders impose privacy penalties on those who opt out, for the FTC to consider third-party lookups for email addresses to be an aggravated violation of CAN–SPAM, to revisit that the idea of a Do-Not-Email Registry, and finally, to keep the computer science literature on spam in focus. There is a Continuing Need for the CAN–SPAM Rule Because the Injuries Caused by Spam Are Economic and Social and Are on Par with Serious Crimes In a 2001 speech, FTC Chairman Timothy Muris identified spam messages as injurious under the Commission’s “harm-based” approach.1 Today, the majority of e-mail is spam. Senders of marketing e- mails can leverage the technical and economic properties of the internet to send tens of billions of messages a day.
    [Show full text]
  • Hearing on P2P File-Sharing Technology
    Prepared Statement of The Federal Trade Commission Before the Subcommittee on Competition, Infrastructure, and Foreign Commerce of the Committee on Commerce, Science, and Transportation United States Senate Hearing on P2P File-Sharing Technology Washington, D.C. June 23, 2004 I. Introduction Mr. Chairman and members of the Committee, I am Howard Beales, Director of the Bureau of Consumer Protection, Federal Trade Commission (“Commission” or “FTC”).1 I appreciate this opportunity to provide the Commission’s views on peer-to-peer (“P2P”) file- sharing and protecting consumers online.2 The Federal Trade Commission is the federal government’s principal consumer protection agency. Congress has directed the Commission, under the FTC Act, to take law enforcement action against “unfair or deceptive acts or practices” in almost all sectors of the economy and to promote vigorous competition in the marketplace.3 With the exception of certain industries and activities, the FTC Act provides the Commission with broad investigative and enforcement authority over entities engaged in, or whose business affects, commerce.4 The FTC Act also authorizes the Commission to conduct studies and collect information, and, in the public interest, to publish reports on the information it obtains.5 1The written statement presents the views of the Federal Trade Commission. Oral statements and responses to questions reflect my views and not necessarily those of the Commission or any individual Commissioner. 2The views contained in this testimony are expressed to assist you in your review of peer- to-peer file sharing technology. Official Commission determinations of the legality of practices under the Federal Trade Commission Act are ordinarily made based on a complete record, after notice and the opportunity to fully brief the issues being considered.
    [Show full text]
  • (How to Get Away from the RIAA) Using Newgroups and Hel
    The Regulation of Peer-to-Peer File Sharing Networks: Legal Convergence v. Perception Divergence Dr Katerina Sideri, Email: [email protected] and [email protected] The Regulation of Peer-to-Peer File Sharing Networks: Legal Convergence v. Perception Divergence Dr Katerina Sideri Lecturer in Law School of Law, University of Exeter Email: [email protected] and [email protected] Paper prepared for the AHRB Workshop on Network Theme 2 (Globalisation, Convergence and Divergence) Incomplete working paper, do not quote without the author’s permission. 1 The Regulation of Peer-to-Peer File Sharing Networks: Legal Convergence v. Perception Divergence Dr Katerina Sideri, Email: [email protected] and [email protected] A P2P: Regulatory Convergence Millions of people around the globe use peer-to-peer (P2P) file sharing networks. However, as these networks function like search engines, allowing looking into files stored in users’ hard discs, many engage in downloading music and movies for free, without compensating copyright holders. Digital technology makes the distribution of multiple perfect copies of copyright material so easy that the entertainment industry has been alerted and brought proceedings before courts in the US, Europe, Canada, Australia and Asia, targeting users and the makers of P2P software for copyright violations, and submitting that it has lost millions as a result of the piracy and counterfeiting taking place in such networks. However, P2P networks are not meant to be primarily designed to promote piracy.1 They play the role of a new distribution channel to promote music or other kinds of art, or even share personal photos with some million other users, thus advancing cultural expression and creativity.
    [Show full text]
  • Peer to Peer and SPAM in the Internet
    Networking Laboratory Department of Electrical and Communications Engineering Helsinki University of Technology Peer to Peer and SPAM in the Internet Report based a Licentiate Seminar on Networking Technology Fall 2003 Editor: Raimo Kantola Abstract: This report is a collection of papers prepared by PhD students on Peer-to-Peer applications and unsolicited e-mail or spam. The phenomena are covered from different angles including the main algorithms, protocols and application programs, the content, the operator view, legal issues, the economic aspects and the user point of view. Most papers are based on literature review including the latest sources on the web, some contain limited simulations and a few introduce new ideas on aspects of the phenomena under scrutiny. Before presenting the papers we will try to give some economic and social background to the phenomena. Overall, the selection of papers provides a good overview of the two phenomena providing light into what is happening in the Internet today. Finally, we provide a short discussion on where the development is taking us and how we should react to these new phenomena. Acknowledgements My interest in Peer-to-peer traffic and applications originates in the discussions that took place in the Special Interest Group on Broadband networks hosted within the TEKES NETS program during 2003. In particular, I would like to thank Pete Helenius of Rommon for his insights. The Broadband group organised an open seminar in Dipoli on November 20th that brought the topic into a wider discussion involving people from both the networking side as well as the content business.
    [Show full text]
  • P2P Filesharing Population Tracking Based on Network Flow Data
    P2P Filesharing Population Tracking Based on Network Flow Data Arno Wagner, Thomas D¨ubendorfer, Lukas Hammerle,¨ Bernhard Plattner Contact: [email protected] Communication Systems Laboratory Swiss Federal Institute of Technology Zurich (ETH Zurich) Outline 1. Motivation 2. Setting 3. The PeerTracker 4. Validation by Polling 5. Comments on Legal Aspects 6. Conclusion Arno Wagner, ETH Zurich, ICISP 2006 – p.1 Contributors Philipp Jardas, ”P2P Filesharing Systems: Real World NetFlow Traffic Characterization”, Barchelor Thesis, ETH Zürich, 2004 Lukas Hämmerle, ”P2P Population Tracking and Traffic Characterization of Current P2P file-sharing Systems”, Master Thesis, ETH Zürich, 2004 Roger Kaspar, ”P2P File-sharing Traffic Identification Method Validation and Verification”, Semester Thesis, ETH Zürich, 2005 PDFs available from http://www.tik.ee.ethz.ch/~ddosvax/sada/ Arno Wagner, ETH Zurich, ICISP 2006 – p.2 Motivation P2P traffic forms a large and dynamic part of the overall network traffic Identification of P2P traffic allows P2P anomaly detection Identification of P2P traffic allows better analysis of other traffic Arno Wagner, ETH Zurich, ICISP 2006 – p.3 The DDoSVax Project http://www.tik.ee.ethz.ch/~ddosvax/ Collaboration between SWITCH (www.switch.ch, AS559) and ETH Zurich (www.ethz.ch) Aim (long-term): Near real-time analysis and countermeasures for DDoS-Attacks and Internet Worms Start: Begin of 2003 Funded by SWITCH and the Swiss National Science Foundation Arno Wagner, ETH Zurich, ICISP 2006 – p.4 DDoSVax Data Source: SWITCH The Swiss Academic And Research Network .ch Registrar Links most Swiss Universities and CERN Carried around 5% of all Swiss Internet traffic in 2003 Around 60.000.000 flows/hour Around 300GB traffic/hour Flow archive (unsampled) since May 2003 Arno Wagner, ETH Zurich, ICISP 2006 – p.5 What is a ”Flow”? In the DDoSVax-Project: CISCO NetFlow v5.
    [Show full text]
  • Designing Incentive Mechanisms for Peer-To-Peer Systems P
    67 Designing Incentive Mechanisms for Peer- to-Peer Systems From file-sharing to mobile ad-hoc networks. community networking to application layer overlays. the peer-trrpeer networking paradigm promises to revolutionize the way we design. build and use the communications network of t~morro~.transform the SINCIU~C of the communications industry, and challenge our understanding of markets and demwracies in a digital age. The fundamenlal premise of peer-to-peer systems is that individual peers voluntarily contribute resources to the system. However, the inherent tension between individual rationality and collective welfare produces a misalignment of incentives in the grassr~otsprovisioning ofP2P services. What makes free-riding a particularly difficult problem is the unique set of challenges that P2P systems pose: large populations. high turnover, asymmetry of interest. collusion. hidden actions, and zerwost identities. This paper discusses some of the research opponunities and challenges in the design oiincentive mechanisms for P2P systems. P2P System Characteristics Peer-to-peer networks exhibit sevecal economic characteristics that are different from those of traditional provider-centric networks. First. P2P networks derive their utility solely from the contribution of resources from cooperating nodes. End- nodcs have to self-organize and assume the roles of routing, message forwarding, and network repair in order to maintain connectivity (at the nrtwurk layer and/or the application layer). At the same time, nodes have every incentive to free-ride or dcfcct, e.g.. contribute less to the common resource pool than they consume from it or contribute nothing at all. There is therefore a tension between the maximization of individual utility and the achicvcment of global optimalily.
    [Show full text]
  • Rarest First and Choke Algorithms Are Enough
    Rarest First and Choke Algorithms Are Enough Arnaud Legout G. Urvoy-Keller and P. Michiardi I.N.R.I.A. Institut Eurecom Sophia Antipolis France Sophia Antipolis France [email protected] fGuillaume.Urvoy,[email protected] ABSTRACT for this success. Whereas content localization has attracted The performance of peer-to-peer ¯le replication comes from considerable research interest in the last years [7,12,23,25], its piece and peer selection strategies. Two such strategies content replication has started to be the subject of active have been introduced by the BitTorrent protocol: the rarest research only recently. As an example, the most popular ¯rst and choke algorithms. Whereas it is commonly ad- peer-to-peer ¯le sharing networks [1] eDonkey2K, FastTrack, mitted that BitTorrent performs well, recent studies have Gnutella, Overnet focus on content localization. The only proposed the replacement of the rarest ¯rst and choke algo- widely used [16, 17, 20] peer-to-peer ¯le sharing application rithms in order to improve e±ciency and fairness. In this focusing on content replication is BitTorrent [8]. paper, we use results from real experiments to advocate that Yang et al. [26] studied the problem of e±cient content the replacement of the rarest ¯rst and choke algorithms can- replication in a peer-to-peer network. They showed that not be justi¯ed in the context of peer-to-peer ¯le replication the capacity of the network to serve content grows exponen- in the Internet. tially with time in the case of a flash crowd, and that a key We instrumented a BitTorrent client and ran experiments improvement on peer-to-peer ¯le replication is to split the on real torrents with di®erent characteristics.
    [Show full text]
  • Rarest First and Choke Algorithms Are Enough Arnaud Legout, Guillaume Urvoy-Keller, Pietro Michiardi
    Rarest First and Choke Algorithms Are Enough Arnaud Legout, Guillaume Urvoy-Keller, Pietro Michiardi To cite this version: Arnaud Legout, Guillaume Urvoy-Keller, Pietro Michiardi. Rarest First and Choke Algorithms Are Enough. [Research Report] 2006. inria-00001111v1 HAL Id: inria-00001111 https://hal.inria.fr/inria-00001111v1 Submitted on 13 Feb 2006 (v1), last revised 6 Sep 2006 (v3) HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. INRIA-00001111, VERSION 1 - 13 FEBRUARY 2006 1 Rarest First and Choke Algorithms Are Enough Arnaud Legout I.N.R.I.A. Sophia Antipolis, France Email: [email protected] Guillaume Urvoy-Keller and Pietro Michiardi Institut Eurecom Sophia Antipolis, France Email: {Guillaume.Urvoy,Pietro.Michiardi}@eurecom.fr Technical Report Abstract— The performance of peer-to-peer file replication that a peer-to-peer architecture for file replication is a major comes from its piece and peer selection strategies. Two such improvement compared to a client server architecture, whose strategies have been introduced by the BitTorrent protocol: capacity of service does not scale with the number of peers. the rarest first and choke algorithms. Whereas it is commonly admitted that BitTorrent performs well, recent studies have However, both studies assume global knowledge, which is proposed the replacement of the rarest first and choke algorithms not realistic.
    [Show full text]