15-441 Computer Networking Lecture 16: Delivering Content Web And

Total Page:16

File Type:pdf, Size:1020Kb

15-441 Computer Networking Lecture 16: Delivering Content Web And Overview • Web 15-441 • Protocol interactions 15-441 Computer Networking 15-641 • Caching • Cookies • Consistent hashing Lecture 16: Delivering Content • Peer-to-peer Web and Peer-Peer • CDN Peter Steenkiste • Video Fall 2014 www.cs.cmu.edu/~prs/15-441-F14 2 Web history Internet Traffic History • 1945: Vannevar Bush, “As we may think”, Atlantic 100000 Monthly, July, 1945. • Describes the idea of a distributed hypertext system. 10000 All Fixed 1000 • A “memex” that mimics the “web of trails” in our minds. Mobile • 1989: Tim Berners-Lee (CERN) writes internal proposal 100 to develop a distributed hypertext system 10 • Connects “a web of notes with links”. PByte/month 1 • Intended to help CERN physicists in large projects share and manage information 0.1 • 1990: TBL writes graphical browser for Next machines 0.01 0.001 • 1992-1994: NCSA/Mosaic/Netscape browser release Year 3 4 1 Typical Workload (Web Pages) HTTP 0.9/1.0 • Multiple (typically small) objects per page • One request/response per TCP connection • File sizes •Lots of small objects • Simple to implement • Heavy-tailed means & TCP • Short transfers are very hard on TCP • Pareto distribution for tail • 3-way handshake • Multiple connection setups three-way handshake • Lognormal for body of distribution • Lots of slow starts each time • Embedded references • Extra connection state • Several extra round trips added to transfer • Number of embedded objects also Pareto • Many slow starts – low throughput because of small -k Pr(X>x) = (x/xm) window • This plays havoc with performance. Why? • Never leave slow start for short transfers • Loss recovery is poor when windows are small • Solutions? • Lots of extra connections • Increases server state/processing 5 6 Single Transfer Example Improving Performance Client Server • Multiple concurrent connections (Netscape) 0 RTT SYN • Benefits are mixed: more state, timeouts, … Client opens TCP SYN 1 RTT • Multiplex multiple transfers onto one TCP connection connection ACK Client sends HTTP request DAT (HTTP 1.1) ACK Server reads from for HTML DAT disk • Also allow pipelined transfers, i.e., multiple outstanding requests 2 RTT FIN ACK • How to identify requests/responses Client parses HTML FIN • Delimiter Server must examine response for delimiter string Client opens TCP ACK connection SYN • Content-length and delimiter Must know size of transfer in SYN advance 3 RTT ACK • Block-based transmission send in multiple length delimited Client sends HTTP request DAT blocks for image Server reads from ACK disk • Store-and-forward wait for entire response and then use 4 RTT content-length DAT Image begins to arrive • Solution use existing methods and close connection otherwise 7 8 2 Persistent Connection Solution Other Problems Server • Serialized transmission but first bytes may be most useful • May be better to get the 1st 1/4 of all images than one Client complete image (e.g., progressive JPEG) 0 RTT DAT • Can “packetize” transfer over TCP, e.g., range requests Client sends HTTP request ACK Server reads from for HTML DAT disk • Application specific solution to transport protocol 1 RTT ACK problems. :( Client parses HTML DAT • Could fix TCP so it works well with multiple Server reads from Client sends HTTP request simultaneous connections ACK disk for image DAT • More difficult to deploy 2 RTT • Persistent connections can significantly increase state on the server Image begins to arrive • Allow server to close HTTP session • Client can no longer send requests 9 10 Web Proxy Caches No Caching Example (1) Assumptions • Average object size = 100,000 bits origin • User configures browser: Web origin • Avg. request rate from institution’s servers browser to origin servers = 15/sec accesses via cache server public • Delay from institutional router to • Browser sends all HTTP Proxy Internet requests to cache any origin server and back to router server = 2 sec • Object in cache: cache client returns object Consequences • Else cache requests object • Utilization on LAN = 15% 1.5 Mbps access link from origin server, then • Utilization on access link = 100% returns object to client • Total delay = Internet delay + access institutional network delay + LAN delay 10 Mbps LAN = 2 sec + minutes + milliseconds client origin server 11 12 3 No Caching Example (2) With Caching Example (3) Possible solution Install cache • Increase bandwidth of access link origin origin • Suppose hit rate is .4 to, say, 10 Mbps servers servers Consequence • Often a costly upgrade public • 40% requests will be satisfied almost public Internet immediately (say 10 msec) Internet Consequences • 60% requests satisfied by origin server • Utilization on LAN = 15% • Utilization of access link reduced to 60%, • Utilization on access link = 15% resulting in negligible delays • Total delay = Internet delay + access 10 Mbps • Weighted average of delays 1.5 Mbps delay + LAN delay access link = .6*2 sec + .4*10msecs < 1.3 secs access link = 2 sec + msecs + msecs institutional institutional network network 10 Mbps LAN 10 Mbps LAN institutional cache 13 14 HTTP Caching Problems • Clients often cache documents • Fraction of HTTP objects that are cacheable is dropping • Challenge: update of documents • Why? • If-Modified-Since requests to check • Major exception? • HTTP 0.9/1.0 used just date • This problem will not go away • HTTP 1.1 has an opaque “entity tag” (could be a file signature, • Dynamic data stock prices, scores, web cams etc.) as well • CGI scripts results based on passed parameters • When/how often should the original be checked for • Other less obvious examples changes? • SSL encrypted data is not cacheable • Check every time? • Most web clients don’t handle mixed pages well many generic objects transferred with SSL • Check each session? Day? Etc? • Cookies results may be based on past data • Use Expires header • Hit metering owner wants to measure # of hits for revenue, etc. • If no Expires, often use Last-Modified as estimate • What will be the end result? 15 16 4 Cookies: Keeping “state” Cookies: Keeping “State” client Amazon server Many major Web sites use Cookie file usual http request msg server cookies usual http response + creates ID Four components: Example: ebay: 8734 Set-cookie: 1678 1678 for user 1) Cookie header line in the • Susan accesses Internet HTTP response message always from the same PC Cookie file usual http request msg 2) Cookie header line in HTTP • She visits a specific e- amazon: 1678 cookie- request message cookie: 1678 commerce site for the first time ebay: 8734 specific 3) Cookie file kept on user’s • When initial HTTP requests usual http response msg action host and managed by user’s arrives at the site, the site one week later: browser creates a unique ID and creates 4) Back-end database at Web an entry in a backend database usual http request msg Cookie file cookie- site for that ID cookie: 1678 amazon: 1678 specific ebay: 8734 usual http response msg action 17 18 Overview Distributing Load across Servers • Web • Given document XYZ, we need to choose a • Consistent hashing server to use • Peer-to-peer • E.g., in a data center • CDN • Suppose we use simple hashing: modulo of a • Video hash of the name of the document • Number servers from 1…n • Place document XYZ on server (XYZ mod n) • What happens when a servers fails? n n-1 • Same if different people have different measures of n • Why might this be bad? 19 20 5 Consistent Hash: Goals Consistent Hash – Example • Construction • “view” = subset of all hash buckets that are 0 • Assign each of C hash buckets to 14 candidate locations random points on mod 2n circle, where, hash key size = n. Bucket • Correspond to a real server 12 4 • Map object to random position on • Desired features unit interval • Load – all hash buckets have a similar number • Hash of object = closest bucket 8 of objects assigned to them • Monotone addition of bucket does not cause • Smoothness – little impact on hash bucket movement between existing buckets contents when buckets are added/removed • Spread & Load small set of buckets that lie • Spread – small set of hash buckets that may near object hold an object regardless of views • Balance no bucket is responsible for large number of objects 21 22 Consistent Hashing: Ring Consistent Hashing Example • Use consistent has to map both keys and nodes to an m-bit identifier Rule: A key is stored at its successor: node with next higher or equal ID in the same (metric) identifier space • For example, use SHA-1 hashes IP=“198.10.10.1” 0 K5 • Node identifier: SHA-1 hash of IP address IP=“198.10.10.1” SHA-1 ID=123 N123 K20 • Key identifier: SHA-1 hash of key Circular 7-bit SHA-1 ID=60 Key=“LetItBe” K101 ID space N32 • Also need “rule” for assigning keys to nodes • For example: “closest”, higher, lower, .. N90 Key=“LetItBe” K60 24 23 6 Consistent Hashing Properties Finer Grain Load Balancing • Load balance: all nodes receive roughly the same • Redirector knows all server IDs number of keys • It can also track approximate “load” for more • For N nodes and K keys, with high probability precise load balancing • Each node holds at most (1+)K/N keys • Provided that K is large compared to N • Need to define load and be able to track it • When server is added, it receives its initial work load from • To balance load: “neighbors” on the ring • W = Hash(URL, ip of s ) for all i • “Local” operation: no other servers are affected i i • Sort W from high to low • Similar property when a server is removed i • Find first server with low enough load • Benefits and drawbacks? 25 26 Consistent
Recommended publications
  • IPFS and Friends: a Qualitative Comparison of Next Generation Peer-To-Peer Data Networks Erik Daniel and Florian Tschorsch
    1 IPFS and Friends: A Qualitative Comparison of Next Generation Peer-to-Peer Data Networks Erik Daniel and Florian Tschorsch Abstract—Decentralized, distributed storage offers a way to types of files [1]. Napster and Gnutella marked the beginning reduce the impact of data silos as often fostered by centralized and were followed by many other P2P networks focusing on cloud storage. While the intentions of this trend are not new, the specialized application areas or novel network structures. For topic gained traction due to technological advancements, most notably blockchain networks. As a consequence, we observe that example, Freenet [2] realizes anonymous storage and retrieval. a new generation of peer-to-peer data networks emerges. In this Chord [3], CAN [4], and Pastry [5] provide protocols to survey paper, we therefore provide a technical overview of the maintain a structured overlay network topology. In particular, next generation data networks. We use select data networks to BitTorrent [6] received a lot of attention from both users and introduce general concepts and to emphasize new developments. the research community. BitTorrent introduced an incentive Specifically, we provide a deeper outline of the Interplanetary File System and a general overview of Swarm, the Hypercore Pro- mechanism to achieve Pareto efficiency, trying to improve tocol, SAFE, Storj, and Arweave. We identify common building network utilization achieving a higher level of robustness. We blocks and provide a qualitative comparison. From the overview, consider networks such as Napster, Gnutella, Freenet, BitTor- we derive future challenges and research goals concerning data rent, and many more as first generation P2P data networks, networks.
    [Show full text]
  • A Fog Storage Software Architecture for the Internet of Things Bastien Confais, Adrien Lebre, Benoît Parrein
    A Fog storage software architecture for the Internet of Things Bastien Confais, Adrien Lebre, Benoît Parrein To cite this version: Bastien Confais, Adrien Lebre, Benoît Parrein. A Fog storage software architecture for the Internet of Things. Advances in Edge Computing: Massive Parallel Processing and Applications, IOS Press, pp.61-105, 2020, Advances in Parallel Computing, 978-1-64368-062-0. 10.3233/APC200004. hal- 02496105 HAL Id: hal-02496105 https://hal.archives-ouvertes.fr/hal-02496105 Submitted on 2 Mar 2020 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. November 2019 A Fog storage software architecture for the Internet of Things Bastien CONFAIS a Adrien LEBRE b and Benoˆıt PARREIN c;1 a CNRS, LS2N, Polytech Nantes, rue Christian Pauc, Nantes, France b Institut Mines Telecom Atlantique, LS2N/Inria, 4 Rue Alfred Kastler, Nantes, France c Universite´ de Nantes, LS2N, Polytech Nantes, Nantes, France Abstract. The last prevision of the european Think Tank IDATE Digiworld esti- mates to 35 billion of connected devices in 2030 over the world just for the con- sumer market. This deep wave will be accompanied by a deluge of data, applica- tions and services.
    [Show full text]
  • Decentralized File Sharing Application for Mobile Devices -. | Davide Taibi
    Asterism: Decentralized File Sharing Application for Mobile Devices Olli-Pekka Heinisuo Valentina Lenarduzzi Davide Taibi Tampere University Tampere University Tampere University Tampere, Finland Tampere, Finland Tampere, Finland [email protected] valentina.lenarduzzi@tut.fi davide.taibi@tut.fi Abstract—Most applications and services rely on central au- To avoid the issues of centralization, data can to be stored thorities. This introduces a single point of failure to the system. without any central authorities. Decentralized storage and The central authority must be trusted to have data stored by the content distribution can be achieved with peer-to-peer (P2P) application available at any given time. More importantly, the privacy of the user depends on the service provider capability to networking. In peer-to-peer networking every node of a network keep the data safe. A decentralized system could be a solution to is both a client and a server [4]. The nodes then talk to each remove the dependency from a central authority. Moreover, due other without any central service as opposed to centralized to the rapid growth of mobile device usage, the availability of systems where all communication goes through dedicated decentralization must not be limited only to desktop computers. central servers. In this work we aim at studying the possibility to use mobile devices as a decentralized file sharing platform without any Peer-to-peer (P2P networks and file sharing are rather central authorities. This was done by implementing Asterism, common on desktop environments but they have been rarely a peer-to-peer file-sharing mobile application based on the Inter- used on mobile devices due to more restricted resources.
    [Show full text]
  • Forensic Investigation of P2P Cloud Storage Services and Backbone For
    Forensic investigation of P2P cloud storage services and backbone for IoT networks : BitTorrent Sync as a case study Yee Yang, T, Dehghantanha, A, Choo, R and Yang, LK http://dx.doi.org/10.1016/j.compeleceng.2016.08.020 Title Forensic investigation of P2P cloud storage services and backbone for IoT networks : BitTorrent Sync as a case study Authors Yee Yang, T, Dehghantanha, A, Choo, R and Yang, LK Type Article URL This version is available at: http://usir.salford.ac.uk/id/eprint/40497/ Published Date 2016 USIR is a digital collection of the research output of the University of Salford. Where copyright permits, full text material held in the repository is made freely available online and can be read, downloaded and copied for non-commercial private study or research purposes. Please check the manuscript for any further copyright restrictions. For more information, including our policy and submission procedure, please contact the Repository Team at: [email protected]. Note: This is authors accepted copy – for final article please refer to International Journal of Computers & Electrical Engineering Forensic Investigation of P2P Cloud Storage: BitTorrent Sync as a Case Study 1 2 3 1 Teing Yee Yang , Ali Dehghantanha , Kim-Kwang Raymond Choo , Zaiton Muda 1 Department of Computer Science, Faculty of Computer Science and Information Technology, Universiti Putra Malaysia, UPM Serdang, Selangor, Malaysia 2 The School of Computing, Science & Engineering, Newton Building, University of Salford, Salford, Greater Manchester, United Kingdom 3 Information Assurance Research Group, University of South Australia, Adelaide, South Australia, Australia. Abstract Cloud computing has been regarded as the technology enabler for the Internet of Things (IoT).
    [Show full text]
  • Download-Emule-Kad-Server-List.Pdf
    Download Emule Kad Server List Download Emule Kad Server List 1 / 3 2 / 3 web site page displaying list of all active servers on the eDonkey/eMule p2p network. ... ping test update servers list at client start download list in eMule.. 0.50a installed on my computer. I can connect to eD2K network easily but I can't connect to Kad network. I have tried to download from http://www.nodes-dat.com/ but the first button " Add to eMule (from Nodes Server)" did't work and the other two worked but the problem still remains.. Bezpieczna lista serwerów emule do pobrania. Pobierz listę zawsze aktualną. Download server.met & serverlist for eMule.. eMule now connects to both the eDonkey network and the Kad network. ... eMule will use clients it knows already from the ed2k servers to get connected to Kad .... The servers merely help hold the network together. Meanwhile, Kad is a network that is also connectable via eMule. Unlike the ED2K network, ... You can use the easy to use installer or you can download the binaries. The difference is that the .... nodes.dat nodes for emule kademlia net server edonkey overnet. ... von IP/Port im Kad-Fenster, oder. - per Download aus dem Internent, z.B. nodes.dat.. Connecting to servers hasn't been working for a long time. ... started them again (there is free drive space on the download drive) but can't get a Kad connection. ... Block4: ipfilter.dat, nodes.dat, server.met (emule-security.org). Dodaj do #eMule te 2 pliki : Do serwerów, czyli eD2k --- http://www.server-list.info/ Do Kad ---..
    [Show full text]
  • Content Addressed, Versioned, P2P File System (DRAFT 3)
    IPFS - Content Addressed, Versioned, P2P File System (DRAFT 3) Juan Benet [email protected] ABSTRACT parties invested in the current model. But from another per- The InterPlanetary File System (IPFS) is a peer-to-peer dis- spective, new protocols have emerged and gained wide use tributed file system that seeks to connect all computing de- since the emergence of HTTP. What is lacking is upgrading vices with the same system of files. In some ways, IPFS design: enhancing the current HTTP web, and introducing is similar to the Web, but IPFS could be seen as a sin- new functionality without degrading user experience. gle BitTorrent swarm, exchanging objects within one Git Industry has gotten away with using HTTP this long be- repository. In other words, IPFS provides a high through- cause moving small files around is relatively cheap, even for put content-addressed block storage model, with content- small organizations with lots of traffic. But we are enter- addressed hyper links. This forms a generalized Merkle ing a new era of data distribution with new challenges: (a) DAG, a data structure upon which one can build versioned hosting and distributing petabyte datasets, (b) computing file systems, blockchains, and even a Permanent Web. IPFS on large data across organizations, (c) high-volume high- combines a distributed hashtable, an incentivized block ex- definition on-demand or real-time media streams, (d) ver- change, and a self-certifying namespace. IPFS has no single sioning and linking of massive datasets, (e) preventing ac- point of failure, and nodes do not need to trust each other.
    [Show full text]
  • African Development Bank Group Mali
    AFRICAN DEVELOPMENT BANK GROUP Public Disclosure Authorized Disclosure Public MALI PROJECT FOR THE ECONOMIC EMPOWERMENT OF WOMEN IN THE SHEA BUTTER SUBSECTOR (PAEFFK) Public Disclosure Authorized Disclosure Public RDGW DEPARTMENT November 2018 Translated document TABLE OF CONTENTS Currency Equivalents and Fiscal Year ................................................................................................................i Acronyms and Abbreviations ............................................................................................................................ii Project Brief ................................................................................................................................................... iii Project summary............................................................................................................................................... v Results-Based Logical Framework ................................................................................................................... vi Project Implementation Schedule ……………………………………………………………………………......viii I. STRATEGIC THRUST AND RATIONALE ............................................................................................ 1 1.1. Project Linkages with the Country Strategy and Objectives .................................................................. 1 1.2. Rationale for Bank Involvement ........................................................................................................... 1 1.3. Aid Coordination
    [Show full text]
  • A Distributed Content Addressable Network for Open Educational Resources
    DOI: 10.33965/celda2019_201911C063 16th International Conference on Cognition and Exploratory Learning in Digital Age (CELDA 2019) A DISTRIBUTED CONTENT ADDRESSABLE NETWORK FOR OPEN EDUCATIONAL RESOURCES Stephen Downes National Research Council Canada, Ottawa, Ontario, Canada ABSTRACT We introduce Content Addressable Resources for Education (CARE) as a method for addressing issues of scale, access, management and distribution that currently exist for open educational resources (OER) as they are currently developed in higher education. CARE is based on the concept of the distributed web (dweb) and, using (for example) the Interplanetary File System (IPFS) provides a means to distributed OER in such a way that they cannot be blocked or paywalled, can be associated with each other (for example, as links in a single site, or as newer versions of existing resources) creating what is essentially an open resource graph (ORG), and when accessed through applications such as Beaker Browser, can be cloned and edited by any user to create and share new resources. KEYWORDS Open Educational Resources, Distributed Web, Dat, Interplanetary File System, Open Resource Graph, Content-based Addressing 1. INTRODUCTION The distributed web potentially solves issues proponents of decentralization have long sought to address. One challenge is traffic, which overloads a single server. A second issue is latency, or the lag created by accessing resources half a world away. Additionally, some resources may be subject to national policies creating the need to differentiate access. And finally, if the centralized source is unavailable for some reason, then access for the entire world is disrupted. For the World Wide Web, many of these challenges are addressed with Content Distribution Networks (CDN).
    [Show full text]
  • An Efficient Holistic Data Distribution and Storage Solution for Online Social Networks Guoxin Liu Clemson University
    Clemson University TigerPrints All Dissertations Dissertations 12-2015 An Efficient Holistic Data Distribution and Storage Solution for Online Social Networks Guoxin Liu Clemson University Follow this and additional works at: https://tigerprints.clemson.edu/all_dissertations Recommended Citation Liu, Guoxin, "An Efficient Holistic Data Distribution and Storage Solution for Online Social Networks" (2015). All Dissertations. 1774. https://tigerprints.clemson.edu/all_dissertations/1774 This Dissertation is brought to you for free and open access by the Dissertations at TigerPrints. It has been accepted for inclusion in All Dissertations by an authorized administrator of TigerPrints. For more information, please contact [email protected]. An Efficient Holistic Data Distribution and Storage Solution for Online Social Networks A Dissertation Presented to the Graduate School of Clemson University In Partial Fulfillment of the Requirements for the Degree Doctor of Philosophy Computer Engineering by Guoxin Liu December 2015 Accepted by: Dr. Haiying Shen, Committee Chair Dr. Richard R. Brooks Dr. Kuang-Ching (KC) Wang Dr. Feng Luo Abstract In the past few years, Online Social Networks (OSNs) have dramatically spread over the world. Facebook [4], one of the largest worldwide OSNs, has 1.35 billion users, 82.2% of whom are outside the US [36]. The browsing and posting interactions (text content) between OSN users lead to user data reads (visits) and writes (updates) in OSN datacenters, and Facebook now serves a billion reads and tens of millions of writes per second [37]. Besides that, Facebook has become one of the top Internet traffic sources [36] by sharing tremendous number of large multimedia files including photos and videos.
    [Show full text]
  • Bittorrent App Change Download Location How to Seed Moved Or Renamed Files in Bittorrent
    bittorrent app change download location How to Seed Moved or Renamed Files in BitTorrent. BitTorrent etiquette dictates that once you've downloaded files, you also seed them to help others' downloads go quickly. If you need to move or rename your download, everything's thrown off. Reader Jake Champion explains how to seed after moving or renaming files. This feature is available in uTorrent version 1.8 and later. You can not change file names and keep seeding with earlier versions of uTorrent. You can use this guide if you move or rename the folder in addition to changing file names. It is possible to change file (and folder) names because they are not really parts of the files. They are like ''labels'' that tell the computer where to find the content of the files. After such changes, you need to retarget the files in uTorrent if you want to keep seeding the torrent. The procedure for doing this is the same no matter if the torrent is already loaded in uTorrent, or if you add it as a new torrent. Seed Renamed Files. 1. If the torrent is already loaded in uTorrent, right-click it and choose ''Stop'' (if it is not already stopped). If you are adding a new torrent with changed file names, make sure that you add it in stopped mode (or stop it very quickly if you fail to do this). 2. Select the torrent, then click the ''Files'' tab in the lower pane. (If you have no lower pane, click ''Options'' and check ''Show detailed info''.) 3.
    [Show full text]
  • Solid Over the Interplanetary File System
    Solid over the Interplanetary File System Fabrizio Parrillo Christian Tschudin Dept of Mathematics and Computer Science Dept of Mathematics and Computer Science University of Basel, Switzerland University of Basel, Switzerland <[email protected]> <[email protected]> Identity Providers Service Identity Providers Service Abstract—Solid is a moderate re-decentralization architecture Clients for the Web. In Solid, applications fetch data from storage Provider Provider providers called “pods” which implement discretionary access control, permitting users to grant and revoke access rights and HTTP over the Internet HTTP over the Internet thus keeping control over their data. In this paper, we propose Pod Providers Pod Providers to apply an additional de-verticalization by introducing IPFS as File, SPARQL, File, SPARQL, File, SPARQL, File, SPARQL, a common backend for Solid pods, the goal being to prevent pod Access Ctrl Access Ctrl Access Ctrl Access Ctrl provider lock-in and permitting users to consider “self-poding” Perssistent Perssistent IPFS backend where the heavy persistency task is out-sourced to IPFS. We have implemented this Solid-over-IPFS architecture using the open- source “Community Solid Server” and successfully ran several Fig. 1. Replacing Solid’s persistency layer with IPFS Solid applications over IPFS. Index Terms—IPFS, Solid, Dweb (see Fig. 1). In this way, IPFS becomes the uniting element I. INTRODUCTION among all Solid pod operators, effectively preventing them The Solid project [1] is Tim Berners Lee’s answer to the from holding the data hostage. Moreover, end-users and their massive centralization that has occurred in the Web: a few browser do not have to (but can) be IPFS-aware because this companies dominate the Web ecosystem and are safeguarding burden now shifts to the tech-savvy pod operators, offering a and mediating most of the world’s social media content.
    [Show full text]
  • Intro to Crypto Sci-Club Notes
    Intro to Crypto Sci-Club Notes April 25, 2019 Abstract Notes preparing for sci-club crypto-blockchain presentation. List of possible topics Start with history, review of core tech concepts. Finish with applications. • First large-scale use of blockchain was in WWII Enigma coding machines. Maybe explain how these work? • Concept of mixing/entropy. Concept of mixing at different time-scales (slow, fast). • Public key crypto mathematics theory. • Public key crypto examples: PGP/GPG, x.509 certs, SSL/https, onion routing described at general level • Crypto at a detailed level: ratchet, double ratchet, forward secrecy, what is it for, how does this work? • Nonce; using nonces for secret boxes; all secret boxes must have a different nonce else there is no forward secrecy • Merkle tree (hash tree): it doesn’t have to be an actual chain; it can be a tree of chains, leading up to a signed root. Thus the dat:// protocol and btrfs, zfs – Bitcoin has merkle trees inside of it; where? why? • Git as a blockchain, and why it was needed - to prevent source code fraud (real, not imagined - intentional covert, secret insertion of corruption/bugs into Linux kernel by unknown actors (possibly NSA??)) – Git is a blockchain of commits. – Each commit is a Merkle tree of file-objects (blobs) that captures the state of that filesystem tree at that particular moment. • File sharing, sleepycat/napster, 1 – Gnutella, BearShare, LimeWire, Shareaza, all use tiger tree hash • torrent architectures (bittorrent tracker is example of DHT) • Other weirdo crypto ideas: death-lottery. • Forking and fork resolution. A fork prevents maintanance of state (e.g.
    [Show full text]