
A Peer-to-peer XML Database Thomas Risse, Predrag Knezevic Fraunhofer IPSI Dolivostrasse 15 64293 Darmstadt Germany {risse, knezevic}@ipsi.fhg.de © Fraunhofer IPSI 1 Introduction 4Short history of the Internet and P2P systems • DNS, Usenet • Moving to client/server model 4Trends • Moore’s law in the real life • More freedom from the infrastructure © Fraunhofer IPSI 2 Seite 1 1 Current Status Systems 4Content sharing • Napster, Gnutella, KaZaA, eDonkey,OverNet 4Content storing • OceanStore, Freenet, Past, GNUNet Drawbacks 4Working on the file level 4File not writable after storing 4Poor searching capabilities © Fraunhofer IPSI 3 Motivation A P2P database could be useful as 4A basis for P2P applications • Resource sharing • Data sharing 4An ad-hoc storage in business processes and workflows 4A distributed discovery service 4A storage for sensor and ad-hoc networks 4A distributed index for the Web 4A service for Grid computing © Fraunhofer IPSI 4 Seite 2 2 Example 1: Content Sharing KaZaA, Gnutella or Sharing of metadata about songs similar… (author, title, location,…) © Fraunhofer IPSI 5 Example 2: Ad-hoc Storage for Web Services © Fraunhofer IPSI 6 Seite 3 3 Example 4: Sensor Networks Measuring current, gas, water spending © Fraunhofer IPSI 7 Related Work 4Distributed databases • XPRS, Gamma, Teradata, Tandems NonStopSQL 4Distributed filesystems • NFS, AFS, xFS 4Drawbacks: • Made for stable, well connected environments • Crashed node eventually replaced • Global system view © Fraunhofer IPSI 8 Seite 4 4 Related Word (contd.) 4Existing P2P storages • OceanStore, Freenet, GNUNet, Freehaven 4Drawbacks: • Coarse-grained granularity • Lack of relationships among stored objects • Updating difficulties • Limited querying possibilities © Fraunhofer IPSI 9 Proposed Solution 4XML documents are spread in the community 4Peers store only document parts 4Documents are modified by the community during the system run-time 4P2P XML datastore mimics DOM interface 4Every XML document has a tree representation 4Tree structures can be represented using hash table structures 4Distributed tree -> DHT © Fraunhofer IPSI 10 Seite 5 5 P2P Datastore challenges The same like in classical databases 4Durability 4Consistency 4Reliability 4Concurrency and transcations 4Security 4Scalability © Fraunhofer IPSI 11 Tree Operations 4Creating a root node • Who will create the root? 4Getting a node • Figuring out what is the current value 4Updating a node • Propagate changes to all replicas 4Adding a node 4Removing a node Node reference is DHT key © Fraunhofer IPSI 12 Seite 6 6 DOM Particularities Serialized object (XML) Serialized object (DOM) <song> <title> <song> Title Name </title> <author> Author Name <title> <author> <publisher> </author> <publisher> Publisher Name </publisher> </song> Title Name Author Name Publisher Name 4Serialized objects are DOM sub-trees 4Managing all nodes separately has drawbacks: • Complicated undo • Objects spread across many peers • Objects consume larger portion of key space © Fraunhofer IPSI 13 Querying 4XPath and XQuery use DOM for accessing XML documents 4It is possible to apply them directly on the top of proposed storage 4Index structures are needed to get decent performances © Fraunhofer IPSI 14 Seite 7 7 System Architecture Applications Query Engine P2P-DOM Index Manager DHT Abstraction Layer Local Local storage DHT storage Network Layers © Fraunhofer IPSI 15 Summary Peer-to-peer is a hype today Peer-to-Peer can be a powerfull architecture 4Scalability, availability, flexibility, ... But we need a more sophisticated data managment 4Proposed P2P DOM for datastorage © Fraunhofer IPSI 16 Seite 8 8.
Details
-
File Typepdf
-
Upload Time-
-
Content LanguagesEnglish
-
Upload UserAnonymous/Not logged-in
-
File Pages8 Page
-
File Size-