A New Completely Decentralized Communication System Over IP

Bachelor’s Degree Thesis A New Completely Decentralized Communication System Over IP Nicholas Helke August 4, 2013 Prof. Stephan Robert HEIG-VD Nicholas Helke Abstract The Internet Protocol (IP) was designed to be resilient to topology changes. It should therefore be possible to build a resilient communication system on top of IP, in fact consumer communication systems deployed today all rely to some extend on central servers. This makes these communication systems vulnerable should the central servers become unreachable for any reason. This memoir proposes a strategy for new communication system over IP that is completely decentralized, i.e. that doesn’t rely on central servers at all. This strategy uses the BitTorrent network’s Distributed Hash Table (DHT) (which is based on Kademlia) to map users to IP addresses in a completely decentralized manner. An application network protocol and a proof of concept software application which implement the proposed strategy are also discussed in this memoir. V2 i ii Nicholas Helke Terms of reference The purpose of the bachelor’s degree project of which this memoir is the report is to produce a new completely decentralized communication system over Internet Protocol (IP). The key words in [3] are to be interpreted in the context of this chapter as they are defined in said document. The development of this new system is divided into two parts: 1. A theoretical strategy for establishing a communication over IP in a completely decentralized manner. This strategy must meet the following requirements: a) The strategy must enable a conforming implementation to establish a point- to-point connection over IP between two nodes with public Internet Protocol Version 4 (IPv4) addresses where the user need only know the public key of participating node he wishes to contact. b) The strategy can not rely on a central server to establish a mapping from public keys to IPv4 address, i.e. the strategy must be “pure” Peer-to-Peer (P2P) according to Definition 2 of [29] c) A central seeding server or list of servers may be used to seed fresh instal- lations of a conforming implementation with the necessary information to join the system. Once a fresh node has been seeded however, it must be able to continue to function without ever relying on the seeding server or servers again. d) The strategy must be resilient in the face of network splits, i.e. communication must remain possible within the each subnets after the split. e) The strategy must enable an established connection to transport arbitrary data (e.g. text, voice, video or files) in both directions between the nodes. f) The strategy may employ a variety of other strategies in order to extend the applicability of this strategy to nodes behind firewalls and/or Network Ad- dress Translation (NAT). 2. A practical implementation of the above strategy in the form of a proof of concept software application and associated application network protocol. This proof of concept application and network application protocol must meet the following requirements: a) The network application protocol must be architecture, platform and lan- guage agnostic. V2 iii b) The network application protocol must allow for text communication and must be extendible to support arbitrary forms of communication (e.g. voice, video or files). This may be achieved by building feature negotiation into the protocol. c) The proof of concept application must support Linux and may use any Graph- ical User Interface (GUI) available on the platforms it supports. iv Nicholas Helke Contents 1. Introduction 1 2. Existing works 3 2.1. Bitmessage .................................... 3 2.2. Completely decentralised DNS ........................ 4 2.2.1. Dot-P2P ................................. 4 2.2.2. Namecoin ................................ 5 3. Challenges of Completely Decentralised Communication 7 3.1. Finding Peers in a Completely Decentralised Manner ........... 7 3.1.1. Decentralised data storage ....................... 7 3.1.2. BitTorrent’s Mainline DHT ...................... 9 3.1.3. Kademlia ................................ 10 3.2. Establishing the connection .......................... 14 3.3. Security ...................................... 16 3.3.1. Obfuscation ............................... 16 3.3.2. Handshake ............................... 18 3.3.3. Encryption ................................ 19 3.3.4. Web of Trust ............................... 19 3.4. Ensuring resilience to topology changes ................... 20 4. Implementation 23 4.1. Designing the program’s architecture and choosing a programming lan- guage ....................................... 23 4.2. Reference implementation’s application architecture ............ 26 4.3. Finding peers .................................. 27 4.4. Establishing a connection ........................... 31 4.4.1. Using SSDP and UPnP to create port mappings in compatible routers 31 4.4.2. On not implementing PCP née NAT-PMP .............. 39 4.4.3. Confronting our hypotheses with reality .............. 39 4.5. Security ...................................... 41 4.5.1. Public Key Infrastructure ....................... 41 4.5.2. Adapting the handshake to symmetric NAT initiators ....... 41 4.6. Communication protocol ............................ 42 4.6.1. Basic protocol .............................. 42 4.6.2. Extending the protocol ......................... 43 V2 v Contents 5. Conclusion 47 Bibliography 49 Acronyms 53 Appendices A. Dictator Breaker User Manual A1 A.1. How to run .................................... A1 A.2. Command line flags ............................... A1 B. Dictator Breaker Code B1 B.1. go.nhelke.com/dictator-breaker/main.go .................. B1 B.2. go.nhelke.com/dictator-breaker/ws.go ................... B7 B.3. go.nhelke.com/dictator-breaker/messaging/message.go ......... B9 B.4. go.nhelke.com/dictator-breaker/messaging/message_test.go ...... B10 B.5. go.nhelke.com/dictator-breaker/security/openpgp.go .......... B10 B.6. go.nhelke.com/dictator-breaker/security/openpgp_test.go ........ B14 B.7. go.nhelke.com/dictator-breaker/views/chat.html ............. B15 B.8. Known issues .................................. B17 C. GoUPNPc C1 C.1. github.com/nhelke/goupnpc/README.mdown ............. C1 C.2. github.com/nhelke/goupnpc/cmd.go .................... C2 C.3. github.com/nhelke/goupnpc/goupnp/goupnp.go ............ C3 C.4. github.com/nhelke/goupnpc/goupnp/goupnp_test.go .......... C9 C.5. github.com/nhelke/goupnpc/goupnp/backend.go ............ C10 C.6. github.com/nhelke/goupnpc/goupnp/backend_test.go ......... C14 C.7. github.com/nhelke/goupnpc/goupnp/ssdp.go .............. C15 C.8. github.com/nhelke/goupnpc/goupnp/ssdp_test.go ........... C19 C.9. Known issues .................................. C21 D. Patches submitted to open source projects D1 D.1. Patch submitted to github.com/nictuku/dht ................ D1 D.2. Patch submitted to github.com/monnand/dhkx .............. D5 E. GNU General Public License Version 2 E1 vi Nicholas Helke 1. Introduction It was the Arab Spring that first alerted me to the need for a completely decentralized communication system1. The falling dictatorships desperately attempted to thwart the revolutions by cutting and disrupting internet communication[8]. The internet was designed to be and is resistant to topology changes, however the countries of the Arab Spring had a limited number of backbone connections to the outside world, which were all under government control. The failing governments cut themselves off from the rest of the world by destroying the routing tables of the edge routers. This was surprisingly effective at disrupting communications. The number of people relying on American and European based services had been underestimated, or at least there was no widely known data about the penetration of such services. It turns out Facebook, Gmail, Skype and Twitter where the principal tools of communication for the revolutionaries and so when they were cut off from the rest of the world they were no longer able to communicate internally because they were relying on server-based services based outside their isolated subnet. Despite still be- ing physically connected to one another, they where unable to communicate as they no longer had access to the central server. A worrying trend amongst all countries developing and developed alike are increas- ing violations of civil liberties such as [12]. A decentralised communication therefore would also provides some protection against unlawful surveillance although this is more debatable given the governments increased regulation and control of Internet Ser- vice Providers (ISPs). This brought to light the necessity for a completely decentralized communication system over Internet Protocol (IP), one that would be as resilient as IP is itself. It should also be mentioned that such a system would also be beneficial to users on accidental topology changes, such as the recent failure of SEA-ME-WE 4 due to criminal sabotage[1, 2] or cases of accidental failure. Some effort has gone into structuring this memoir for linear reading. It has not however been possible to eliminate forward-references completely. In order to fully grasp some of the short comings of the solutions in chapter 2 an understanding of some no- tions of the theory from chapter 3 can be helpful. This forward-reference can be avoided by skipping chapter 2 entirely. It is of no consequence to the comprehension of the solu- tion proposed by this memoir, it serves principally to

Load more