University of Nevada, Reno Anonymous Communication Systems

University of Nevada, Reno Anonymous Communication Systems: Usage Analysis and Attack Mechanisms A thesis submitted in partial fulfillment of the requirements for the degree of Master of Science in Computer Science by Esra Erdin Dr. Mehmet Hadi Gunes/Thesis Advisor May, 2012 THE GRADUATE SCHOOL We recommend that the thesis prepared under our supervision by ESRA ERDIN entitled Anonymous Communication Systems: Usage Analysis And Attack Mechanisms be accepted in partial fulfillment of the requirements for the degree of MASTER OF SCIENCE Mehmet Hadi Gunes, Ph. D., Advisor George Bebis, Ph. D., Committee Member Murat Yuksel, Ph. D., Committee Member Cahit Evrensel, Ph. D., Graduate School Representative Marsha H. Read, Ph. D., Dean, Graduate School May, 2012 i Abstract Privacy in communication has been an essential security requirement with the rapid growth of Internet. This has resulted in growing interest in methods for anonymous communication. Several system designs have been developed with the common aim of preserving communication privacy within the shared network environment. In this study, the structure of these systems is explained and an analysis of current anonymity technologies; proxy servers, remailers, JAP, Tor and I2P, is presented. Because Tor is the largest anonymity system currently in use, a detailed analysis is also presented. Moreover, growing interest in anonymity systems resulted in several de-anonymization techniques developed to exploit these anonymous communication systems. These techniques are also explained to better understand the security threats of anonymous communication. ii Acknowledgments I would like to express my deepest gratitude to my advisor Dr. Mehmet Hadi Gunes for his guidance, understanding and patience he showed during this study. I also would like to thank to my colleagues BingDong Li and Christopher Zachor for their effort and help to put together this project. Finally, I could never have been able to finish this thesis without the endless support and love of my family. With all my sincerity, I would like to thank to my family. This work was supported in part by National Institute of Justice with grant number 2010-DN-BX-K248. Esra Erdin University of Nevada, Reno May 2012 iii Contents Abstract i Acknowledgments ii List of Figures v Chapter 1 Introduction 1 Chapter2 AnonymizationTechnologyUsage 6 2.1 ProxyServer................................ 6 2.2 Remailer.................................. 8 2.3 MixNetwork ............................... 11 2.4 OnionRouting .............................. 13 2.4.1 TheOnionRouting........................ 14 2.4.2 InvisibleInternetProject. 21 Chapter 3 Tor Usage Analysis 24 3.1 RelatedWork ............................... 24 3.2 TorMeasurementStudy ......................... 25 Chapter 4 Application Perspective 30 iv Chapter 5 Deanonymization 33 5.1 Clickjacking ................................ 33 5.2 CodeInjection............................... 36 5.3 WebsiteFingerprinting . 36 5.4 ActiveDocuments............................. 37 5.4.1 PdfDocuments .......................... 37 5.4.2 MacroWordAttacks . 38 5.5 URIMethods ............................... 40 5.6 Network-LevelAttacks . 42 5.6.1 TimingAttack .......................... 42 5.6.2 PredecessorAttack . 43 5.6.3 IntersectionAttack . 43 5.6.4 MultiplicationAttack. 43 5.6.5 CircuitClogging ......................... 44 Chapter6 ConclusionsandFutureWork 45 Bibliography 47 v List of Figures 2.1 ProxyServer................................ 7 2.2 Geographic Proxy Distribution (log-scale) . ..... 9 2.3 Geographic Proxy Distribution (cont.) (log-scale) . ........ 10 2.4 Remailer Geo-Distribution (log-scale) . ..... 11 2.5 AMixProcess............................... 12 2.6 JAPGeo-Distribution(logscale) . 13 2.7 TheOnionRouter(Tor)communication . 15 2.8 Geographic Tor Server Distribution (log-scale) . ....... 17 2.9 Geographic Tor Server Distribution (cont.) (log-scale)......... 18 2.10 Geographic Tor Bandwidth Conribution (log-scale) . ....... 19 2.11 Geographic Tor Bandwidth Conribution (cont.) (log-scale) ...... 20 2.12 Geographic I2P Server Distribution (log-scale) . ........ 22 2.13 Geographic I2PBandwidth Contribution (log-scale) . ........ 23 3.1 TorUsage(log-scale) ........................... 27 3.2 GeographicTORServerDistribution . 28 3.3 GeographicTORUserDistribution . 28 3.4 Client Version Distribution (log-scale) . ...... 29 vi 4.1 CombinedSpamData .......................... 32 5.1 A link on the website that provides clickjacking . ..... 34 5.2 JavaScriptCode.............................. 35 5.3 FacebookLikepage............................ 36 5.4 CreatingButtoninPdfFile . 38 5.5 FeaturesofButtonField . 39 5.6 EmbeddedVideoFile........................... 40 5.7 FeaturesofEmbeddingOperation . 41 1 Chapter 1 Introduction Privacy in communication has been an essential security requirement with the rapid growth of the Internet. Each time when we visit a website or send an email, our client sends packets of data through the Internet that contain essential information like who is sending the message and who is the receiver of the packet. Although the data transmitting via several hops from source to the destination may be encrypted, IP header is still visible to an observer. Adversaries can access significant information about the traffic carried between the sender and the receiver of the data. By observing communication through a link, third parties can easily identify who is communicating with whom based on source and destination IP addresses. The term anonymizer refers to varios tools that help the users to keep their activities private. Many situations like freedom of speech, anticensorship issues caused a growing interest in anonymous communication through the Internet. There are different situations in which someone wants to communicate anonymously on the Internet. Some organizations such as governments, military services may want some topics to remain secret as they are so critical and significant and their exposure may 2 be inconvenient or harmful to an investigation. Law enforcement agencies may want to design online forms that people can provide hint or other information they may have without the fear of retribution or punishment. Even private citizens may want to browse the Web without advertisers collecting statistics on their personal browsing habits. There are also people living under oppressive regimes that try to limit the rights of their citizens especially on the Internet [17]. Citizens of oppressive countries can express their thoughts, ideas through anonymizer networks. Because of all the situations mentioned above, researchers design anonymity systems that build an overlay network running on top of the Internet. By using anonymizer systems, a user can communicate with another one without revealing their identities like IP addresses and location. Since milestone research on anonymous communications Chaum [12], research on these systems has extended to many areas. Anonymous communication, traffic analysis, provable shuffles, anonymous emails, anonymous publications, private information retrieval, taxonomy, security and improvement, communication censorship, and anonymous voting are some of the areas that researchers have focused on [3,14,17,27]. Several anonymity services are provided by either commercial com- panies or open source developers. Anonymizer.com is one of the commercial websites that requires its users to pay a subscription fee to keep their identity private while browsing web via Anonymizer.com servers. Moreover, Tor Onion Router (TOR) [16] and Invisible Internet Project (I2P) [4] are open source systems which are hosted by volunteers. A communication network is composed of messages, senders, that send messages, and recipients, that receive messages. Sender and recipient can be any client, server or peer in a communication network. These subjects exchange messages via 3 public communication channels. There is also a possibility to be an attacker who may be interested in monitoring the traffic transmitted between sender and recipient. The attacker can be outside of the communication network, or inside of the communication network. In order to better understand anonymous communication systems, it is essential to understand the terms anonymity, unlinkability, and unobservability. Anonymity: In their proposal Pfitzman and Hansen describe anonymity as ¨the state of being not identifiable within a set of subjects, the anonymity set” [37]. All subjects that can cause an action constitute the anonymity set. In other words, a message can possibly be sent by any of the senders who belong to his/her sender anonymity set and, similarly, a message can be received by any recipient who might be anonymous within a set of recipient anonymity set. Here, we can consider a subject as a client, server or peer in a network. All subjects might have their own anonymity sets. Unlinkability: A subject of the system can make multiple uses of a resource without an adversar being able to link to these uses to the particular subject [41]. In other words, an adversary observing the senders and recipients of a network is unable to identify the communicating parties and the ability of understanding the communication between joined parties does not increase by observing the network. The relation between anonymity and unlinkability is that anonymity of an item in a system is not linkable to any particular identity and the anonymity of a particular identity is not linkable to any item [41]. More clearly, any particular

University of Nevada, Reno Anonymous Communication Systems

Poster: Introducing Massbrowser: a Censorship Circumvention System Run by the Masses

In Computer Networks, A

Threat Modeling and Circumvention of Internet Censorship by David Fifield

How to Download Torrent Anonymously How to Download Torrent Anonymously

World-Wide Web Proxies

Mixminion: Design of a Type III Anonymous Remailer Protocol

The Design, Implementation and Operation of an Email Pseudonym Server

Privacy-Enhancing Technologies for the Internet

A Concept of an Anonymous Direct P2P Distribution Overlay System

Social Network Based Anonymous Communication in Tor Peng Zhou, Xiapu Luo, Ang Chen, and Rocky K

Technical and Legal Overview of the Tor Anonymity Network

El Gamal Mix-Nets and Implementation of a Verifier