Message in a Bottle: Sailing Past Censorship

Luca Invernizzi (UC Santa Barbara, California, USA)
Christopher Kruegel (UC Santa Barbara, California, USA)
Giovanni Vigna (UC Santa Barbara, California, USA)

ABSTRACT

Exploiting recent advances in monitoring technology and the drop in its cost, authoritarian and oppressive regimes are tightening their grip on the virtual lives of their citizens. Meanwhile, the dissidents oppressed by these regimes are organizing online, cloaking their activity with anti-censorship systems that typically consist of a network of anonymizing proxies. The censors have become well aware of this, and they are systematically finding and blocking all the entry points to these networks. So far, they have been quite successful. We believe that, to achieve resilience to blocking, anti-censorship systems must abandon the idea of having a limited number of entry points. Instead, they should establish first contact in an online location arbitrarily chosen by each of their users. To explore this idea, we have developed Message In A Bottle, a protocol where any blog post becomes a potential "drop point" for hidden messages. We have developed and released a proof-of-concept application of our system, and demonstrated its feasibility. To block this system, censors are left with a needle-in-a-haystack problem: Unable to identify what bears hidden messages, they must block everything, effectively disconnecting their own network from a large part of the Internet. This, hopefully, is a cost too high to bear.

Keywords

Censorship Resistance, Deniable Communications, Steganography

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. To copy otherwise, to republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
ACSAC ’13 December 09-13, 2013, New Orleans, LA, USA
Copyright 2013 ACM 978-1-4503-2015-3/13/12 ...$15.00.

1. INTRODUCTION

The revolutionary wave of protests and demonstrations known as the Arab Spring rose in December 2010 to shake the foundations of a number of countries (e.g., Tunisia, Libya, and Egypt), and showed the Internet's immense power to catalyze social awareness through the free exchange of ideas. This power is so threatening to repressive regimes that censorship has become a central point in their agendas: Regimes have been investing in advanced censoring technologies [43], and have even resorted to complete isolation from the global network in critical moments [14]. For example, Pakistan recently blocked Wikipedia, YouTube, Flickr, and Facebook [2], and Syria blocked citizen-journalism media sites [51].

To sneak past censorship, dissident populations have resorted to technology. A report from Harvard's Center for Internet & Society [44] shows that the most popular censorship-avoidance vectors are web proxies, VPNs, and Tor [17]. These systems share a common characteristic: They have a limited number of entry points. Blocking these entry points, and evading the block, has become an arms race: Since 2009, China has been enumerating and banning the vast majority of Tor's bridges [57]. In 2012, Iran took a more radical approach and started blocking encrypted traffic [54], a move that Tor countered the same day by deploying a new kind of traffic camouflaging [53].

In this paper, we take a step back and explore whether it is possible to design a system with so many available entry points that it is impervious to blocking, without disconnecting from the global network.

Let's generalize the problem at hand with the help of Alice, a dissident who lives in the oppressed country of Tyria. Alice wants to establish, for the first time, a confidential communication with Bob, who lives outside the country. To use any censorship-resistant protocol, Alice must know something about Bob, and how to bootstrap the protocol. In the case of anonymizing proxies or onion-routing/mix networks (e.g., Tor), Alice needs the address of at least one of the entry points into the network, and Bob's address. Also, to achieve confidentiality, Alice needs Bob's public key, or something equivalent to that. In protocols that employ steganography to hide messages in files uploaded to media-hosting sites (such as Collage [12]) or in network traffic (such as Telex [62]), Alice must know the location of the first rendezvous point, and Bob's public key¹.

¹ To be precise, in Telex, Alice does not need to know the precise location of the proxy running the Telex protocol. However, she needs to be aware that a Telex proxy will occur somewhere in the path to the decoy destination. Here, we consider this destination to be the rendezvous point she must know.

The fact that Alice needs to know this information inevitably means that the censor can learn it too (as he might pose as Alice). Bob cannot avoid this without some way to distinguish Alice from the censor (but this becomes a chicken-and-egg problem: How would Bob have come to know that, if he has never had any confidential communication with Alice before?). We believe that this initial something that Alice must know is a fundamental weakness of existing censorship-resistant protocols, which forms a crack in their resilience to blocking. For example, this is the root cause of the issues that Tor is facing when trying to distribute bridge addresses to its users without exposing these addresses to the censor [52]. It is because of this crack that China has been blocking the majority of Tor traffic since 2009 [57]: The number of entry points is finite, and a determined attacker can enumerate them by pretending to be Alice.

With Message In A Bottle (miab), we propose a protocol in which Alice needs to know as little as possible about Bob, and about how to bootstrap the protocol. In fact, we will show that it is enough for Alice to know Bob's public key, and nothing else. Alice must know at least Bob's public key to authenticate him, so that she can be sure she is not talking to a disguised censor. However, contrary to systems like Collage and Telex, there is no prearranged rendezvous point where Alice and Bob must meet.

This may now sound like a needle-in-a-haystack problem: If neither Alice nor Bob knows how to contact the other, how can they ever meet? To make this possible, and reasonably fast, miab exploits one of the mechanisms that search engines employ to generate real-time results from blog posts and news articles: blog pings. Using these pings as a broadcast system, Bob gets covertly notified that a new message from Alice is available, and where to fetch it from. We will show that, just like a search engine, Bob can monitor the majority of the blogs published on the entire Internet with limited resources, and in quasi real-time.

In some sense, every blog becomes a potential meeting point for Alice and Bob. However, there are over 165 million blogs online [8], and since a blog can be opened trivially by anybody, for our practical purposes they constitute an infinite resource. Based on one of our experiments (see Section 5.1), we will show that, to blacklist all of miab's potential drop points over a three-month period, the Censor would have to block at least 40 million fully qualified domain names and four million IP addresses (as we will show, these are conservative estimates). For comparison, blacklisting a single IP address would block Collage's support for Flickr (the only site implemented), and supporting additional media-hosting sites requires manual intervention for each one.

His intent is to discover and suppress the spreading of dissident ideas. Determining the current and potential capabilities of modern censors of this kind (e.g., Iran and China) is a difficult task, as the inner workings of the censoring infrastructure are often kept secret. However, we can create a reasonable model for our adversary by observing that, ultimately, the Censor will be constrained by economic factors. Therefore, we postulate that the censorship we are facing is influenced by these two factors:
• The censoring infrastructure will be constrained by its cost and technological effort;
• Over-censoring will have a negative impact on the economy of the country.

From these factors, we can now derive the capabilities of our Censor. We choose to do so in a conservative fashion, preferring to overstate the Censor's powers rather than understate them. We make this choice also because we understand that censorship is an arms race, in which the Censor is likely to become more powerful as technology advances and becomes more pervasive.

We assume that it is in the Censor's interest to let the general population benefit from Internet access, because of the social and economic advantages of connectivity. This is a fundamental assumption for any censorship-avoidance system: If the country runs an entirely separate network, there is no way out².

The Censor's Capabilities. As part of his infrastructure, the Censor might deploy multiple monitoring stations anywhere within his jurisdiction. Through these stations, he can capture, analyze, modify, disrupt, and store network traffic indefinitely. In the rest of the world, he will only be able to harness what is commercially available to him. The Censor can analyze traffic aggregates to discover outliers in traffic patterns, and profile encrypted traffic with statistical attacks on the packets' content
