Understanding User Tradeoffs for Search in Encrypted Communication

Understanding User Tradeoffs for Search in Encrypted Communication Wei Bai, Ciara Lynton, Charalampos Papamanthou and Michelle L. Mazurek University of Maryland, College Park Email: f Abstract—End-to-end message encryption is the only way to are end-to-end encrypted, searching their contents becomes achieve absolute message privacy. However, searching over complicated. Straightforward solutions in which the commu end-to-end encrypted messages is complicated. Several popular nication service maintains a central searchable index cannot instant messaging tools (e.g., WhatsApp, iMessage) circumvent be applied directly, as the communication service cannot this inconvenience by storing the search index locally on read and index messages. To avoid this, instant messaging the devices. Another approach, called searchable encryption, systems typically use a local index, in which messages allows users to search encrypted messages without storing the decrypted on a device are indexed (and can therefore be search index locally. These approaches have inherent tradeoffs searched) on that device. between usability and security properties, yet little is known An alternative to local indexing is to support search about how general users value these tradeoffs, especially in ing over encrypted content. During the past two decades, the context of email rather than instant messaging. In this researchers have made significant progress on searchable paper, we systematize these tradeoffs in order to identify key encryption, or mechanisms that can support searching en feature differences. We use these differences as the basis for crypted data on an untrusted server without revealing the a choice-based conjoint analysis experiment focused on email content of the messages or of the search queries [7]–[12]. (n=160), in which participants make a series of choices between These techniques are gradually being adopted by industry, email services with competing features. The results allow us to e.g., for databases. While they have not yet been applied specifically for messaging, researchers have begun to ex quantify the relative importance of each feature. We find that plore this potential use case. users indicate high relative importance for increasing privacy Both of these approaches to enable search for end-to and minimizing local storage requirements. While privacy is end encrypted messaging have benefits and drawbacks, in more important overall, local storage is more important than terms of security properties and in terms of utility for end adding additional marginal privacy after an initial improve users. To our knowledge, these tradeoffs have not been ment. These results suggest that searchable encryption may systematically explored from the point of view of end users. hold promise to fill a potential niche in meeting some users’ In this work, we take a first step toward filling that gap. communication needs. First, we systematize the tradeoffs of these solutions in four dimensions: search expressiveness, performance, secu 1. Introduction rity, and portability across devices. This approach allows us to identify key features (that apply to both messaging As people rely more and more heavily on online commu and email) that may influence users’ preferences. We then nication, privacy is becoming increasingly important. End apply this systematization to design a choice-based conjoint to-end encryption, in which the communications service analysis study focusing on email. We choose to study email cannot read the user’s content, is the only way to fully pro specifically because of the prominence of search in an email tect users’ online communications from malicious attackers, setting, as well as the tendency for email archives to be rela rogue company employees, and government surveillance. tively large and long-lived. Study participants were asked to In recent years, more online communication services, es make a series of choices between email-service profiles with pecially instant messaging tools, have adopted end-to-end different usability and security features; the results allow encryption; for example, WhatsApp and Apple’s iMessage us to quantify the importance of each feature relative to have brought end-to-end encrypted messaging to millions of the others in the email setting [13]–[15]. This methodology users by default [1], [2]. While these popular tools have been has been adopted in previous research to investigate users’ moving in the right direction for securing communication, online privacy concerns [16]–[18]. email adoption has not caught up [3]. Our experiment (n=160) compared six features that While end-to-end encryption provides important privacy may affect users’ preferences for encrypted communication: benefits, it can complicate functionality in a variety of ways price, multi-word search, partial-word search, privacy, local that can potentially inconvenience end users. As one exam storage and synchronization across devices. We find that ple, message searching (i.e., recovering a previously sent or privacy is the second-most important feature, after price: received message using keywords) is a critical function for participants indicated they would pay on average $0.85 per many users, especially for email [4]–[6]. When messages month to reduce service providers’ access to the contents of their messages and search queries. However, minimizing (CaaS), which split the trust between the communication the use of local storage was almost as highly valued. We service provider and the CaaS server, and found that user also find that participants’ web skills, along with their self- study participants could successfully use this approach to reported level of privacy concern, influence their choices. encrypt Facebook conversations [32], [35]. In pursuit of less Our results support the use of local indexing in many cumbersome and more secure approaches to key manage cases to maximize privacy. However, our results also suggest ment, Ryan extended the concept of certificate transparancy that searchable encryption for encrypted email may be a to end-to-end encrypted emails, and CONIKS allows key promising avenue to support those users who want more owners to monitor how their keys are distributed by a central privacy than in an unencrypted service but who also highly key server [33], [34]. Bai et al. investigated general users’ prioritize limiting use of storage on mobile devices. attitudes toward tradeoffs between security and usability We make the following concrete contributions: of different key management models, finding that models which are more convenient but less secure may be per • We systematize privacy and usability tradeoffs in ceived as “good enough” for everyday usage [36]. Lerner herent in different approaches to adding search to et al. proposed Confidente, a prototype that used Keybase 1 end-to-end encrypted communication. to centrally manage public keys and also possibly private • We conduct a conjoint analysis study to better un keys protected by users’ passwords [37]. They claimed derstand how end users value these various tradeoffs that lawyer and journalist participants made fewer mistakes for email. using Confidente made fewer mistakes than Mailvelope. • Based on these results, we make recommendations Some other studies evaluated modern end-to-end encrypted for the design of end-to-end encrypted email systems systems and found that users were still susceptible to Man and for further research. in-the-Middle (MitM) attacks due to human errors [38], [39]. Social factors and network effects also play an important 2. Related work role in adoption of secure messaging. In 2007, Solove found users were reluctant to encrypt their emails because they In this section, we discuss related work in four key areas: “have nothing to hide” [40]. Similarly, Gaw et al. in 2006 adoption and usability of end-to-end encryption; advances found that even employees in a sensitive advocacy organi in searchable encryption; habits of email search and orga zation regarded routine email encryption as “paranoid” and nization; and applications of conjoint analysis. socially undesirable [41]. More recently, researchers found that security and privacy played a minor role for general 2.1. End-to-end encryption, adoption, and usability users to adopt secure messaging tools [42], and journalists’ adoption of secure tools was driven by journalistic sources End-to-end-encrypted email communication was made and existed tools didn’t comply with their requirements [43]. more accessible by the release of PGP [19], which led to Abu-Salma et al. found that for general users, usability the development of several encrypted email systems (e.g., was not a primary obstacle to adoption; instead, fragmented [20], [21]). Since 2004, the Off-The-Record protocol has user bases, lack of interoperability, overall feature sets, and been used as the basis for several tools for secure instant limited understanding of security properties are significant messaging, including Signal [22] and WhatsApp [1]. Unger barriers [3]. These studies suggest that to promote adoption et al. surveyed secure communication tools in 2015, and of encrypted messaging, the community must consider not evaluated their security, usability and ease-of adoption prop just the usability of security features themselves, but rather erties [23]. the overall ecosystem of features and tradeoffs within which During the past two decades, researchers and practition users make adoption decisions. This study contributes

Understanding User Tradeoffs for Search in Encrypted Communication

Symmetric Asynchronous Ratcheted Communication with Associated Data

2012 07 26 Letter to Skype

Secure Messaging1

IM Security Documentation on Page Vi

Investigating Mobile Messaging Security

Zfone: a New Approach for Securing Voip Communication

Cryptographic Control Standard, Version

Exploring User Mental Models of End-To-End Encrypted Communication Tools

Developing Secure Communication Systems Using the TMS320C50 DSP

A Secure Mobile Communication System

How Secure Is Textsecure?

Is Bob Sending Mixed Signals?