Protecting Users by Confining Javascript with COWL Deian Stefan and Edward Z

Protecting Users by Confining JavaScript with COWL Deian Stefan and Edward Z. Yang, Stanford University; Petr Marchenko, Google; Alejandro Russo, Chalmers University of Technology; Dave Herman, Mozilla; Brad Karp, University College London; David Mazières, Stanford University https://www.usenix.org/conference/osdi14/technical-sessions/presentation/stefan This paper is included in the Proceedings of the 11th USENIX Symposium on Operating Systems Design and Implementation. October 6–8, 2014 • Broomfield, CO 978-1-931971-16-4 Open access to the Proceedings of the 11th USENIX Symposium on Operating Systems Design and Implementation is sponsored by USENIX. Protecting Users by Confining JavaScript with COWL † Deian Stefan∗ Edward Z. Yang Petr Marchenko Alejandro Russo Dave Herman Stanford Stanford Google Chalmers Mozilla Brad Karp David Mazieres` UCL Stanford ABSTRACT contributors to the tangle of JavaScript comprising an Modern web applications are conglomerations of application may not have the user’s best interest at heart. JavaScript written by multiple authors: application devel- The wealth of sensitive data processed in today’s web opers routinely incorporate code from third-party libraries, applications (e.g., email, bank statements, health records, and mashup applications synthesize data and code hosted passwords, etc.) is an attractive target. Miscreants may at different sites. In current browsers, a web application’s stealthily craft malicious JavaScript that, when incorpo- developer and user must trust third-party code in libraries rated into an application by an unwitting developer, vio- not to leak the user’s sensitive information from within lates the user’s privacy by leaking sensitive information. applications. Even worse, in the status quo, the only way Two goals for web applications emerge from the prior to implement some mashups is for the user to give her lo- discussion: flexibility for the application developer (i.e., gin credentials for one site to the operator of another site. enabling the building of applications with rich functional- Fundamentally, today’s browser security model trades pri- ity, composable from potentially disparate pieces hosted vacy for flexibility because it lacks a sufficient mechanism by different sites); and privacy for the user (i.e., to en- for confining untrusted code. We present COWL, a robust sure that the user’s sensitive data cannot be leaked from JavaScript confinement system for modern web browsers. applications to unauthorized parties). These two goals COWL introduces label-based mandatory access control are hardly new: Wang et al. articulated similar ones, and to browsing contexts in a way that is fully backward- proposed new browser primitives to improve isolation compatible with legacy web content. We use a series of within mashups, including discretionary access control case-study applications to motivate COWL’s design and (DAC) for inter-frame communication [41]. Indeed, to- demonstrate how COWL allows both the inclusion of un- day’s browsers incorporate similar mechanisms in the trusted scripts in applications and the building of mashups guises of HTML5’s iframe sandbox and postMessage that combine sensitive information from multiple mutu- API [47]. And the Same-Origin Policy (SOP, reviewed in ally distrusting origins, all while protecting users’ privacy. Section 2.1) prevents JavaScript hosted by one principal Measurements of two COWL implementations, one in from reading content hosted by another. Firefox and one in Chromium, demonstrate a virtually Unfortunately, in the status-quo web browser security imperceptible increase in page-load latency. architecture, one must often sacrifice privacy to achieve flexibility, and vice-versa. The central reason that flex- 1 INTRODUCTION ibility and privacy are at odds in the status quo is that Web applications have proliferated because it is so easy the mechanisms today’s browsers rely on for providing for developers to reuse components of existing ones. Such privacy—the SOP, Content Security Policy (CSP) [42], reuse is ubiquitous. jQuery, a widely used JavaScript li- and Cross-Origin Resource Sharing (CORS) [45]—are brary, is included in and used by over 77% of the Quant- all forms of discretionary access control. DAC has the cast top-10,000 web sites, and 59% of the Quantcast top- brittle character of either denying or granting untrusted million web sites [3]. While component reuse in the ven- code (e.g., a library written by a third party) access to erable desktop software model typically involves libraries, data. In the former case, the untrusted JavaScript might the reusable components in web applications are not lim- need the sensitive data to implement the desired appli- ited to just JavaScript library code—they further include cation functionality—hence, denying access prioritizes network-accessible content and services. privacy over flexibility. In the latter, DAC exercises no The resulting model is one in which web developers control over what the untrusted code does with the sen- cobble together multiple JavaScript libraries, web-based sitive data—and thus prioritizes flexibility over privacy. content, and web-based services written and operated by DAC is an essential tool in the privacy arsenal, but does various parties (who in turn may integrate more of these re- not fit cases where one runs untrusted code on sensitive sources) and build the required application-specific func- input, which are the norm for web applications, given tionality atop them. Unfortunately, some of the many their multi-contributor nature. ∗Work partly conducted while at Mozilla. In practice, web developers turn their backs on privacy †Work partly conducted while at Stanford. in favor of flexibility because the browser doesn’t offer 1 USENIX Association 11th USENIX Symposium on Operating Systems Design and Implementation (OSDI ’14) 131 primitives that let them opt for both. For example, a de- a.com$ b.com$ veloper may want to include untrusted JavaScript from XHR$ XHR$ another origin in his application. All-or-nothing DAC leads the developer to include the untrusted library with JavaScript$ JavaScript$ a script tag, which effectively bypasses the SOP, in- postMessage$ terpolating untrusted code into the enclosing page and DOM$API$ DOM$API$ granting it unfettered access to the enclosing page’s ori- DOM$ DOM$ gin’s content.1 And when a developer of a mashup that a.com$ b.com$ integrates content from other origins finds that the SOP Figure 1: Simplified browser architecture. forbids his application from retrieving data from them, he designs his mashup to require that the user provide the our evaluation (Section 6) illustrates that COWL incurs mashup her login credentials for the sites at the two other minimal performance overhead over the respective origins [2]—the epitome of “functionality over privacy.” baseline browsers. In this paper, we present COWL (Confinement with Origin Web Labels), a mandatory access control (MAC) 2 BACKGROUND,EXAMPLES,&GOALS system that confines untrusted JavaScript in web browsers. A single top-level web page often incorporates multiple 2 COWL allows untrusted code to compute over sensitive scripts written by different authors. Ideally, the browser data and display results to the user, but prohibits the un- should protect the user’s sensitive data from unauthorized trusted code from exfiltrating sensitive data (e.g., by send- disclosure, yet afford page developers the greatest pos- ing it to an untrusted remote origin). It thus allows web sible flexibility to construct featureful applications that developers to opt for both flexibility and privacy. reuse functionality implemented in scripts provided by We consider four motivating example web applica- (potentially untrusted) third parties. To make concrete the tions—a password strength-checker, an application that diversity of potential trust relationships between scripts’ imports the (untrusted) jQuery library, an encrypted cloud- authors and the many ways page developers structure based document editor, and a third-party mashup, none amalgamations of scripts, we describe several example of which can be implemented in a way that preserves web applications, none of which can be implemented with the user’s privacy in the status-quo web security archi- strong privacy for the user in today’s web browsers. These tecture. These examples drive the design requirements examples illustrate key requirements for the design of a for COWL, particularly MAC with symmetric and hierar- flexible browser confinement mechanism. Before describ- chical confinement that supports delegation. Symmetric ing these examples, however, we offer a brief refresher on confinement allows mutually distrusting principals each status-quo browser privacy polices. to pass sensitive data to the other, and confine the other’s 2.1 Browser Privacy Policies use of the passed sensitive data. Hierarchical confinement Browsing contexts Figure 1 depicts the basic building allows any developer to confine code she does not trust, blocks of the current web security architecture. A brows- and confinement to be nested to arbitrary depths. And ing context (e.g., a page or frame) encapsulates pre- delegation allows a developer explicitly to confer the priv- sentable content and a JavaScript execution environment ileges of one execution context on a separate execution (heap and code) that interacts with content through the context. No prior browser security architecture offers this Document Object Model (DOM) [47]. Browsing contexts combination

Load more