Cross-Origin State Inference (COSI) Attacks: Leaking Web Site States through XS-Leaks

Avinash Sudhodanan (IMDEA Software Institute), Soheil Khodayari (CISPA Helmholtz Center for Information Security), Juan Caballero (IMDEA Software Institute)

arXiv:1908.02204v2 [cs.CR] 31 Jan 2020

Abstract—In a Cross-Origin State Inference (COSI) attack, an attacker convinces a victim to visit an attack web page, which leverages the cross-origin interaction features of the victim's web browser to infer the victim's state at a target web site. Multiple instances of COSI attacks have been found in the past under different names such as login detection or access detection attacks. However, those attacks only consider two states (e.g., logged in or not) and focus on a specific browser leak method (or XS-Leak). This work shows that mounting more complex COSI attacks such as deanonymizing the owner of an account, determining if the victim owns sensitive content, and determining the victim's account type often requires considering more than two states. Furthermore, robust attacks require supporting a variety of browsers since the victim's browser cannot be predicted a priori. To address these issues, we present a novel approach to identify and build complex COSI attacks that differentiate more than two states and support multiple browsers by combining multiple attack vectors, possibly using different XS-Leaks. To enable our approach, we introduce the concept of a COSI attack class. We propose two novel techniques to generalize existing COSI attack instances into COSI attack classes and to discover new COSI attack classes. We systematically apply our techniques to existing attacks, identifying 40 COSI attack classes. As part of this process, we discover a novel XS-Leak based on window.postMessage. We implement our approach into Basta-COSI, a tool to find COSI attacks in a target web site. We apply Basta-COSI to test four stand-alone web applications and 58 popular web sites, finding COSI attacks against each of them.

I. INTRODUCTION

In a Cross-Origin State Inference (COSI) attack, the attacker's goal is to determine the state of a victim visiting an attack page (e.g., attack.com/index.html), in a target web site not controlled by the attacker (e.g., linkedin.com). The state of the victim in a target web site is defined, among others, by login status, account, and content properties. Determining the victim's state can have important security implications. For example, determining that a victim is logged into a target web site implies that the victim owns an account in that site. This is problematic for privacy-sensitive web sites such as those related to post-marital affairs and pornography. Determining content ownership can be used to establish if a program committee member is reviewing a specific paper in a conference management system, or if the victim has uploaded some copyrighted content to an anonymous file sharing site. Determining if the victim owns a specific account, i.e., deanonymizing the account owner, enables identifying which company employee runs an anonymous blog criticizing the company's management. Such state inferences are even more critical when the attacker is a nation state that performs censorship and can determine if the victim has an account in, or is the administrator of, some prohibited web site. The problem is aggravated by COSI attacks being web attacks, which can be performed even when the victim employs anonymization tools such as a virtual private network.

In a COSI attack, the attacker convinces the victim to visit an attack page. The attack page includes at least one state-dependent URL (SD-URL) from the target web site, whose response depends on the state of the visitor. For example, an SD-URL may point to some content in the target web site only accessible when the victim has a specific state such as being authenticated. The inclusion forces the victim's browser to send a cross-origin request to the target web site. Since the request is cross-origin, the same-origin policy (SOP) prevents the attack page from directly reading the response. However, the attacker can leverage a browser leak method (also known as XS-Leak) to infer, from the cross-origin response, the victim's state at the target web site.

Multiple instances of COSI attacks have been found in the last 13 years by both security analysts (e.g., [26], [27], [33], [36], [40], [51]) and academics (e.g., [21], [31], [38], [56], [64]), with roughly half of them being presented in the last four years, and several in 2019 (e.g., [55], [56], [61]). However, they have previously been considered as sparse attacks under different names such as login detection attacks [34], [35], [51], [56], login oracle attacks [50], [57], cross-site search attacks [31], URL status identification attacks [47], and cross-site frame leakage attacks [55]. As far as we know, we are the first to systematically study these attacks and group them under the same COSI attack denomination.

Previous works have several limitations. First, they consider two states. For example, login detection attacks differentiate if the victim is logged in or not, and access detection attacks if the victim has previously accessed a site or not. However, sites typically have more than two states. Considering only two states limits the type of attacks that can be launched, and can introduce false positives, e.g., determining that a victim is logged in when he is not. A second limitation is that they often test attacks only on one browser, thus the attack may not work on other browsers. To address both issues, we present a novel approach to identify and build complex COSI attacks by combining multiple attack vectors in order to handle more than two states and multiple browsers. For example, our approach identifies a COSI attack against HotCRP that determines if the victim, i.e., a program committee member using Chrome, Firefox, or Edge, is the reviewer of a submitted paper. This attack involves multiple states (e.g., author, reviewer, logged out) and requires two COSI attack vectors: one to determine if the victim is logged in and another to determine if a logged-in victim is reviewing the paper.

A third limitation is that they focus on a specific XS-Leak. Instead, our approach is generic; it supports all known XS-Leaks and can easily accommodate new ones. For example, it incorporates a novel XS-Leak we have discovered based on window.postMessage, which affects popular sites such as blogger.com, ebay.com, reddit.com, and youtube.com. At the core of our generic approach is the concept of a COSI attack class, which defines the SD-URLs that can be attacked using a specific XS-Leak, the affected browsers, and the set of inclusion methods (i.e., HTML tags and DOM methods) that can be used to include the SD-URL in the attack page. To identify attack classes we propose a novel generalization technique that, given a previously known COSI attack, generalizes it into an attack class that covers many other attack variants. We also propose an amplification technique that identifies previously unknown variations, e.g., attack classes using different inclusion methods. We systematically explore the literature to identify previously known COSI attack instances and apply our generalization and amplification techniques on them. This process identifies 40 COSI attack classes, of which 19 generalize prior attacks and 21 are new variations.

We implement our approach into a tool called Basta-COSI, publicly available as part of the open-source ElasTest platform [4]. Given as input a target web site and state scripts defining the user states at the target web site, Basta-COSI identifies SD-URLs in the target web site, tests if those SD-URLs can be attacked using any of the 40 attack classes, and produces attack pages that combine multiple attack vectors to uniquely identify a state. We have applied Basta-COSI to 62 targets: four stand-alone web applications (HotCRP, GitLab, GitHub, and OpenCart) and 58 popular web sites. Basta-COSI

State Attribute           Possible Values
Login Status              (a) Logged in
                          (b) Not logged in
Single Sign-On Status     (a) Logs in via a specific SSO service
                          (b) Logs in via another SSO service
Access Status             (a) Has previously accessed
                          (b) Has not previously accessed
Account Type              (a) Has a premium account
                          (b) Has a regular account
Account Age Category      (a) Age above a certain threshold
                          (b) Age below a certain threshold
Account Ownership         (a) Owner of a specific account
                          (b) Not the owner of an account
Content Ownership         (a) Owner of a specific content
                          (b) Not the owner of a content

TABLE I: Examples of user states in a target web site.

• We implement our approach into Basta-COSI, a tool to find COSI attacks in a target web site. We apply Basta-COSI to 62 targets including stand-alone web applications and popular live sites. We find COSI attacks against all of them, enabling account deanonymization, account type inference, SSO status, login detection, and access detection.

• We have released Basta-COSI as part of the security service of the ElasTest open-source platform for testing cloud applications [4].

II. OVERVIEW

This section provides an overview of COSI attacks. Section II-A details the user state at a target web site. Section II-B describes the two phases of a COSI attack. Section II-C discusses handling more than two states. Finally, Section II-D presents the COSI attack threat model.

A.
