A Security Policy Oracle: Detecting Security Holes Using Multiple API Implementations

Varun Srivastava (Yahoo!)    Michael D. Bond (The Ohio State University)    Kathryn S. McKinley and Vitaly Shmatikov (The University of Texas at Austin)
varun [email protected]    [email protected]    {mckinley,shmat}@cs.utexas.edu

Abstract

Even experienced developers struggle to implement security policies correctly. For example, despite 15 years of development, standard libraries still suffer from missing and incorrectly applied permission checks, which enable untrusted applications to execute native calls or modify private class variables without authorization. Previous techniques for static verification of authorization enforcement rely on manually specified policies or attempt to infer the policy by code-mining. Neither approach guarantees that the policy used for verification is correct.

In this paper, we exploit the fact that many modern APIs have multiple, independent implementations. Our flow- and context-sensitive analysis takes as input an API, multiple implementations thereof, and the definitions of security checks and security-sensitive events. For each API entry point, the analysis computes the security policies enforced by the checks before security-sensitive events such as native method calls and API returns, compares these policies across implementations, and reports the differences. Unlike code-mining, this technique finds missing checks even if they are part of a rare pattern. Security-policy differencing has no intrinsic false positives: implementations of the same API must enforce the same policy, or at least one of them is wrong!

Our analysis finds 20 new, confirmed security vulnerabilities and 11 interoperability bugs in the Sun, Harmony, and Classpath implementations of the Java Class Library, many of which were missed by prior analyses. These problems manifest in 499 entry points in these mature, well-studied libraries. Multiple API implementations are proliferating due to cloud-based software services and standardization of library interfaces. Comparing software implementations for consistency is a new approach to discovering "deep" bugs in them.

Categories and Subject Descriptors D. Software [D.2 Software Engineering]: D.2.4 Software/Program Verification; K. Computing Milieux [K.6 Management of Computing and Information Systems]: K.6.5 Security and Protection

General Terms Languages, Security, Verification

Keywords Security, Authorization, Access Control, Static Analysis, Java Class Libraries

PLDI'11, June 4–8, 2011, San Jose, California, USA. Copyright © 2011 ACM 978-1-4503-0663-8/11/06...$10.00

1. Introduction

Demand for secure software is increasing, but ensuring that software is secure remains a challenge. Developers are choosing memory-safe systems [9, 39] and languages such as Java and C# in part because they improve security by reducing memory-corruption attacks. Even memory-safe systems, however, rely on the access-rights model to ensure that the program has the appropriate permissions before performing sensitive actions. Unfortunately, even experienced developers find it difficult to specify and implement access-rights policies correctly. Consequently, semantic mistakes—such as missing permission checks which enable malicious code to bypass protection—have become a significant cause of security vulnerabilities [26].

This paper presents a new approach to finding security vulnerabilities in software libraries, also known as Application Programming Interfaces (APIs). We leverage the increasing availability of multiple, independent implementations of the same API. By design, these implementations must be interoperable and must implement semantically consistent security policies. A security policy in the access-rights model consists of a mapping between security checks, such as verifying permissions of the calling code, and security-sensitive events, such as writing to a file or the network. We exploit this consistency requirement to build a security policy oracle, which accurately (i) derives security policies realized by each implementation and (ii) finds incorrect policies.

Previous static analysis techniques for finding security vulnerabilities either rely on manual policy specification and/or verification, or infer frequently occurring policies by code-mining. For example, Koved et al. derive a policy and then require programmers to check by hand that permissions held at all sensitive points in the program are sufficient [22, 28]. Sistla et al. take as input manually specified check-event pairs and verify that all occurrences of the event are dominated by the corresponding check [30]. These approaches are error-prone and limited in expressiveness. There are hundreds of potential security-sensitive events and 31 potential checks in the Java Class Library; some policies involve multiple checks and some checks do not always dominate the event. It is easy to overlook a security-sensitive event or omit a rare check-event pair in a manual policy. Koved et al. and Sistla et al. reported no bugs for the Sun Java Development Kit (JDK) and Apache Harmony implementations of the Java Class Library. By contrast, our analysis uncovered multiple, exploitable security vulnerabilities in the same code.

Another approach is code-mining, which infers policies from frequently occurring patterns and flags deviations as potential bugs [10, 14, 35]. These techniques fundamentally assume that the same pattern of security checks and security-sensitive operations occurs in many places in the code, and they are thus likely to miss vulnerabilities that violate a unique policy—such as the Harmony bug described in Section 2. Frequent patterns may or may not represent actual policies, resulting in false positives and negatives. Code-mining techniques face an inherent tradeoff between coverage and the number of false positives. As the statistical threshold is lowered to include more patterns, they may find more bugs, but the number of false positives increases since deviations from rare patterns can be mistakenly flagged as bugs.

Cross-implementation consistency as a "security policy oracle." Our key idea is to use any inconsistency in security semantics between implementations of the same API as an oracle for detecting incorrect policies. Unlike approaches that rely on explicit policies or frequent patterns, this technique completely avoids the need to determine whether the policy is correct, replacing it by a much easier task of determining whether two policies are consistent.

Our use of consistency checking is very different from the code-mining approaches. In our case, there are no benign inconsistencies! Any semantic inconsistency between two implementations of the same API is, at the very least, an interoperability bug or, in the worst case, a security hole. In theory, policy differencing has no intrinsic false positives. In practice, imprecision in conservative static analysis results in a very small number of false positives. Furthermore, our technique can discover a missing permissions check even if this type of check occurs in a single place in the code. It may produce false negatives if exactly the same semantic bug occurs in all implementations of a given library routine. This case is unlikely if the implementations are developed independently, as is the case for our main test subject, the Java Class Library.

Finding semantic inconsistencies with static analysis. Our analysis takes as input two or more Java API implementations, security-sensitive events, and security checks. Our definitions of security-sensitive events and checks directly reflect the Java security model. Security-sensitive events include calls to the Java Native Interface (JNI), which is the only mechanism by which Java code communicates to the network, file system, or other external devices. Since user code may change the internal state of the Java Virtual Machine (JVM) using the Java Class Library, we include API returns in the set of security-sensitive events. Application code should not be able to reach security-sensitive events without proper permissions. In the Java security model, security checks verify permissions held by user code by calling methods in the SecurityManager class.

For every API entry point, we take all implementations and, for each one, produce context-sensitive policies using flow- and context-sensitive interprocedural analysis together with interprocedural constant propagation. The policies specify which checks may and must precede each security-sensitive event. We compare the policies for all events relevant to this entry point and report any differences.

Case study: The Java Class Library. We demonstrate the effectiveness of our approach by applying it to the Java Class Library. This prominent, large, widely used API has multiple, independent implementations: Sun JDK, GNU Classpath, Apache Harmony, and several others. For example, JDK was announced in 1996 and is very broadly deployed. It is the product of hundreds of programmers and its source code has been actively studied since Sun published it in 2007. Different implementations of the Java Class Library API are intended to be fully interoperable.

Together, these implementations comprise over 1.7 million lines of code. Any context-sensitive analysis must be concerned with efficiency. Ours uses context memoization [38]. Analyzing each library takes 20 minutes or less, which is likely to be acceptable as part of testing. The three pairings share approximately 4,100 entry points, and over 230 of these methods perform security checks. Our analysis derives between 8,000 and 16,000 may and must security policies per API.

Our analysis is very precise. It produces only 3 false positives, while finding 20 new security vulnerabilities (6 in JDK, 6 in Harmony, 8 in Classpath) and 11 interoperability bugs that manifest in 499 API entry points. Example vulnerabilities include opening network connections and loading libraries without properly checking permissions of the calling code. We reported all vulnerabilities found by our analysis to the developers of the respective libraries, who accepted and immediately fixed many of them [31–33].

In summary, our analysis uncovered multiple security vulnerabilities in three mature implementations of the Java Class Library, with a minimal number of false positives. This paper demonstrates that our approach (i) scales to industrial implementations of real-world software libraries and (ii) is capable of discovering "deep" bugs even in well-studied, tested, widely used code. Although we describe our design and implementation with respect to the Java programming language, our approach is applicable to other languages, such as C#, that use a similar access-rights model.

Current software engineering trends are likely to increase the applicability of comparative API analysis. For example, the proliferation of distributed, multi-layer software services encourages separation of APIs from their implementations, and thus multiple implementations of the same API. Many platforms are adopting the Software-As-A-Service (SaaS) model [1, 16, 20, 29], which encourages multiple implementations of the same functionality and thus provides the input material for our "security policy oracle." Widely used APIs sometimes have an open-source version, which programmers can use as a reference policy oracle even when their own code is proprietary. Even if all implementations of the same API are proprietary, developers may be willing to share security policies with each other without sharing the actual code.

2. Motivating Example

Figure 1 motivates our approach with an example from the JDK and Harmony implementations of the Java Class Library. In Figure 1(a), the JDK implementation of DatagramSocket.connect calls either checkMulticast, or checkConnect and checkAccept, before connecting to the network with a JNI call inside the method invocation at line 16. For clarity of presentation, this and all other examples elide unrelated code and the test for a non-null security manager required by all security checks. In Figure 1(b), the Harmony implementation calls either checkMulticast or checkConnect before the equivalent method invocation at line 11.

Figure 2 illustrates some of the security policies inferred for this code. Each policy includes a security-sensitive event and a (possibly empty) set of security checks. Consider the API-return event for JDK. There is a must and a may policy for this event. The must check is the empty set and the may checks are {{checkMulticast} or {checkConnect, checkAccept}}. The call to checkAccept before the API return and, more importantly, before using JNI to make a network connection as a result of invoking impl.connect, is missing in the Harmony may policy, introducing a vulnerability.

The security-checking pattern in connect is unique. The checkMulticast call is very rare and there is no other place in the JDK library where it occurs together with checkAccept and checkConnect. The "bugs as inconsistencies" methods [14, 35] are not only likely to miss this bug, but may even wrongly flag the JDK implementation because the pattern incorrectly used by Harmony is more common than the correct pattern used by JDK. The rarity of this pattern and the fact that it involves multiple checks mean that a manual policy is likely to miss it, too.

Both Harmony and JDK implement a may policy: there is no single check that dominates all paths to the security-sensitive event. Even if the manual policy happens to include all the right
event-check pairs, analyses that require the same check(s) on every path [7, 22, 28, 30] will produce a warning for both implementa- 1 //JDK tions, which is a false positive for the correct JDK implementation. 2 public void connect(InetAddress address, Our analysis finds this vulnerability using (1) precise, flow- and 3 int port) { context-sensitive interprocedural analysis that computes both may 4 ... connectInternal(address, port); ... and must policies, and (2) differencing of policies from multiple 5 } implementations. 6 private synchronized void connectInternal( InetAddress address, int port) { 7 ... 8 if (address.isMulticastAddress()) { 3. Security Policies 9 securityManager.checkMulticast(address); 10 } else{ This section explains security policies in the access-rights model 11 securityManager.checkConnect(address. in more detail. Section 4 explains how we compute the policies getHostAddress(), port); implemented by a given API entry point and Section 5 describes 12 securityManager.checkAccept(address. our algorithm for comparing policies. getHostAddress(), port); A security policy in the access-rights model is a mapping from 13 } security-sensitive events to one or more security checks for each 14 if (oldImpl) { event. Our analysis takes events and checks as input. A security- 15 connectState = ST_CONNECTED_NO_IMPL; 16 } else{ sensitive event is a program event, such as a write to a file, that 17 ... getImpl().connect(address, port); ... the application should not be able to execute unless it holds cer- 18 } tain rights or privileges. A security check verifies that the appli- 19 connectedAddress = address; cation holds a particular right. The access-rights model is the cor- 20 connectedPort = port; nerstone of Java security. For example, unchecked native method 21 ... calls can give user applications unauthorized network access, while 22 } unchecked returns of internal JVM state, i.e., the value of a pri- (a) JDK implementation of DatagramSocket.connect vate variable, can leak data. The definitions of security checks and events that we use in this paper directly reflect Java’s security 1 // Harmony model. 2 public void connect(InetAddress anAddr, A security policy “maps” an event to a check if the check 3 int aPort) { occurs before the event. Our analysis computes both must and may 4 synchronized (lock) { policies. A must policy means that the check in question must occur 5 ... 6 if (anAddr.isMulticastAddress()) { on every execution path leading to the event. A may policy means 7 securityManager.checkMulticast(anAddr); that the check is predicated on some condition. At first glance, 8 } else{ may policies may appear insecure, but they are often needed to 9 securityManager.checkConnect(anAddr. implement the correct security logic. For example, Figure 2(a) getHostName(), aPort); shows a policy that always performs one or more checks, but the 10 } particular check differs depending on control flow. 11 ... 12 impl.connect(anAddr, aPort); Security checks. The SecurityManager class in Java provides 31 13 ... methods that perform security checks for user code and libraries. 14 address = anAddr; For example, checkPermission() verifies that the calling context 15 port = aPort; holds a particular permission and throws an exception otherwise. 16 ... We restrict our analysis to these methods, although programmers 17 } can define their own, additional security checks. 
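For concreteness, the following sketch (ours, not code from any of the three libraries; the class and native method are hypothetical) shows the check-before-event shape that the analysis reasons about, using the real SecurityManager.checkConnect method to guard a JNI call:

    import java.net.InetAddress;

    // Hypothetical library class: the native method stands in for a
    // security-sensitive event (a JNI call), and checkConnect is the
    // SecurityManager check that precedes it on this path.
    class ExampleChannel {
        private native void nativeConnect(String host, int port);

        public void open(InetAddress addr, int port) {
            SecurityManager sm = System.getSecurityManager();
            if (sm != null) {  // non-null test required before every check
                sm.checkConnect(addr.getHostAddress(), port);
            }
            // The JNI call below is reachable only after the check above,
            // so {checkConnect} is part of the policy for this event.
            nativeConnect(addr.getHostAddress(), port);
        }
    }

In this terminology, the native call is the security-sensitive event and checkConnect is the check that the inferred policy associates with it.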
Our analysis keeps 18 } track of which of the 31 security checks is invoked at any given (b) Harmony implementation of DatagramSocket.connect point. For example, it differentiates between checkPermission() and checkConnect(). Figure 1: Security vulnerability in Harmony: checkAccept is Our analysis does not ensure that the parameters to security missing. The correct security policy is unique to this method. checks are the same as the ones used by the security event. This imprecision could be a source of false negatives. Security-sensitive events. The Java Native Interface (JNI) defines MUST check: {} all interactions with the outside environment for Java programs, Event: API return from DatagramSocket.connect e.g., opening files, connecting to the network, and writing to the console. We therefore define all calls to native methods as security- MAY check: {{checkMulticast},{checkConnect, checkAccept}} sensitive events. Event: API return from DatagramSocket.connect In addition, we consider all API returns to be security-sensitive events. If a method performs security checks but does not use the (b) JDK DatagramSocket.connect security policies JNI, these checks are thus included in its security policy. Such MUST check: {} checks are used in API implementations that give user code ac- Event: API return from DatagramSocket.connect cess to internal JVM state or private variables, or store parame- ter values for later use. These accesses should be secured consis- MAY check: {{checkMulticast},{checkConnect}} tently in all implementations of an API because they reveal internal Event: API return from DatagramSocket.connect JVM or library state to untrusted applications and/or enable ap- (b) Harmony DatagramSocket.connect security policies plications to modify this private state. By including API returns, we broaden the definition of security-sensitive events as compared Figure 2: Example security policies to prior work [22, 28, 30]. This broader definition helps us find more vulnerabilities. For example, Figure 6 shows and Section 6.2 and API returns results in a {checkRead} may policy for the API- 1 // Implementation1 return event in both implementations. If we instead consider the 2 public Obj A(Obj obj) { 3 if (condition) { reads of private variables data1 and data2 as distinct security- 4 checkRead(); sensitive events, the analysis infers (1) a {checkRead} must policy 5 obj.add(data1); for the read of data1 in the first implementation and (2) an empty 6 return obj; must policy for the same event in the second implementation. The 7 } else{ analysis will thus report an inconsistency. 8 return null; 9 } 10 checkRead(); 11 obj.add(data2); 4. Computing Security Policies 12 // Private data1 and data2 returned in obj Given an API and multiple implementations thereof, our approach 13 return obj; extracts and compares security policies realized by the implemen- 14 } 15 tations. The analysis computes the security checks that each API 16 // Implementation2 always performs (must analysis) or may perform (may analysis) 17 public Obj A(Obj obj, Data data1, Data data2) { before each security-sensitive event. The analysis uses a flow- and 18 if (condition) { context-sensitive interprocedural algorithm enhanced with flow- 19 obj.add(data1); and context-sensitive forward constant propagation [17, 36]. Any 20 return obj; context-sensitive analysis must be concerned with scalability. To 21 } else{ reduce analysis time and guarantee convergence, we do not iterate 22 return null; over recursive call paths. 
We further eliminate useless re-analysis 23 } with a form of memoization [38] that records security policies and 24 checkRead(); 25 obj.add(data2); relevant parameters at each method invocation. 26 // Private data1 and data2 returned in obj We implement the analysis in the Soot static analysis frame- 1 27 return obj; work [25] and use Soot’s method resolution analysis, alias analy- 28 } sis, and intraprocedural constant propagation. Call graph. The analysis starts by building call graphs rooted at all Figure 3: Hypothetical bug showing the need for a broad definition public and protected API entry points, and derives separate policies of security-sensitive events. for each API entry point. Protected methods are included because API clients can potentially call them by overriding the correspond- ing class. These methods thus represent unintended paths into the API and are important to analyze. In theory, any static analysis is describes a vulnerability that we detect because our definition of incomplete in the presence of dynamic class loading since the code security-sensitive events is not limited to JNI calls only. is not available until run time. In practice, the Java Class Library Broader definition of security-sensitive events. Our analysis can and many other APIs are closed-world and do not depend on dy- further broaden the definition of security-sensitive events. In addi- namic class loading. tion to JNI calls and API returns, we experimented with including Soot computes application call graphs, but its analysis is en- individual accesses to private variables, JVM state, and API param- gineered for applications with a single entry point. Because APIs eters. In particular, security-sensitive events could include all reads, have many entry points, we build the call graph on the fly, using writes, and method invocations on API parameters and private vari- Soot’s method resolution analysis which resolves 97% of method ables, as well as reads, writes, and method invocations on variables calls in the Java libraries. If Soot does not resolve a method in- that are data-dependent on API parameters and private variables. vocation, our implementation does not analyze it. This inaccuracy For this definition of security-sensitive events, we computed could be a source of false negatives or positives, but did not produce data dependencies using a simple interprocedural dataflow analy- false positives in our evaluation. As shown in prior work, type- sis that propagated an event tag. It marked all statements in the resolving events, such as allocation, make simple type hierarchy definition-use chains involving private variables and API parame- analysis very effective at resolving method invocations [11, 34]. ters, and propagated the mark interprocedurally through parameter The Java libraries’ coding conventions—for example, the use of fi- binding in method invocations. If there were multiple instances of nal methods—further improve the precision of method resolution. the same event type, e.g., two returns or two accesses to the same Analysis overview. Computing security policies is essentially a parameter, we combined the corresponding policies. This defini- reaching definitions analysis where the definitions are security tion of security-sensitive events is very liberal and marks many checks and the uses are security-sensitive events. The dataflow more events as sensitive. It generates over 90,000 security policies lattice is the power set of the 31 security-checking methods. 
The for each Java Library implementation, whereas restricting security- analysis propagates checks (definitions) to events (uses). After con- sensitive events to JNI calls and API returns results in 16,700 or verging, the resulting security policy is a mapping from event state- fewer policies for each implementation. ments to the checks that reach them. This broad definition of security-sensitive events did not result We perform interprocedural and context-sensitive analysis, in finding more bugs, nor generating more false positives during propagate constants across method calls, and eliminate the result- our analysis of the Java Class Library, but it helped us diagnose ing dead code, for the following reasons. It is typical for an API the cause of differences in the implementations’ respective security method to first call a method that performs a security check and policies. For other APIs, this broad definition may more accurately then call another method that contains a security-sensitive event. capture the semantics of library implementations and thus find er- It is also common to perform the security check(s) conditionally, rors missed with a narrower definition of security-sensitive events. based on the value of a parameter (see Section 4.2 for an example). For example, this broad definition is needed to detect the in- consistency between implementations in the hypothetical example shown in Figure 3. Restricting security-sensitive events to JNI calls 1 http://www.sable.mcgill.ca/soot/ Algorithm 1 Intraprocedural Security Policy Dataflow Analysis false negatives and positives. ISPA performs interprocedural con- procedure SPDA stant propagation together with the security analysis, tracking con- {Initialize dataflow values} stant parameter values (described below). The analysis binds any for all vAllStatements do current constant dataflow values to the parameters and uses them if MUST then together with the current policy as the initial state to analyze the OUT(v) ← ⊥ target method. else {MAY} This analysis does not scale because it analyzes every statically OUT(v) ← > possible calling context separately. In practice, however, many of end if the method’s calling contexts have the same dataflow values. We NOTVISITED ← NOTVISITED ∪ v eliminate redundant work via the following memoization scheme. end for The first time our analysis encounters a call to some method, it {Propagate dataflow values} analyzes the method, passing in the initial policy and any constant worklist ← EntryPoint parameters. If analyzing the method changes the policy, we store while worklist 6= φ do the policy and the constant parameters. Otherwise, we simply store v ← getNode(worklist) the initial policy. When the analysis encounters the same method in if MUST then a different calling context but with the same policy flow values, it IN(v) ← T OUT(p) reuses the stored policy and avoids re-analyzing the method. p∈PRED(v) We use ISPA to analyze each API entry point. The algorithm else {MAY} takes as input a method m and the current analysis state, which IN(v) ← S OUT(p) p∈PRED(v) consists of the current security policy SP and any known constant end if parameters paramConsts to m. The algorithm is mutually recursive OUT(v) ← IN(v) ∪ SP(v) with SPDA (Algorithm 1), which invokes ISPA at method invoca- if OUT(v) changed ∧ v ∈ NOTVISITED then tions with the current values of SP and paramConsts. 
Thus, the worklist ← worklist ∪ SUCC(v) computation of OUT(v) in the while loop requires a change—a re- NOTVISITED ← NOTVISITED − v cursive call to gather SP(v), if v is a resolved method invocation. end if SPDA uses IN(v) as the initial flow value of the method’s entry end while basic block and seeds constant propagation with the incoming con- stant parameters (both not shown). Algorithm 2 Interprocedural Security Policy Analysis Constant propagation. For better precision, our analysis propa- gates constants intraprocedurally and interprocedurally. The secu- procedure ISPA(m, SP, paramConsts) rity policy from Harmony in Figure 4 motivates this analysis. It if hashTable.hasKey(hm, SP, paramConstsi) then shows a common pattern in which a parameter determines whether {Same analysis state, so return hashed result} a security check is performed. This code uses two URL constructors. return hashTable.lookup(hm, SP, paramConstsi) The first constructor always passes null as the handler parameter else to the second constructor, which—correctly—performs no security {Compute result for this analysis state} checks. In other contexts, however, the second constructor performs newSP ← SPDA(m, SP, paramConsts) one security check (line 7). Interprocedural constant propagation is hashTable.put(hm, SP, paramConstsi, newSP) required to accurately differentiate these two contexts and prevent return newSP false positives. end if Soot supports Wegman-Zadeck intraprocedural constant propa- gation [36]. Soot’s algorithm propagates integer and boolean con- stants and null reference assignments into conditional statements 4.1 Intraprocedural analysis and eliminates unexecutable statements. We extend the analysis For ease of presentation, we first describe the intraprocedural com- at method invocations to pass constant parameters to the target ponent of the analysis. Algorithm 1 shows our intraprocedural se- method. Because constant parameters are an essential part of the curity policy dataflow analysis (SPDA), which propagates secu- analysis state, our memoization scheme includes them as dataflow rity checks. The MUST and MAY analyses differ in (i) the meet values: both the policy SP and the constant parameters paramCon- function, performing intersection for MUST and union for MAY, sts must match in order to reuse previous results. and (ii) the initial flow values, which are ⊥ for MUST and > for Convergence. SPDA and ISPA are guaranteed to converge be- MAY. SPDA assigns a unique identifier to each of the 31 security- cause they are monotone and distributive. Constant propagation is checking methods. It initializes OUT(v) of each statement that per- a monotone forward analysis. Intraprocedural constant propagation forms a check to the corresponding identifier, i.e., OUT(v) = SP = is guaranteed to converge on the structured control-flow graphs in {check}. For all other statements, it initializes SP = ∅. Java, but because the call graph is recursive and not structured, Because SPDA is a reaching definition analysis and uses a context-sensitive interprocedural constant propagation is not guar- lattice that is a powerset, it is rapid, converging in two passes with anteed to converge. For example, if a call passes in a constant pa- structured control flow [27]. rameter that the callee increments until it reaches some limit, each 4.2 Interprocedural, context-sensitive analysis context would be considered unique. 
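A minimal sketch of this memoization scheme, under simplified types of our own choosing (a policy as a set of check names, constant parameters as a map from parameter index to value; the names are ours, not the implementation's):

    import java.util.*;
    import java.util.function.Supplier;

    // Sketch of the memo table used by ISPA: a stored result is reused only
    // when the target method, the incoming security policy, and the known
    // constant parameters all match the earlier analysis state.
    final class PolicyMemo {
        // (method signature, incoming policy, constant parameters) -> resulting policy
        private final Map<List<Object>, Set<String>> cache = new HashMap<>();

        Set<String> analyze(String method, Set<String> inPolicy,
                            Map<Integer, Object> consts,
                            Supplier<Set<String>> runSPDA) {
            List<Object> key = Arrays.<Object>asList(
                method, new HashSet<>(inPolicy), new HashMap<>(consts));
            Set<String> cached = cache.get(key);
            if (cached != null) {
                return cached;               // same analysis state: reuse stored policy
            }
            Set<String> result = runSPDA.get(); // analyze the method body (SPDA)
            cache.put(key, result);
            return result;
        }
    }
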
In a context-sensitive analysis, constant propagation can produce an unbounded number of unique This section explains how we extend the intraprocedural dataflow constant parameters due to method recursion. We prevent this case analysis to make it interprocedural and context sensitive. ISPA, our by terminating early on recursive methods. In our implementation, interprocedural security policy analysis, is shown in Algorithm 2. if the analysis encounters a recursive call, i.e., a call to a method When ISPA encounters a statement v that contains a method in- that is already on the call stack, it does not re-analyze the method. vocation, it invokes the analysis recursively on the target method. An alternative implementation could instead bound the number of As mentioned above, our implementation ignores method invoca- traversals of a recursive subgraph. tions that Soot cannot resolve to a unique target, which may lead to 3. Otherwise, the comparison analysis matches events that occur 1 // Harmony in both implementations. We ignore events unique to one im- 2 public URL(String spec) { 3 this((URL) null, spec, plementation. Matched events are compared as follows: 4 (URLStreamHandler) null); (a) If the two implementations have different sets of security 5 } checks for the same event, the comparison analysis reports 6 public URL(URL context, String spec 7 URLStreamHandler handler) { an error. 8 if (handler != null){ (b) If the two implementations have the same security checks, 9 securityManager.checkPermission( but at least one check is may in one implementation and specifyStreamHandlerPermission); must in the other, the comparison analysis reports an error. 10 strmHandler = handler; 11 } Case 2 is responsible for most of the security vulnerabilities and in- 12 ... protocol = newProtocol; ... teroperability bugs we found. Case 3(a) produced one vulnerability, 13 } one interoperability bug, and one false positive. Case 3(b) produced one interoperability bug. Figure 4: A context-sensitive may policy. Note that deriving the precise policy associated with the first entry point requires propa- 6. Evaluation gating the null constant into the handler parameter. We evaluate our approach by applying it to recent versions of three mature implementations of the standard Java Class Library. The li- braries perform two critical functions in Java. First, they make Java Scalability. Making context-sensitive analysis efficient is diffi- portable by providing an abstract interface to I/O, user interfaces, cult in general [18]. Our memoization is similar to Wilson and file systems, the network, , and all other machine- Lam [38], who record context-sensitive alias sets to avoid redun- dependent functionality. Second, they provide standard, optimized dant analysis. As described above, we memoize context-sensitive implementations of widely used data structures and abstractions, security policies and any relevant constant parameters. Unlike such as sets and queues. All Java programs rely on the security and context-sensitive, whole-program pointer analyses that must store correctness of these libraries. large sets, our analysis deals with a relatively small number of se- We analyze three implementations of the Java Class Library, curity checks which must or may precede security-sensitive events. listed below. 
For each implementation, we analyze the main pack- Even though our analysis essentially explores every call path, ages: java.io, java.lang, java.nio, java.security, java.text, java.util, it performs well in practice. Section 6 shows that reusing dataflow javax.crypto, javax.net, javax.security, and java.net. analysis values for identical incoming flow values improves per- formance significantly. Other properties of our analysis, such as 1. JDK: Sun JDK, version 1.6.0 07, first released in 1996. not analyzing recursive calls and ignoring method invocations that 2. Classpath: GNU Classpath, version 0.97.2, started in 1998. Soot cannot resolve to a unique target, improve scalability as well. 3. Harmony: Apache Harmony libraries, version 1.5.0, svn revi- We find that calls to a method in the same or other contexts often sion r761593, started in 2005. have the same incoming flow values. Nonetheless, context sensitiv- ity, which differentiates contexts that have different flow values, is Note that two of the implementations are well over 10 years old, key to the precision of our approach. yet our approach still found security errors in them. Together, these libraries total about 2.5M lines of code. Table 1 summarizes the non-comment lines of code, API entry 5. Comparing Security Policies points, and characteristics of security policies for each library. We Given two implementations of the same API entry point, we com- analyze all public and protected methods because applications can pute the policies realized by each implementation using the analysis invoke them either directly or via a derived class. Because of this, described in the previous section. Our analysis combines distinct the number of API entry points varies from implementation to occurrences of calls to the same JNI routine and API returns. For implementation. example, if a method calls the same JNI routine three times with We analyze the entire call graph rooted at each of these API different policies, we combine all three must and may policies. We entry points. The third row in Table 1 shows that only a small perform intersection to combine must policies and union to com- subset of methods performs security checks. The analysis computes bine may policies. Although this step causes a loss of precision for between 4,208 and 9,580 must and may security policies for each must policies in particular and thus may lead to false positives, we implementation. We only compare policies for the API entry points did not find it to be a problem in practice. This step does not in- that are identical in two implementations—over 4,100 entry points troduce false negatives when distinct instances of the same JNI call for each pair—shown in the first row of Table 3. The sheer volume use different must policies, but may introduce false negatives when of policies demonstrates that any approach that relies on developers combining may policies. A more precise analysis could try to order to manually examine inferred policies to detect errors is unlikely to and align each instance of a security-sensitive event, but we did not succeed. find this necessary. Analysis time. Table 2 shows the time in minutes to compute must After combining security-sensitive events in each API, the anal- and may policies for each of the three libraries. For both may ysis then compares two policies as follows: and must policies, the first row shows the analysis time without memoization of method summaries (Section 4). The second and 1. 
If neither implementation has any security policies, or both im- third rows show time with memoization. In the second row, method plementations have identical security policies, the comparison summaries are reused only within the same API entry point. In the analysis reports no error. third row, they are reused across the entire library. 2. If one implementation has no security policy, but the other im- Reuse of summaries within the same entry point yields a factor plementation has one or more security policies, the comparison of 1.5 to 13 improvement and reuse across the entire library yields analysis reports an error. an additional factor of 3 to 18, resulting in the overall factor of 15 to JDK Harmony Classpath JDK Harmony Classpath Non-comment lines of code 632K 572K 563K MAY No summaries 300 190 340 Entry points 6,008 5,835 4,563 Summaries (per entry point) 180 130 190 Entry points w/ security checks 239 262 250 Summaries (global) 10 13 20 may security policies 9,580 7,126 4,652 MUST No summaries 560 290 650 must security policies 7,181 6,757 4,208 Summaries (per entry point) 50 40 50 Summaries (global) 10 12 10

Table 1: Library characteristics
Table 2: Analysis time in minutes

                                     Classpath v Harmony   JDK v Harmony    JDK v Classpath
Matching APIs                        4,161                 4,449            4,758
False positives eliminated by ICP    4 (63)                4 (35)           4 (74)
False positives                      3 (3)                 3 (3)            0 (0)
Root cause of policy difference
  Intraprocedural                    1 (1)                 5 (6)            2 (3)
  Interprocedural                    14 (140)              13 (43)          16 (300)
  MUST/MAY difference                0 (0)                 1 (5)            0 (0)
Total differences                    15 (142)              19 (54)          18 (303)
Total interoperability bugs          3 (115)               9 (39)           5 (222)
Security vulnerabilities in          Classpath 5 (12)      JDK 1 (2)        JDK 5 (21)
                                     Harmony 4 (11)        Harmony 6 (10)   Classpath 8 (60)
Total security vulnerabilities       JDK 6 (23)            Harmony 6 (11)   Classpath 8 (61)

The table reports distinct errors with manifestations in parentheses: distinct (manifestations).

Table 3: Security vulnerabilities and interoperability errors detected by security policy differencing analysis
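The differencing that produces Table 3 compares, for each matching entry point, the per-event must and may check sets derived from the two implementations, following the rules of Section 5. A minimal sketch of that comparison, under simplified set-based types of our own (not the implementation's data structures):

    import java.util.*;

    // Sketch of the pairwise policy comparison: for each matched event,
    // report a difference if the check sets differ, or if checks exist in
    // only one implementation. Events unique to one implementation are ignored.
    final class PolicyDiff {
        static final class Policy {
            Set<String> must = new HashSet<>();        // checks on every path
            Set<Set<String>> may = new HashSet<>();    // alternative check sets
        }

        static List<String> compare(Map<String, Policy> impl1, Map<String, Policy> impl2) {
            List<String> reports = new ArrayList<>();
            for (String event : impl1.keySet()) {
                Policy p1 = impl1.get(event);
                Policy p2 = impl2.get(event);
                if (p2 == null) continue;              // event unique to one implementation
                boolean oneEmpty = p1.must.isEmpty() && p1.may.isEmpty();
                boolean twoEmpty = p2.must.isEmpty() && p2.may.isEmpty();
                if (oneEmpty != twoEmpty) {
                    reports.add(event + ": security checks present in only one implementation");
                } else if (!p1.must.equals(p2.must) || !p1.may.equals(p2.may)) {
                    reports.add(event + ": security checks differ (including must vs. may mismatches)");
                }
            }
            return reports;
        }
    }
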

65 improvement in performance due to memoization. Although our To reduce the number of reports the developer must read, our analysis is still not blazingly fast, it is intended to be used relatively analysis automatically combines reports when the error stems from infrequently, as part of correctness and interoperability testing. the same root cause, i.e., when the method containing the error is called from multiple API entry points. The number of entry points 6.1 Analysis results (manifestations) that can exploit the error is shown in parentheses. We manually examined all root causes to determine the responsible We categorize the results of our analysis as follows: library implementation and classify the bug. The ICP row shows that interprocedural constant propagation Vulnerability: A semantic difference that can be exploited to per- eliminates 4 false positives that have over 70 manifestations, re- form some security-sensitive action without permission. sulting in the overall false-positive rate of less than 1%: 3 of 499 Interoperability bug: A semantic difference that causes interop- manifestations. The Intraprocedural, Interprocedural, and MUST/- erability problems. These differences do not, on the surface, en- MAY rows show that every component of our analysis contributes able applications to perform security-sensitive actions without to finding errors. Intraprocedural analysis, which only computes permission, but could be part of a multi-stage attack. policies local to a method, would miss the majority of the errors. False positive: Policies are identical, but a difference is mistakenly Differences due to a must policy in one implementation and a may reported due to imprecision of our analysis. policy in the other revealed one bug. Not shown in a dedicated row in the table is the small number of vulnerabilities revealed by dif- False negative: A security vulnerability not reported by our anal- ferences between two may policies, including the one in Figure 1. ysis. False negatives may arise if identical may policies apply Finding most errors requires context-sensitive interprocedural anal- under different conditions, or if two policies are identical but ysis. incorrect. Although these cases seem unlikely, their frequency Overall, our analysis found 20 security vulnerabilities and 11 is hard to quantify. interoperability bugs across all three library implementations, not just in the least mature one. These errors witness how difficult The remainder of this section surveys the results of security-policy it is for programmers to get access-rights policies correct. For differencing and how different algorithmic features of our analysis example, even though more than a hundred developers worked on contribute to the results. Sections 6.2 and 6.3 discuss the security the JDK implementation for over 15 years, it is still not error-free. vulnerabilities and interoperability errors in detail and include ex- We reported all vulnerabilities to the respective implementors, who amples of each. Section 6.4 explains in more detail why our analy- recognized all of them as bugs and fixed some of them [31–33]. sis can have false positives and false negatives. Table 3 shows the results of our analysis. We compare each implementation to the other two. 
The analysis finds most errors in both comparisons, but because some entry points differ between 6.2 Security vulnerabilities implementations, each pairwise comparison finds a few unique This section explains a few of the security vulnerabilities uncov- errors. ered by our analysis. To demonstrate the power of implementation 1 //JDK 1 // Harmony 2 class Runtime { 2 public URLConnection openConnection( 3 public void loadLibrary(String libname) { 3 Proxy proxy) throws IOException { 4 loadLibrary0(System.getCallerClass(), 4 ... 5 libname ); 5 return 6 return; 6 strmHandler.openConnection(this, proxy); 7 } 7 } 8 synchronized void loadLibrary0( 9 Class fromClass, String libname) { (a) Harmony implementation of URLConnection.openConnection 10 ... securityManager.checkLink(libname); ... 1 //JDK 11 ClassLoader.loadLibrary(fromClass, libname, 2 public URLConnection openConnection( 12 false); 3 Proxy proxy) { 13 } 4 ... 14 } 5 if (proxy.type() != Proxy.Type.DIRECT) { 15 6 InetSocketAddress epoint = 16 class ClassLoader { 7 (InetSocketAddress) proxy.address(); 17 static void loadLibrary(Class fromClass, 8 if (epoint.isUnresolved()) { String name, boolean isAbsolute) { 9 securityManager.checkConnect(epoint. 18 ... loadLibrary0(fromClass, libfile); ... getHostName(), epoint.getPort()); 19 } 10 } else{ 20 private static boolean loadLibrary0( 11 securityManager.checkConnect( 21 Class fromClass, final File file) { 12 epoint.getAddress().getHostAddress(), 22 ... 13 epoint.getPort()); 23 NativeLibrary lib = 14 } 24 new NativeLibrary(fromClass, name); 15 } 25 lib.load(name); 16 return handler.openConnection(this, proxy); 26 ... 17 } 27 } 28 } (b) JDK implementation of URLConnection.openConnection (a) JDK implementation of Runtime.loadLibrary Figure 6: Security vulnerability: Harmony is missing check- 1 // Classpath Connect that JDK performs before opening a network connection. 2 class Runtime { 3 public void loadLibrary(String libname) { 4 loadLibrary(libname, VMStackWalker. getCallingClassLoader()); are semantic no-ops and our analysis correctly ignores them. It 5 return; seems especially difficult for developers to detect this kind of error 6 } through manual inspection because a call to the security-checking 7 void loadLibrary(String libname, ClassLoader method is actually present in the source code. loader ) { Figure 6 shows one of the six vulnerabilities in Harmony, de- 8 ... securityManager.checkLink(libname); ... tected when our analysis compared it to JDK. Finding the vulnera- 9 ... loadLib(filename, loader); ... bility in OpenConnection requires using API returns as security- 10 } sensitive events because OpenConnection does not actually per- 11 private static int loadLib(String filename, ClassLoader loader) { form network reads and writes with JNI calls. The user must subse- 12 ... quently call getInputStream() or getOutputStream() to read 13 securityManager.checkRead(filename); or write to the network. In Harmony, OpenConnection calls a 14 ... method on the private strmHandler variable and returns internal 15 return VMRuntime.nativeLoad(filename, API state to the application without any checks. By contrast, the loader ); JDK implementation has a may policy that calls checkConnect 16 } before returning internal API state to the application. 17 } Figure 7 shows one of the eight vulnerabilities in Class- (b) Classpath implementation of Runtime.loadLibrary path. Classpath omits all security checks in the Socket.connect method, whereas JDK always calls checkConnect. 
This error Figure 5: Security vulnerability: JDK is missing checkRead that seems simple to spot, but the method is called in many contexts, Classpath performs before loading a library at run time. some of which do perform checks. Since this method is directly accessible by an application, this vulnerability is easy to exploit and has now been fixed. differencing, each vulnerability is accompanied by a correct imple- mentation from another library. 6.3 Interoperability bugs Figure 5 shows one of the six vulnerabilities in JDK. The Some interoperability problems arise because Classpath performs JDK code returns from Runtime.loadLibrary having called much more dynamic class loading than the other implementations. only checkLink on a dynamically loaded library. By contrast, the For example, it dynamically loads the CharsetProvider class, Classpath implementation calls both checkLink and checkRead. whereas JDK statically loads it at boot time. The reason may Detecting this vulnerability requires interprocedural analysis. be that Classpath is trying to reduce the size of its JVM when The other five JDK vulnerabilities are detected when our anal- running on an embedded platform. Because of this difference, ysis compares JDK with Classpath. They arise because JDK per- Classpath contains code that performs checkPermission(new forms some security checks inside a privileged block. Security RuntimePermission("charsetProvider")), whereas JDK and checks inside a privileged block always succeed. Therefore, they Harmony do not. Interoperability problems also arise due to additional function- ality in one of the implementations. Figure 8 shows an exam- 1 //JDK ple from String.getBytes. If the default “ISO-8859-1” char- 2 class Socket { acter set decoder is not present, JDK terminates the application 3 public void connect(SocketAddress, int){ by calling System.exit(), whereas forName in Harmony throws 4 ... 5 securityManager.checkConnect(...); UnsupportedEncodingException. To perform System.exit(), 6 ... the application needs checkExit permission which is not needed 7 impl.connect(...); in the Harmony implementation. 8 } 9 } 6.4 False positives and negatives (a) JDK implementation of Socket.connect The main reason for false positives is inherent to any static anal- ysis—it is conservative because it includes all possible program 1 // Classpath paths and not just the actual paths taken during code execution. 2 class Socket { In practice, our analysis produced only three false positives when 3 public void connect(SocketAddress, int){ 4 ... analyzing the three implementations of the Java Class Library, 5 getImpl().connect(endpoint, timeout); all of them due to questionable coding practices in the Harmony 6 } implementation. For example, in java.security.Security. 7 } getProperty(String), JDK uses checkPermission(), where- as Harmony uses checkSecurityAccess(). There is a mis- (b) Classpath implementation of Socket.connect match between required permissions, but both checks achieve the same goal. In this example, both implementations should have Figure 7: Security vulnerability: Classpath is missing check- used checkPropertyAccess(). Similarly, Harmony unnecessar- Connect that JDK performs before opening a network connection. ily uses checkConnect() to check address reachability inside the getInetAddresses() method, whereas JDK simply returns the result of InetAddrImpl.isReachable(). There are two causes of false negatives. First, two libraries may

1 //JDK both implement the security policy incorrectly and in the same way. 2 class String { The second cause is imprecision of our analysis. For example, our 3 public byte[] getBytes() { analysis is not field or variable sensitive. If the library sends the 4 return wrong parameter to the security-checking method, or if it checks 5 StringCoding.encode(value, offset, count); one variable and then makes a JNI call on another, our analysis will 6 } not report an error. Furthermore, our analysis does not determine 7 } the potential targets of unresolved method invocations for incom- 8 plete call graphs. Since 97% of method invocations were resolved, 9 class StringCoding { we did not perform additional analysis, and this inaccuracy may 10 static byte[] encode(...) { 11 try{ result in false negatives. Finally, our comparison of may policies 12 return encode("ISO-8859-1", ca, off, len); does not consider the conditions under which the checks are ex- 13 } catch (UnsupportedEncodingException x) { ecuted. Changing our analysis to report these conditions is easy, 14 System.exit(1); but would result in an overwhelming number of reports. Verifying 15 return null; whether the conditions for two may policies are equivalent is diffi- 16 } cult. We examined some of the reports by hand and did not find any 17 } false negatives. A quantitative evaluation of false negatives would 18 } require an enormous amount of time and expertise from the devel- (a) JDK implementation of String.getBytes opers.

1 //Harmony 2 class String { 7. Related Work 3 public byte[] getBytes() { This section describes the closest related work on static analysis for 4 ByteBuffer buffer = finding security errors and other bugs, as well as program differenc- 5 defaultCharset().encode(...); 6 ... ing. None of the prior work exploits multiple API implementations 7 } to automatically derive correctness criteria for security policies. 8 private Charset defaultCharset() { 9 if (DefaultCharset == null){ 7.1 Static analysis and model checking 10 DefaultCharset = The closest related static analysis techniques for verifying security 11 Charset.forName("ISO-8859-1"); 12 } mediation in Java code are by Koved et al. [22, 28] and Sistla et 13 ... al. [30]. Koved et al. take security checks as inputs and use a flow- 14 } and context-sensitive interprocedural must analysis to compute, for 15 } each point in the program, the set of checks performed prior to reaching it. The programmer must manually identify all security- (b) Harmony implementation of String.getBytes sensitive operations and verify whether the computed checks are sufficient. Our must polices are similar, but our policy differencing Figure 8: Interoperability bug: JDK requires checkExit permis- eliminates the error-prone, manual verification. sion to call System.exit(), whereas Harmony throws an excep- Sistla et al. take as input a manual policy specified as pairs of tion. security checks (calls to the security manager) and security events (only JNI calls in their model). They use flow- and context-sensitive interprocedural must analysis to find “bad” paths that reach a secu- experience with the Java libraries, both flow and context sensitivity rity event without performing the corresponding check. The MOPS are essential for precision. project [7] applied a similar, but flow-insensitive, approach to C Engler et al. statically extract intraprocedural patterns of correct programs. behavior, including some security policies, and use them to detect As our analysis shows, correct security enforcement sometimes anomalies [14]. Their policy descriptions include “a call to A al- requires multiple checks. Furthermore, the check(s) may not dom- ways or sometimes dominates a call to B” and are thus similar to inate the event (e.g., see Figure 1), necessitating may analysis. In ours. However, these local patterns are not interprocedural—they addition to native calls, the set of security events should also in- do not include flow- and context-sensitive policies in which the clude at least API returns. The final and most significant deficiency security check occurs in a different method than the operation it of these prior approaches is that they provide no mechanism for protects. Furthermore, the approach of Engler et al. ignores rare determining whether the policy used for verification is correct and patterns. Dillig et al. find bugs by detecting static inconsistencies complete. in the program [10]. For example, they detect if the program con- These deficiencies are significant in practice. Sistla et al. ana- tains both “if (x==NULL) foo.m();” and “*x=y;” which in- lyzed the JDK and Harmony libraries, while Koved et al. analyzed dicate both that x may be null and that x is definitely non-null. JDK—but neither paper reports any actual bugs. By using differ- This analysis finds local contradictions rather than semantic differ- ences between API implementations as an oracle for detecting in- ences. It captures neither rare events, nor interprocedural patterns. 
correct policies, we found multiple, confirmed security vulnerabil- Both approaches will miss many vulnerabilities in Java code, where ities in JDK, Harmony, and Classpath, many of which were missed security-sensitive events and the corresponding checks often occur by prior analyses. Furthermore, our analysis produces precise se- in different methods. curity policies specific to concrete security-sensitive events. Another approach to mining specification is to observe policies There is an enormous amount of research on model checking at run time. For example, Ammons et al. observe dynamic traces of that aims to prove programs correct using precise specifications [4– method call sequences and use machine learning to create API us- 6, 8, 13]. In general, model checking does not yet scale to inter- age specifications in the form of finite-state machines [2]. Dynamic procedural security analysis of large programs, and rigorous spec- mining approaches are at a disadvantage when policies are unique ification of security policies and vulnerability patterns has proven and/or rarely or never exercised by the test inputs. elusive so far. Kim et al. use a of previously fixed bugs to find their re- Whaley et al. extract finite-state models of the interface of a currences in new code [21]. Hovemeyer and Pugh use programmer- class and its accesses to common data structures [37]. Security defined bug patterns to find their occurrences [19]. To avoid false policies do not always follow their model. For instance, consider positives, these tools require well-defined bug patterns that apply the following simple example: to all contexts. These types of patterns are useful for describing coding conventions, but generally do not capture context-specific 1 securityManager.checkPermission(); semantics. Because many security policies are context dependent 2 doSensitiveOperation(); and unique, these techniques cannot classify them as correct or in- correct. Their analysis cannot detect that doSensitiveOperation() Our work is also distantly related to program differencing. We should be dominated by checkPermission() because there is find inconsistencies between implementations of the same API us- no shared between the two methods. Furthermore, ing identical API entry points, but do not assume that the imple- their analysis is not context sensitive. Our approach could be con- mentations are identical or even very similar. In our experience, dif- figured to use arbitrary data structure accesses as security-sensitive ferent implementations often choose different algorithms and data events, but we did not find it necessary. structures. Techniques for finding method clones—methods that Ganapathy et al. use concept analysis to extract security- implement semantically similar functionality within the same pro- sensitive events [15]. Their approach is complementary to ours, gram [3, 12, 23, 24]—are thus unlikely to classify two methods that since our analysis could take as input the security-sensitive events implement the same API as clones. However, it might be possible they generate. to leverage clone detection by first finding clones, then extracting 7.2 “Bugs as inconsistencies” and comparing their security policies. We leave this exploration to future work. Mining programs for static and dynamic models of correct behavior In summary, our interprocedural, flow-sensitive and context- has become a popular approach for finding bugs [2, 10, 14, 35]. In sensitive analysis is more precise than prior approaches. 
As our analysis shows, correct security enforcement sometimes requires multiple checks. Furthermore, the check(s) may not dominate the event (e.g., see Figure 1), necessitating may analysis. In addition to native calls, the set of security events should also include at least API returns. The final and most significant deficiency of these prior approaches is that they provide no mechanism for determining whether the policy used for verification is correct and complete.

These deficiencies are significant in practice. Sistla et al. analyzed the JDK and Harmony libraries, while Koved et al. analyzed JDK—but neither paper reports any actual bugs. By using differences between API implementations as an oracle for detecting incorrect policies, we found multiple, confirmed security vulnerabilities in JDK, Harmony, and Classpath, many of which were missed by prior analyses. Furthermore, our analysis produces precise security policies specific to concrete security-sensitive events.

There is an enormous amount of research on model checking that aims to prove programs correct using precise specifications [4-6, 8, 13]. In general, model checking does not yet scale to interprocedural security analysis of large programs, and rigorous specification of security policies and vulnerability patterns has proven elusive so far.

Whaley et al. extract finite-state models of the interface of a class and its accesses to common data structures [37]. Security policies do not always follow their model. For instance, consider the following simple example:

    securityManager.checkPermission();
    doSensitiveOperation();

Their analysis cannot detect that doSensitiveOperation() should be dominated by checkPermission() because there is no shared data structure between the two methods. Furthermore, their analysis is not context sensitive. Our approach could be configured to use arbitrary data structure accesses as security-sensitive events, but we did not find it necessary.

Ganapathy et al. use concept analysis to extract security-sensitive events [15]. Their approach is complementary to ours, since our analysis could take as input the security-sensitive events they generate.

7.2 "Bugs as inconsistencies"

Mining programs for static and dynamic models of correct behavior has become a popular approach for finding bugs [2, 10, 14, 35]. In general, this approach finds bugs that reveal themselves as anomalies in common patterns of correct behavior. The fundamental assumption is that correct patterns occur many times in the program and are thus easy to recognize. Mining algorithms must be tuned to ignore unique or rarely observed behavior, or else they produce an overwhelming number of false positives. Because security policies are often unique to a particular API, the "bugs as inconsistencies" approach is likely to miss many vulnerabilities (e.g., see Section 2).

AutoISES statically mines security patterns in C code and flags deviations from frequent patterns as vulnerabilities [35]. Its analysis targets security checks performed prior to security-sensitive operations in SELinux kernel routines. The AutoISES paper claims that the analysis is context sensitive and flow insensitive, but does not describe the actual algorithm or convergence criteria. It does mention that the analysis partitions the call graph into modules for scalability, thus missing security patterns and bugs on cross-module call paths, although the authors argue that maintainable security policies should not cross modules—at least in C programs. In our experience with the Java libraries, both flow and context sensitivity are essential for precision.

Engler et al. statically extract intraprocedural patterns of correct behavior, including some security policies, and use them to detect anomalies [14]. Their policy descriptions include "a call to A always or sometimes dominates a call to B" and are thus similar to ours. However, these local patterns are not interprocedural—they do not include flow- and context-sensitive policies in which the security check occurs in a different method than the operation it protects. Furthermore, the approach of Engler et al. ignores rare patterns. Dillig et al. find bugs by detecting static inconsistencies in the program [10]. For example, they detect if the program contains both "if (x==NULL) foo.m();" and "*x=y;", which indicate both that x may be null and that x is definitely non-null. This analysis finds local contradictions rather than semantic differences. It captures neither rare events nor interprocedural patterns. Both approaches will miss many vulnerabilities in Java code, where security-sensitive events and the corresponding checks often occur in different methods.
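The interprocedural case these local techniques miss is routine in the Java libraries: the permission check sits in a public entry point while the sensitive operation sits in a helper. The following hypothetical sketch (names are illustrative, not from any library) shows the shape of such code; neither method, viewed in isolation, contains a check-event pair.

    // Hypothetical sketch: the security check and the security-sensitive event
    // occur in different methods, so intraprocedural pattern mining sees neither.
    public final class FileStub {
        public byte[] read(String path) {
            SecurityManager sm = System.getSecurityManager();
            if (sm != null) {
                sm.checkRead(path);       // security check in the public entry point
            }
            return readImpl(path);
        }

        private byte[] readImpl(String path) {
            return nativeRead(path);      // security-sensitive event: a native call, in a helper
        }

        private static native byte[] nativeRead(String path);
    }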
Another approach to mining specifications is to observe policies at run time. For example, Ammons et al. observe dynamic traces of method call sequences and use machine learning to create API usage specifications in the form of finite-state machines [2]. Dynamic mining approaches are at a disadvantage when policies are unique and/or rarely or never exercised by the test inputs.

Kim et al. use a database of previously fixed bugs to find their recurrences in new code [21]. Hovemeyer and Pugh use programmer-defined bug patterns to find their occurrences [19]. To avoid false positives, these tools require well-defined bug patterns that apply to all contexts. These types of patterns are useful for describing coding conventions, but generally do not capture context-specific semantics. Because many security policies are context dependent and unique, these techniques cannot classify them as correct or incorrect.

Our work is also distantly related to program differencing. We find inconsistencies between implementations of the same API using identical API entry points, but do not assume that the implementations are identical or even very similar. In our experience, different implementations often choose different algorithms and data structures. Techniques for finding method clones—methods that implement semantically similar functionality within the same program [3, 12, 23, 24]—are thus unlikely to classify two methods that implement the same API as clones. However, it might be possible to leverage clone detection by first finding clones, then extracting and comparing their security policies. We leave this exploration to future work.

In summary, our interprocedural, flow-sensitive and context-sensitive analysis is more precise than prior approaches. It does not require multiple instances of correct behavior within the same program and can thus find bugs in very rare patterns, such as the missing checkAccept in the example of Section 2. It does, however, require at least two independent implementations of the same API.

8. Discussion and Conclusions

This paper shows how to use precise, flow- and context-sensitive security policy analysis to infer thousands of relationships between security checks and security-sensitive events in Java code, and how to use this information to compare implementations of the same library API. The number of check-event relationships in these implementations is so large that it is clearly impractical for developers to analyze them manually. In fact, prior techniques that produced similar policies for each implementation in isolation did not find any errors. By comparing precise policies from multiple implementations, we create a security policy oracle: any policy difference between two implementations of the same functionality indicates an error! Our approach uncovered many security vulnerabilities and interoperability bugs in three large, mature, widely used implementations of the Java Class Library.
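As a rough sketch of the differencing step alone (the policies themselves come from the flow- and context-sensitive analysis described earlier, which this sketch does not attempt to reproduce), each entry point's policy can be viewed as a set of check-event relationships, and any asymmetry between two implementations of the same entry point is reported. The types and method names below are illustrative, not the tool's actual interface.

    // Illustrative sketch of security-policy differencing. Relation is a
    // hypothetical value type naming one check-event relationship.
    import java.util.HashSet;
    import java.util.Map;
    import java.util.Set;

    public final class PolicyDiff {
        record Relation(String check, String event) {}   // e.g., ("checkAccept", "nativeAccept")

        // Each map sends an API entry point to the relations one implementation enforces.
        static Set<String> diff(Map<String, Set<Relation>> implA,
                                Map<String, Set<Relation>> implB) {
            Set<String> entryPoints = new HashSet<>(implA.keySet());
            entryPoints.addAll(implB.keySet());

            Set<String> suspicious = new HashSet<>();
            for (String ep : entryPoints) {
                Set<Relation> a = implA.getOrDefault(ep, Set.of());
                Set<Relation> b = implB.getOrDefault(ep, Set.of());
                if (!a.equals(b)) {
                    suspicious.add(ep);   // at least one implementation enforces the wrong policy
                }
            }
            return suspicious;
        }
    }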
Of course, security-policy differencing requires multiple, independent implementations and thus limits the applicability of our approach. However, many critical APIs, such as the Java, C#, and C libraries, have multiple implementations. Furthermore, they are an essential component of virtually every substantial application written in these languages. At least one open-source version of widely used APIs is often available and can be used as a reference. In this use case, programmers of proprietary versions may not even need to read the open-source code, but only study any reported differences in security policies. If all API implementations are proprietary, developers could use this approach to make their respective implementations more secure if one or more of them are willing to share extracted policies.

Security vulnerabilities and other semantic errors in popular libraries open the door to attacks that can compromise many systems, not just a single application. With the advent of cloud computing and increasing demand for portability and architecture-specific optimizations, the prevalence of multiple implementations of the same API is likely to grow. This work shows for the first time how to leverage multiple implementations to improve the interoperability and security of each.

Our approach may be applicable to other types of interoperability bugs. As Figure 8 shows, semantic differences sometimes accidentally show up as security-policy differences. A generalization of our analysis that extracts and compares more general semantics of API implementations seems promising. For example, differences in how implementations access parameters, private variables, return values, and flow values from parameters to private variables and return values may reveal interoperability bugs. Similar analysis could detect differences in exceptions that may get thrown by each implementation and in the semantic information carried by return values.
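As a hypothetical illustration of the kind of difference such a generalized analysis might surface, the two variants below implement the same (made-up) encoding operation but disagree on exception behavior and on the default charset, echoing the Harmony code in Figure 8(b); neither difference involves a permission check, yet either one breaks interoperability.

    // Hypothetical example only: two variants of the same API operation that a
    // generalized differencing analysis could flag as interoperability risks.
    import java.nio.charset.Charset;
    import java.nio.charset.StandardCharsets;

    final class EncodeVariants {
        // Variant A: rejects null eagerly and uses the platform default charset.
        static byte[] encodeA(String s) {
            if (s == null) {
                throw new IllegalArgumentException("s must not be null");
            }
            return s.getBytes(Charset.defaultCharset());
        }

        // Variant B: lets null surface as a NullPointerException and hard-codes
        // ISO-8859-1, as in the Harmony code of Figure 8(b).
        static byte[] encodeB(String s) {
            return s.getBytes(StandardCharsets.ISO_8859_1);
        }
    }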
Acknowledgments

Thanks to Andrew John Hughes for his generous help with verifying Classpath bugs and for feedback on the paper text; Sam Guyer for useful discussions about static analysis; and the anonymous reviewers for helpful feedback on the text.

The research described in this paper was partially supported by the NSF grants CNS-0746888, CCF-0811523, CNS-0905602, SHF-0910818, and CCF-1018721, a Google research award, and the MURI program under AFOSR Grant No. FA9550-08-1-0352.

References

[1] Amazon. Amazon Web Services. http://aws.amazon.com/.
[2] G. Ammons, R. Bodík, and J. R. Larus. Mining specifications. In ACM Symposium on the Principles of Programming Languages, pages 4-16, 2002.
[3] B. S. Baker. On finding duplication and near-duplication in large software systems. In IEEE Working Conference on Reverse Engineering, pages 86-95, 1995.
[4] T. Ball and S. K. Rajamani. The SLAM project: Debugging system software via static analysis. In ACM Symposium on the Principles of Programming Languages, pages 1-3, 2002.
[5] T. Ball, E. Bounimova, B. Cook, V. Levin, J. Lichtenberg, C. McGarvey, B. Ondrusek, S. K. Rajamani, and A. Ustuner. Thorough static analysis of device drivers. In ACM European Conference on Computer Systems, pages 73-85, 2006.
[6] D. Beyer, T. A. Henzinger, R. Jhala, and R. Majumdar. The software model checker BLAST. International Journal on Software Tools for Technology Transfer, 9(5-6):505-525, 2007.
[7] H. Chen and D. Wagner. MOPS: An infrastructure for examining security properties of software. In ACM Conference on Computer and Communications Security, pages 235-244, 2002.
[8] E. M. Clarke, E. A. Emerson, and A. P. Sistla. Automatic verification of finite-state concurrent systems using temporal logic specifications. ACM Transactions on Programming Languages and Systems, 8(2):244-263, 1986.
[9] C. Cowan, P. Wagle, C. Pu, S. Beattie, and J. Walpole. Buffer overflows: Attacks and defenses for the vulnerability of the decade. In DARPA Information Survivability Conference and Exposition, pages 119-129, 2000.
[10] I. Dillig, T. Dillig, and A. Aiken. Static error detection using semantic inconsistency inference. In ACM Conference on Programming Language Design and Implementation, pages 435-445, 2007.
[11] A. Diwan, K. S. McKinley, and J. E. B. Moss. Using types to analyze and optimize object-oriented programs. ACM Transactions on Programming Languages and Systems, 23(1):30-72, 2001.
[12] S. Ducasse, M. Rieger, and S. Demeyer. A language independent approach for detecting duplicated code. In IEEE International Conference on Software Maintenance, pages 109-118, 1999.
[13] E. A. Emerson and E. M. Clarke. Characterizing correctness properties of parallel programs using fixpoints. In Colloquium on Automata, Languages and Programming, pages 169-181, 1980.
[14] D. Engler, D. Y. Chen, S. Hallem, A. Chou, and B. Chelf. Bugs as deviant behavior: A general approach to inferring errors in systems code. In ACM Symposium on Operating Systems Principles, pages 57-72, 2001.
[15] V. Ganapathy, D. King, T. Jaeger, and S. Jha. Mining security-sensitive operations in legacy code using concept analysis. In ACM International Conference on Software Engineering, pages 458-467, 2007.
[16] Google. Google Apps. http://www.google.com/apps/.
[17] D. Grove and L. Torczon. Interprocedural constant propagation: A study of jump function implementations. In ACM Conference on Programming Language Design and Implementation, pages 90-99, 1993.
[18] S. Z. Guyer and C. Lin. Error checking with client-driven pointer analysis. Science of Computer Programming, 58(1-2):83-114, 2005.
[19] D. Hovemeyer and W. Pugh. Finding bugs is easy. In ACM OOPSLA Onward!, pages 92-106, 2004.
[20] IBM. Cloud Computing. http://ibm.com/developerworks/cloud/.
[21] S. Kim, K. Pan, and E. E. J. Whitehead, Jr. Memories of bug fixes. In ACM Symposium on the Foundations of Software Engineering, pages 35-45, 2006.
[22] L. Koved, M. Pistoia, and A. Kershenbaum. Access rights analysis for Java. In ACM Conference on Object-Oriented Programming, Systems, Languages, and Applications, pages 359-372, 2002.
[23] J. Krinke. Identifying similar code with program dependence graphs. In IEEE Working Conference on Reverse Engineering, pages 301-309, 2001.
[24] A. M. Leitão. Detection of redundant code using R2D2. Software Quality Control, 12(4):361-382, 2004.
[25] O. Lhoták and L. Hendren. Context-sensitive points-to analysis: Is it worth it? In International Conference on Compiler Construction, pages 47-64, 2006.
[26] Z. Li, L. Tan, X. Wang, S. Lu, Y. Zhou, and C. Zhai. Have things changed now? An empirical study of bug characteristics in modern open source software. In Workshop on Architectural and System Support for Improving Software Dependability (ASID), pages 25-33, 2006.
[27] T. J. Marlowe and B. G. Ryder. Properties of data flow frameworks. Acta Informatica (ACTA), 28(2):121-163, 1990.
[28] M. Pistoia, R. J. Flynn, L. Koved, and V. C. Sreedhar. Interprocedural analysis for privileged code placement and tainted variable detection. In European Conference on Object-Oriented Programming, pages 362-386, 2005.
[29] Salesforce. Salesforce Platform. http://www.salesforce.com/platform/.
[30] A. P. Sistla, V. N. Venkatakrishnan, M. Zhou, and H. Branske. CMV: Automatic verification of complete mediation for Java Virtual Machines. In ACM Symposium on Information, Computer and Communications Security, pages 100-111, 2008.
[31] V. Srivastava. Vulnerabilities submitted to Classpath, Dec 2009-Jan 2010. http://gcc.gnu.org/bugzilla/show_bug.cgi?id=42390.
[32] V. Srivastava. Vulnerabilities submitted to Harmony, Nov 2009. https://issues.apache.org/jira/browse/HARMONY-6367.
[33] V. Srivastava. Vulnerabilities submitted to Sun JDK, Jan-Oct 2010. http://bugs.sun.com/bugdatabase/view_bug.do?bug_id=6914460.
[34] V. Sundaresan, L. Hendren, C. Razafimahefa, R. Vallée-Rai, P. Lam, E. Gagnon, and C. Godin. Practical virtual method call resolution for Java. In ACM Conference on Object-Oriented Programming, Systems, Languages, and Applications, pages 264-280, 2000.
[35] L. Tan, X. Zhang, X. Ma, W. Xiong, and Y. Zhou. AutoISES: Automatically inferring security specifications and detecting violations. In USENIX Security Symposium, pages 379-394, 2008.
[36] M. N. Wegman and F. K. Zadeck. Constant propagation with conditional branches. ACM Transactions on Programming Languages and Systems, 13(2):181-210, 1991.
[37] J. Whaley, M. C. Martin, and M. S. Lam. Automatic extraction of object-oriented component interfaces. In ACM International Symposium on Software Testing and Analysis, pages 218-228, July 2002.
[38] R. P. Wilson and M. S. Lam. Efficient context-sensitive pointer analysis for C programs. In ACM Conference on Programming Language Design and Implementation, pages 1-12, 1995.
[39] B. Yee, D. Sehr, G. Dardyk, J. B. Chen, R. Muth, T. Ormandy, S. Okasaka, N. Narula, and N. Fullagar. Native Client: A sandbox for portable, untrusted x86 native code. Communications of the ACM, 53(1):91-99, 2010.