nAdroid: Statically Detecting Ordering Violations in Android Applications

Xinwei Fu (Virginia Tech, USA), Dongyoon Lee (Virginia Tech, USA), Changhee Jung (Virginia Tech, USA)

Abstract

Modern mobile applications use a hybrid concurrency model. In this model, events are handled sequentially by event loop(s), and long-running tasks are offloaded to other threads. Concurrency errors in this hybrid concurrency model can take multiple forms: traditional atomicity and ordering violations between threads, as well as ordering violations between event callbacks on a single event loop.

This paper presents nAdroid, a static ordering violation detector for Android applications. Using our threadification technique, nAdroid statically models event callbacks as threads. Threadification converts ordering violations between event callbacks into ordering violations between threads, after which state-of-the-art thread-based race detection tools can be applied. nAdroid then applies a combination of sound and unsound filters, based on the Android concurrency model and its happens-before relation, to prune out false and benign warnings.

We evaluated nAdroid with 27 open source Android applications. Experimental results show that nAdroid detects 88 (at least 58 new) harmful ordering violations, and outperforms the state-of-the-art static technique with fewer false negatives and false positives.

CCS Concepts • Software and its engineering → Software testing and debugging;

Keywords Ordering violation, Data race, Use-after-free, Static analysis, Debugging, Android, Threadification

ACM Reference Format:
Xinwei Fu, Dongyoon Lee, and Changhee Jung. 2018. nAdroid: Statically Detecting Ordering Violations in Android Applications. In Proceedings of 2018 IEEE/ACM International Symposium on Code Generation and Optimization (CGO'18). ACM, New York, NY, USA, 13 pages. https://doi.org/10.1145/3168829

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
CGO'18, February 24–28, 2018, Vienna, Austria
© 2018 Association for Computing Machinery.
ACM ISBN 978-1-4503-5617-6/18/02...$15.00
https://doi.org/10.1145/3168829

1 Introduction

The mainstream mobile platforms (Android, iOS, Windows Phone) offer a distinctive concurrent programming model. Mobile applications are often sensor-driven (e.g. touchscreen, GPS), and sensor data is most readily handled by the event-driven model. On the other hand, developers want to take advantage of the multiprocessors in modern mobile devices [32]. To accommodate these competing demands, the concurrency model on mobile platforms is a hybrid of event loop(s) that sequentially handle events, and background threads that concurrently execute long-running tasks.

While this hybrid model allows developers to balance responsiveness with performance, it also exposes mobile applications to new classes of concurrency errors. While mobile applications can of course contain conventional multi-threaded data races due to a non-deterministic thread schedule, recent studies have shown that these applications can also contain single-thread data races resulting from a non-deterministic event posting order [3, 9, 17, 26]. These concurrency errors have been shown to cause issues like performance degradation [24], unexpected termination [3], accelerated battery drain [33], and security vulnerabilities [6–8].

Concurrency errors in mobile systems have been tackled with both dynamic and static techniques. Several works test mobile applications dynamically, collecting execution traces and performing offline data race detection [3, 17, 26]. Though dynamic testing has relatively few false positives, the detection coverage is limited to the observed executions.

In contrast, static analysis can inspect program code for all possible runtime behaviors. However, the use of static program analysis to detect concurrency errors has not yet been well studied in the context of mobile applications. Conventional static data race detectors [11, 12, 19, 29, 35, 43] only focus on multi-threading, making them unsuitable for event-driven mobile applications. Recently, several new techniques have been proposed for mobile applications [24, 33, 39], but their scope is limited: e.g., they do not consider the happens-before relationship between event callbacks (§2).

This paper presents nAdroid¹, a novel static ordering violation detector for mobile applications with hybrid concurrency model, tailored to Android. Though nAdroid can statically detect all types of data races, classifying data races as harmful or benign is hard in general [20, 30, 45]. Therefore, this study focuses on finding use-after-free (UAF) ordering violations (e.g., f=null vs. f.use()). A UAF ordering violation is a form of harmful read-after-write data race, because it can lead to an unexpected NullPointerException.

¹ nAdroid is named after Android, but has an "ordering violation" between the first two letters.

The key challenge is that the event-based and thread-based programming models have distinct patterns and dissimilar happens-before relations [22], making it hard to statically detect them together. nAdroid addresses this problem using our novel threadification technique (§4) that statically models the event-driven aspects of Android applications as threads. In effect, threadification converts the tricky problem of detecting single-threaded ordering violations between event callbacks into the well-studied problem of detecting multi-threaded ordering violations. This allows nAdroid to leverage existing static data race detectors developed for multi-threaded programs (Chord [29] in our study) to detect both event-driven and threaded ordering violations in a unified manner (§5).

Furthermore, nAdroid introduces novel static happens-before-based filters, crafted based on the Android concurrency model, to prune out false UAF warnings. It is critical to remove false positives as they are often overwhelming enough to make programmers unwilling to use a detection tool. The problems specific to the Android context have not been addressed by existing static tools. We describe sound and unsound filters for these problems (§6).

We evaluated nAdroid using 27 open-source Android applications, from which nAdroid detects 88 (at least 58 novel) true harmful ordering violations. Experimental results also show that nAdroid produces fewer false negatives and fewer false positives than the state-of-the-art static technique, DEvA [39].

This paper makes the following contributions:
• We present nAdroid, a static ordering violation detector for Android, that considers both the event-based and [...]
• [...] happens-before relations into static analysis to prune false or benign warnings.
• We evaluate 27 Android applications and show that nAdroid detects true harmful ordering violations, and produces fewer false positives and false negatives than the state-of-the-art static technique.

[Figure 1. Examples of harmful use-after-free (UAF) ordering violations. (a) Single-threaded UAF 1 in ConnectBot, between a looper thread and a service thread: callbacks onServiceConnected, onServiceDisconnected, onCreateContextMenu; statements bound=get(), bound=null, bound.use(). (b) Single-threaded UAF 2 in ConnectBot, between a looper thread and a service thread: callbacks onServiceConnected, onClick, onServiceDisconnected, run; statements hostBridge=get(), if(hostBridge!=null) post(runnable), hostBridge=null, hostBridge.use(). (c) Multi-threaded UAF in FireFox, between a looper thread and a background thread: callbacks onResume, onPause, run; statements ThreadPool.run(), jClient = null, if(jClient!=null) { jClient.abort(); }.]

2 Background and Motivation

This section introduces the Java-based Android concurrency model, provides three examples of harmful UAF ordering violations that nAdroid found, and demonstrates the limitations of existing techniques.

2.1 The Android Concurrency Model

An Android application has a hybrid event-driven and thread-based concurrency model to handle a mix of incoming sensor data and user interactions (UI) best addressed with events, and to support arbitrary processing, best addressed with threads. A thread may attach an event queue to itself and handle an event for execution. A thread with an event queue is called a looper thread. It continuously checks its event queue and processes one event at a time by executing the corresponding event callback. Therefore, all the event callbacks executed in one looper thread are atomic (no preemption) with respect to each other. Furthermore, the application may create additional native threads. Since it is cumbersome for a native thread to communicate with a looper thread, the Android framework also provides a high-level concurrent construct, AsyncTask, to create a child thread that can interact with the looper thread via events.

2.2 Examples of UAF Violations

Figure 1 shows some real examples of harmful UAF ordering violations that nAdroid found in Android applications.
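To make the looper model of §2.1 and the pattern of Figure 1(a) concrete, the following is a minimal sketch in plain Java, not Android code and not taken from ConnectBot. The class LooperUafSketch, the nested Service type, and the runLooper helper are hypothetical names introduced for illustration; the callback names and the field bound echo the labels in Figure 1(a), and the assignment of statements to callbacks is an assumption read off those labels. The sketch drains an event queue one callback at a time, so callbacks never preempt each other, yet the outcome still depends on the order in which events were posted.

```java
import java.util.ArrayDeque;
import java.util.List;
import java.util.Queue;

public class LooperUafSketch {
    /** Hypothetical stand-in for the bound service object. */
    static final class Service {
        String use() { return "used"; }
    }

    /** Shared field that different event callbacks write and read. */
    private Service bound;

    // Event callbacks, named after the labels in Figure 1(a).
    void onServiceConnected()    { bound = new Service(); }
    void onServiceDisconnected() { bound = null; }          // the "free"
    void onCreateContextMenu()   { bound.use(); }           // the "use"

    /**
     * Minimal looper: runs the posted events one at a time on the calling
     * thread. Returns true if a callback threw a NullPointerException.
     */
    static boolean runLooper(List<Runnable> postedEvents) {
        Queue<Runnable> queue = new ArrayDeque<>(postedEvents);
        try {
            while (!queue.isEmpty()) {
                queue.poll().run();
            }
            return false;
        } catch (NullPointerException e) {
            return true;
        }
    }

    /** Posting order: connect, disconnect, use. The use follows the free. */
    static boolean crashesWhenUseComesLast() {
        LooperUafSketch app = new LooperUafSketch();
        return runLooper(List.<Runnable>of(
                app::onServiceConnected,
                app::onServiceDisconnected,
                app::onCreateContextMenu));
    }

    /** Posting order: connect, use, disconnect. The use precedes the free. */
    static boolean crashesWhenUseComesFirst() {
        LooperUafSketch app = new LooperUafSketch();
        return runLooper(List.<Runnable>of(
                app::onServiceConnected,
                app::onCreateContextMenu,
                app::onServiceDisconnected));
    }

    public static void main(String[] args) {
        System.out.println("use after disconnect crashes: " + crashesWhenUseComesLast());
        System.out.println("use before disconnect crashes: " + crashesWhenUseComesFirst());
    }
}
```

Running main reports a crash only for the first posting order. No second thread is involved in either run: the two runs differ only in event order, which is the sense in which the violation is single-threaded.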
