CrashAwareDev: Supporting Software Development based on Crash Report Mining and Analysis

Leandro Beserra*†, Roberta Coelho†
*Informatics Superintendence
†Department of Informatics and Applied Mathematics
Federal University of Rio Grande do Norte, Natal, Brazil

Abstract: Exception handling mechanisms are a common feature of mainstream programming languages to deal with exceptional conditions. On the one hand, they allow the programmer to prepare the code to deal with exceptional conditions; on the other hand, they can become a source of bugs that threatens application robustness: studies have shown that uncaught exceptions are the main cause of application crashes. To keep track of crashes and make fault localization easier, many applications now use crash reporting tools. In this work, we propose a tool named CrashAwareDev, integrated with the Eclipse IDE, which mines the information available in crash reporting tools to support programmers in their day-to-day activities. The proposed tool alerts developers about the classes related to recent crashes and warns about source code characteristics (bug patterns) that can be related to future crashes (similar to the ones that have happened). In doing so, the tool aims at bringing the development environment closer to crash reporting tools, which are usually only used for bug fixing or for extracting robustness metrics. A case study was conducted and showed that the tool can support software development by displaying bug pattern alerts directly in the source code, signaling the classes involved in recent faults, and speeding up the crash’s fault localization within the development environment itself.

Index Terms: Exception handling, crash report, Eclipse plug-in, uncaught exceptions

DOI reference number: 10.18293/SEKE2019-002

I. INTRODUCTION

Exception handling mechanisms [1] are a common feature of modern programming languages. Exception handling structures are used to deal with unexpected events that occur during the execution of a program [2], allowing exceptions to be thrown, captured, and handled at different points in the system. However, the exception handling code designed to make a system more robust often works the other way around and becomes a burden that programmers have to cope with, leading to bugs such as uncaught exceptions. Uncaught exceptions are the main cause of crashes in Java software systems [3]. A crash is an abnormal behavior of a system that leads to the interruption of its execution.

After a crash occurs, systems typically store information related to the crash in crash report systems. Such information usually contains the uncaught exception that caused the crash and its stack trace. The exception stack trace is a representation of the method call stack and contains information about the classes and methods through which an exception was propagated. Since the exception stack trace is a source of information widely used by programmers while debugging, it is often added to crash reports [4]. Moreover, these reports may contain other information such as the operating system used at the time of the crash, the user login, the browser/system version, the user’s IP address, and other request parameters. In addition to facilitating fault localization, the information available in crash report tools can assist in prioritizing bug corrections (depending on the number of users affected, for example) or in understanding the impact of system crashes [5]. The utility of crash report systems may go beyond recording crash data and supporting debugging. Some studies have shown that such information can be used for various purposes such as fault classification [6][7] and fault localization [8][9][10]. None of the existing works, however, extracts data from crash reports to support programmers during system coding.

In this paper, we propose a way to mine information stored in crash reports and provide useful information to the programmer within his/her programming environment. We present a tool called CrashAwareDev, an Eclipse (https://www.eclipse.org/) IDE plug-in, which supports the developer at coding time by (i) alerting him/her about the classes related to recent crashes; (ii) warning about source code characteristics (bug patterns) that can be related to future crashes (similar to the ones that have happened); and (iii) providing direct access to crash reports within the IDE. A case study was conducted on an industrial web-based software system comprising 1300 KLOC of Java source code. During the evaluation period, a group of 5 developers used the tool on a daily basis (for 4 days). Overall, the tool presented 95 warnings (i.e., 17 class alerts and 78 bug pattern warnings).

II. BACKGROUND

The Java programming language provides an exception handling mechanism to support error handling [12]. When an error occurs during execution of code in a try block, the error is caught and handled by an exception handler in one of the subsequent catch blocks associated with it. If no catch block can handle the error, the method is terminated abnormally and the Java virtual machine (JVM) searches backward through the call stack to find an exception handler that can handle the error [15]. In Java, exceptions are represented according to a class hierarchy in which every exception is an instance of the Throwable class and can be of three kinds: checked exceptions (extending Exception), runtime exceptions (extending RuntimeException), and errors (extending Error) [12].

Checked exceptions represent conditions that, although exceptional, can reasonably be expected to occur, and if they do occur must be dealt with in some way. Unchecked runtime exceptions represent conditions that, generally speaking, reflect errors in the program’s logic and cannot be reasonably recovered from at run time [13]. By convention, instances of Error represent unrecoverable conditions which usually result from failures detected by the Java Virtual Machine due to resource limitations, such as OutOfMemoryError. Normally these cannot be handled inside the application [13].

III. ANALYZING CRASH REPORTS

A. Methodology

Before implementing the CrashAwareDev tool, we conducted a study whose goal was to identify the characteristics of the most frequent crashes of an industrial Web-based system and, based on such characteristics, check whether or not existing static analysis tools could alert about them. Figure 1 illustrates the main steps taken in this study. The target system used in this study was an industrial Web-based system, named SIPAC, comprising 1300 KLOC of Java source code and designed to automate business processes for universities, focusing on different and complementary aspects such as administration, planning, and management. SIPAC is used in approximately 55 Brazilian universities.

Fig. 1. Methodology overview.

Step 1: Analysis of Crash Reports. We analyzed the most frequent exceptions that caused crashes over a period of one month.

Step 2: Code Inspection and Identification of Bug Patterns. We manually inspected the code related to the most frequent kinds of system crashes to identify common characteristics among them.

Step 3: Analysis of Existing Static Analysis Tools. We analyzed whether or not existing static analysis tools would be able to detect, and therefore prevent, the causes of crashes identified in the previous step.

Step 4: Survey. Finally, we conducted a study in which we surveyed a group of developers (working on the target system) to evaluate how they used crash report information.

B. Crash Analysis

In order to investigate the main causes of crashes reported for the target system, we analyzed the crash reports of one month (June 2018). In this period (2018-06-01 to 2018-06-30), approximately 1,933,617 accesses were made to the system, of which 680 (about 0.035% of the accesses) were faulty, meaning that the user faced one or more crashes. We observed that approximately 1966 crashes were recorded in the period, affecting 347 distinct users.

We extracted the types of the most frequent uncaught exceptions (leading to crashes). The top 5 types of uncaught exceptions raised are shown in Table I. We can observe that approximately 63.1% (1241 of 1966) of the crashes in the analyzed period were caused by five types of exceptions.

| Root Cause | Occurrences | % | Analyzed |
|---|---|---|---|
| NullPointerException | 749 | 38.09 | 10 |
| LazyInitializationException | 171 | 8.69 | 2 |
| JspException | 166 | 8.44 | 21 |
| IllegalArgumentException | 102 | 5.18 | 1 |
| PSQLException | 53 | 2.69 | 5 |
| Total | 1241 | 63.09 | 39 |

TABLE I. THE TOP 5 TYPES OF UNCAUGHT EXCEPTIONS RAISED IN JUNE/2018.

We then manually inspected the source code [...] definition of four specific bug patterns listed in Table II.

| Tag | Description | Occurrences |
|---|---|---|
| BP01 | Unchecked database query return | 3 |
| BP02 | Unchecked classes methods parameters | 2 |
| BP03 | Unchecked request parameters | 2 |
| BP04 | Controllers do not implement get/set methods | 4 |

TABLE II. BUG PATTERNS DETECTED DURING ANALYSIS.

Each bug pattern found in this study is described as follows:

• BP01: When querying any data in a database, it is recommended to check that the returned value is not null, to prevent a possible null pointer exception.
• BP02: Static methods of utility classes should check that their arguments are not null.
• BP03: Like BP01, objects retrieved from the HTTP session must be checked.
• BP04: Private attributes of view controllers should, in most cases, have get and set methods implemented. Otherwise, a JspException may be thrown while rendering the Web page.

C. Analysis of Existing Static Analysis Tools

We then investigated whether some of the existing static analysis tools could alert about some of the 39 crash causes identified during the manual inspection. We used SonarLint and SpotBugs, both in their Eclipse plug-in versions, and observed that none of the 39 defects were detected by the tools.

D. Survey

We applied a survey to the developers working on the target system to investigate how they use the crash report information during development.
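As an illustration of the null-check bug patterns BP01 and BP02 from Section III-B, the following is a minimal Java sketch, not code from SIPAC or CrashAwareDev. The class `BugPatternSketch`, the in-memory `USERS` map standing in for a database, and all method names are hypothetical and chosen only for this example. The sketch contrasts an unguarded dereference of a query result, which raises the NullPointerException that dominates Table I, with guarded variants.

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;
import java.util.Optional;

/** Hypothetical illustration of bug patterns BP01 and BP02; no name here comes from SIPAC. */
final class BugPatternSketch {

    /** Assumed stand-in for a database table: maps a user id to a display name. */
    private static final Map<Integer, String> USERS = new HashMap<>();

    static {
        USERS.put(1, "alice");
    }

    private BugPatternSketch() {
        // Utility class, not meant to be instantiated.
    }

    /** Simulates a query that returns null when no row matches the id. */
    static String findUserName(int id) {
        return USERS.get(id);
    }

    /** BP01 instance: the query result is dereferenced without a null check. */
    static int nameLengthUnchecked(int id) {
        // Throws NullPointerException when the id is unknown.
        return findUserName(id).length();
    }

    /** BP01 guarded: the missing-row case is handled explicitly with a default of 0. */
    static int nameLengthChecked(int id) {
        return Optional.ofNullable(findUserName(id))
                .map(String::length)
                .orElse(0);
    }

    /** BP02 guarded: a static utility method that checks its argument before using it. */
    static String normalize(String text) {
        if (text == null) {
            return "";
        }
        return text.trim().toLowerCase(Locale.ROOT);
    }

    public static void main(String[] args) {
        System.out.println(nameLengthChecked(1));   // known id
        System.out.println(nameLengthChecked(42));  // unknown id, falls back to 0
        try {
            nameLengthUnchecked(42);
        } catch (NullPointerException e) {
            // Left uncaught in a real application, this exception would surface as a crash.
            System.out.println("NullPointerException from unchecked query result");
        }
    }
}
```

Returning a default value is only one possible guard; depending on the call site, throwing a domain-specific exception or showing an error page may be the more appropriate way to handle a missing value.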
