JavaScript: The (Un)covered Parts

Amin Milani Fard, University of British Columbia, Vancouver, BC, Canada ([email protected])
Ali Mesbah, University of British Columbia, Vancouver, BC, Canada ([email protected])

Abstract—Testing JavaScript code is important. JavaScript has grown to be among the most popular programming languages and is extensively used to create web applications on both the client and the server. We present the first empirical study of JavaScript tests to characterize their prevalence, quality metrics (e.g. code coverage), and shortcomings. We perform our study across a representative corpus of 373 JavaScript projects, with over 5.4 million lines of JavaScript code. Our results show that 22% of the studied subjects do not have test code. About 40% of projects with client-side JavaScript do not have any test, while this is only about 3% for the purely server-side JavaScript projects. Tests for server-side code have high quality (in terms of code coverage, test code ratio, test commit ratio, and average number of assertions per test), while tests for client-side code have moderate to low quality. In general, tests written in the Mocha, Tape, Tap, and Nodeunit frameworks have high quality and those written without using any framework have low quality. We scrutinize the (un)covered parts of the code under test to find the root causes of the uncovered code. Our results show that JavaScript tests lack proper coverage for event-dependent callbacks (36%), asynchronous callbacks (53%), and DOM-related code (63%). We believe that it is worthwhile for the developer and research community to focus on testing techniques and tools that achieve better coverage for difficult-to-cover JavaScript code.

Keywords—JavaScript applications; testing; empirical study; test quality; code coverage.

I. INTRODUCTION

JavaScript is currently the most widely used programming language according to a recent survey of more than 56K developers conducted by Stack Overflow [43], as well as an exploration of the programming languages used across GitHub repositories [22]. JavaScript is extensively used to build responsive modern web applications, and is also used to create desktop and mobile applications, as well as server-side network programs. Consequently, testing JavaScript applications and modules is important. However, JavaScript is quite challenging to test and analyze due to some of its specific features. For instance, the complex and dynamic interactions between JavaScript and the Document Object Model (DOM) make it hard for developers to test effectively [32], [29], [19].

To assist developers with writing tests, there exist a number of JavaScript testing frameworks, such as Mocha [6], Jasmine [4], QUnit [9], and Nodeunit [8], each having its own advantages [10]. The research community has also proposed automated testing tools and test generation techniques for JavaScript programs [33], [29], [32], [19], [23], though they are not considerably used by testers and developers yet. Some JavaScript features, such as DOM interactions, event-dependent callbacks, asynchronous callbacks, and closures (hidden scopes), are considered to be harder to test [1], [2], [12], [11], [29], [44]. However, there is no evidence of the extent to which this holds in real-world practice.

In this work, we study JavaScript (unit) tests in the wild from different angles. The results of this study reveal some of the shortcomings and difficulties of manual testing, which provide insights on how to improve existing JavaScript test generation tools and techniques. We perform our study across a representative corpus of 373 popular JavaScript projects, with over 5.4 million lines of JavaScript code. To the best of our knowledge, this work is the first study on JavaScript tests. The main contributions of our work include:

• A large-scale study to investigate the prevalence of JavaScript tests in the wild;
• A tool, called TESTSCANNER, which statically extracts different metrics in our study and is publicly available [18];
• An evaluation of the quality of JavaScript tests in terms of code coverage, average number of assertions per test, test code ratio, and test commit ratio;
• An analysis of the uncovered parts of the code under test to understand which parts are difficult to cover and why.

II. METHODOLOGY

The goal of this work is to study and characterize JavaScript tests in practice. We conduct quantitative and qualitative analyses to address the following research questions:

RQ1: How prevalent are JavaScript tests?
RQ2: What is the quality of JavaScript tests?
RQ3: Which part of the code is mainly uncovered by tests, and why?

A. Subject Systems

We study 373 popular open source JavaScript projects. 138 of these subject systems are the ones used in a study of JavaScript callbacks [21], including 86 of the most depended-on modules in the NPM repository [15] and 52 JavaScript repositories from GitHub Showcases [13] (GitHub Showcases include popular and trending open source repositories organized around different topics). Moreover, we added 234 JavaScript repositories from GitHub with over 4000 stars. The complete list of these subjects and our analysis results are available for download [18]. We believe that this corpus of 373 projects is representative of real-world JavaScript projects as they differ in domain (category), size (SLOC), maturity (number of commits and contributors), and popularity (number of stars and watchers).

TABLE I: Our JavaScript subject systems (60K files, 3.7 M production SLOC, 1.7 M test SLOC, and 100K test cases).
ID | Category | # Subject systems | Ave # JS files | Ave prod SLOC | Ave test SLOC | Ave # tests | Ave # assertions | Ave # stars
C1 | UI Components, Widgets, and Frameworks | 52 | 41 | 4.7K | 2.8K | 235 | 641 | 9.8K
C2 | Visualization, Graphics, and Animation Libraries | 48 | 53 | 10.2K | 3.8K | 425 | 926 | 7.5K
C3 | Web Applications and Games | 33 | 61 | 10.6K | 1.4K | 61 | 119 | 4K
C4 | Software Development Tools | 29 | 67 | 12.7K | 7.8K | 227 | 578 | 6.9K
C5 | Web and Mobile App Design and Frameworks | 25 | 91 | 22.3K | 6.9K | 277 | 850 | 14.4K
C6 | Parsers, Code Editors, and Compilers | 22 | 167 | 27K | 9.5K | 701 | 1142 | 5.5K
C7 | Editors, String Processors, and Templating Engines | 19 | 26 | 4.3K | 1.9K | 102 | 221 | 6.5K
C8 | Touch, Drag&Drop, Sliders, and Galleries | 19 | 10 | 1.9K | 408 | 52 | 72 | 7.9K
C9 | Other Tools and Libraries | 17 | 93 | 9.1K | 7.6K | 180 | 453 | 8.5K
C10 | Network, Communication, and Async Utilities | 16 | 19 | 4.1K | 7.6K | 279 | 354 | 7.6K
C11 | Game Engines and Frameworks | 13 | 86 | 17K | 1.2K | 115 | 293 | 3.5K
C12 | I/O, Stream, and Keyboard Utilities | 13 | 8 | 0.6K | 1K | 40 | 61 | 1.5K
C13 | Package Managers, Build Utilities, and Loaders | 11 | 47 | 3.4K | 5.4K | 200 | 300 | 8.5K
C14 | Storage Tools and Libraries | 10 | 19 | 4K | 7K | 222 | 317 | 5.5K
C15 | Testing Frameworks and Libraries | 10 | 28 | 2.8K | 3.6K | 271 | 632 | 5.7K
C16 | Browser and DOM Utilities | 9 | 45 | 5.6K | 7.1K | 76 | 179 | 5.2K
C17 | Command-line Interface and Shell Tools | 9 | 9 | 2.8K | 1K | 26 | 244 | 2.6K
C18 | Multimedia Utilities | 9 | 11 | 1.6K | 760 | 17 | 97 | 6.2K
C19 | MVC Frameworks | 9 | 174 | 40.1K | 15.2K | 657 | 1401 | 14.2K
Client-side | | 128 | 39 | 8.2K | 3.2K | 343 | 798 | 7.9K
Server-side | | 130 | 63 | 9.4K | 7.2K | 231 | 505 | 6.7K
Client and server-side | | 115 | 73 | 12.7K | 4.7K | 221 | 402 | 7.4K
Total | | 373 | 57 | 10.1K | 4.5K | 263 | 644 | 7.3K

We categorize our subjects into 19 categories using topics from the JSter JavaScript Libraries Catalog [14] and GitHub Showcases [13] for the same or similar projects. Table I presents these categories with average values for the number of JavaScript files (production code), source lines of code (SLOC) for production and test code, number of test cases, and number of stars in the GitHub repository for each category. We used SLOC [17] to count lines of source code, excluding libraries. Overall, we study over 5.4 million (3.7 M production and 1.7 M test) source lines of JavaScript code.

Figure 1 depicts the distribution of our subject systems with respect to client- or server-side code. Those systems that contain server-side components are written in Node.js (https://nodejs.org), a popular server-side JavaScript framework. We apply the same categorization approach as explained in [21]. Some projects, such as MVC frameworks, e.g. Angular, are purely client-side, while most NPM modules are purely server-side. We assume that client-side code is stored in directories such as www, public, static, or client. We also use code annotations such as /* browser:true */ to identify client-side code.

The 373 studied projects include 128 client-side, 130 server-side, and 115 client&server-side projects. While these groups have almost the same size in total, they differ per project category. For instance, subject systems in categories C1 (UI components), C2 (visualization), C8 (touch and drag&drop), C19 (MVC frameworks), and C18 (multimedia) are mainly client-side, and those in categories C4 (software dev tools), C6 (parsers and compilers), C12 (I/O), C13 (package and build managers), C14 (storage), C16 (browser utils), and C17 (CLI and shell) are mainly server-side.

Fig. 1: Distribution of studied subject systems (percentage of client-side, server-side, and client and server-side subjects per category and in total).

B. Analysis

To address our research questions, we statically and dynamically analyze the test suites of our subject programs. To extract some of the metrics in our study, we develop a static analyzer tool, called TESTSCANNER [18], which parses production and test code into an abstract syntax tree using Mozilla Rhino [7]. In the rest of this section we explain the details of our analysis for each research question.

1) Prevalence of tests (RQ1): To answer RQ1, we look for the presence of JavaScript tests written in any framework (e.g. Mocha, Jasmine, or QUnit). Tests are usually located in folders named tests, specs (for instance, Jasmine and Mocha tests are written as specs and are usually located in folders with similar names), or similar.

We further investigate the prevalence of JavaScript tests with respect to subject categories, client/server-side code, popularity (number of stars and watchers), maturity (number of commits and contributors), project size (production SLOC), and testing frameworks. To distinguish testing frameworks, we analyze package management files (such as package.json), task runner and build files (such as grunt.js and gulpfile.js), and the test files themselves.
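As an illustration of this heuristic, a project's package management file often names the testing framework directly. The fragment below is a hypothetical example (the project name and version ranges are made up, not taken from the studied subjects); both the npm test script and the devDependencies point to Mocha:

  {
    "name": "example-project",
    "scripts": {
      "test": "mocha test/"
    },
    "devDependencies": {
      "mocha": "^3.2.0",
      "chai": "^3.5.0"
    }
  }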

2) Quality of tests (RQ2): To address RQ2, for each subject with tests we compute four quality metrics, as follows:

Code coverage. Coverage is generally known as an indicator of test quality. We compute statement, branch, and function coverage for JavaScript code using JSCover [5] (for tests that run in the browser) and Istanbul [3]. To calculate coverage of minified JavaScript code, we beautify it prior to executing the tests. We also exclude dependencies, such as files under the node_modules directory, and libraries (unless the subject system is itself a library).

Average number of assertions per test. Code coverage does not directly imply test suite effectiveness [24], while assertions have been shown to be strongly correlated with it [49]. Thus, TESTSCANNER also computes the average number of assertions per test case as a test suite quality metric. Our analysis tool detects usage of well-known assertion libraries such as assert.js, should.js, expect.js, and chai.

Test code ratio. This metric is defined as the ratio of test SLOC to production and test SLOC. A program with a high test code ratio may have a higher quality test suite.

Test commit ratio. This metric is the ratio of test commits to total commits. A higher test commit ratio may indicate more mature and higher quality tests. We assume that every commit that touches at least one file in a folder named test, tests, spec, or specs is a test commit. In the rare cases where tests are stored elsewhere, such as the root folder, we manually extract the number of test commits by looking at the project's GitHub repository page and counting commits on test files.

We investigate these quality metrics with respect to subject categories, client/server-side code, and testing frameworks.
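For illustration, the following is a sketch of a typical Mocha test case using the chai assertion library; the module under test (list) and its behaviour are hypothetical. TESTSCANNER would count it as one test case containing two assertions:

  const { expect } = require('chai');
  const list = require('../src/list');   // hypothetical module under test

  describe('list.add()', function () {
    it('appends an item and updates the length', function () {
      const l = list.create();
      l.add('foo');
      expect(l.items).to.deep.equal(['foo']);  // assertion 1
      expect(l.length).to.equal(1);            // assertion 2
    });
  });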
3) (Un)covered code (RQ3): Code coverage is a widely accepted test quality indicator; thus, finding the root cause of why a particular statement is not covered by a test suite can help in writing higher quality tests. Some generic possible cases for an uncovered (missed) statement s are as follows:

1) s belongs to an uncovered function f, where
   a) f has no calling site in either the production or the test code. In this case, f could be (1) a callback function sent to a callback-accepting function (e.g., setTimeout()) that was never invoked, or (2) an unused utility function that was meant to be used in previous or future releases. Such unused code can be considered a code smell [28]. Consequently, we cannot pinpoint such an uncovered function to a particular cause.
   b) the calling site for f in the production code was never executed. This can happen if (1) f is used as a callback (e.g. event-dependent or asynchronous) that was never invoked, (2) the call to f was never reached because of an earlier return statement or an exception, or the function call falls in a never-met condition branch.
   c) f is an anonymous function. Possible reasons that f was not covered can be that (1) f is used as a callback that was never invoked (e.g. an event-dependent callback whose required event was not triggered, or an asynchronous callback for whose response the test did not wait), (2) f is a self-invoking function that was never executed, or (3) f is assigned to a variable and that variable was never used or its usage was not executed.
2) s belongs to a covered function f, where
   a) the execution of f was terminated, by a return statement or an exception, prior to reaching s; or
   b) s falls in a never-met condition in f (e.g. browser- or DOM-dependent statements).
3) The test case responsible for covering s was not executed due to a test execution failure.
4) s is dead (unreachable) code.

Uncovered statement in uncovered function ratio. If an uncovered statement s belongs to an uncovered function f, making f called could possibly cover s as well. This is important especially if f needs to be called in a particular way, such as through triggering an event. In this regard, our tool uses coverage report information (in json or lcov format) to calculate the ratio of the uncovered statements that fall within uncovered functions over the total number of uncovered statements. If this value is large, it indicates that the majority of uncovered statements belong to uncovered functions, and thus code coverage could be increased to a high extent if the enclosing functions were called by a test case.
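The following is a minimal sketch of how such a ratio could be derived from an Istanbul-style JSON coverage report. It is an illustrative approximation, not the TESTSCANNER implementation, and it assumes that fnMap[id].loc spans the entire body of each function in the report:

  const fs = require('fs');

  // Ratio of uncovered statements that fall inside never-executed functions.
  function usufRatio(coverageFile) {
    const report = JSON.parse(fs.readFileSync(coverageFile, 'utf8'));
    let uncovered = 0;
    let uncoveredInUncoveredFn = 0;

    for (const file of Object.keys(report)) {
      const { s, statementMap, f, fnMap } = report[file];
      // Line ranges of functions that were never executed.
      const deadRanges = Object.keys(f)
        .filter((id) => f[id] === 0)
        .map((id) => fnMap[id].loc);

      for (const id of Object.keys(s)) {
        if (s[id] > 0) continue;                 // statement was covered
        uncovered++;
        const loc = statementMap[id];
        const inDeadFn = deadRanges.some(
          (r) => loc.start.line >= r.start.line && loc.end.line <= r.end.line
        );
        if (inDeadFn) uncoveredInUncoveredFn++;
      }
    }
    return uncovered === 0 ? 0 : uncoveredInUncoveredFn / uncovered;
  }

  console.log(usufRatio('coverage/coverage-final.json'));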

Hard-to-test JavaScript code. Some JavaScript features, such as DOM interactions, event-dependent callbacks, asynchronous callbacks, and closures (hidden scopes), are considered to be harder to test [1], [2], [12], [11], [29], [44]. In this section we explain four main kinds of hard-to-test code, with an example code snippet depicted in Figure 2. We also refine the statement and function coverage metrics to investigate these hard-to-test constructs separately and in detail. To measure these coverage metrics, TESTSCANNER maps a given coverage report to the locations of hard-to-test code.

 1 function setFontSize(size){
 2   return function() {
 3     // this is an anonymous closure
 4     document.body.style.fontSize = size + 'px';
 5   };
 6 }
 7 var small = setFontSize(12);
 8 var large = setFontSize(16);
 9 ...
10 function showMsg() {
11   // this is an async callback
12   alert("Some message goes here!");
13 }
14 ...
15 $("#smallBtn").on("click", small);
16 $("#largeBtn").on("click", large);
17 $("#showBtn").on("click", function() {
18   // this is an event-dependent anonymous callback
19   setTimeout(showMsg, 2000);
20   $("#photo").fadeIn("slow", function() {
21     // this is an anonymous callback
22     alert("Photo animation complete!");
23   });
24 });
25 ...
26 checkList = $("#checkList");
27 checkList.children("input").each(function () {
28   // this is DOM-related code
29   if ($(this).is(':checked')) {
30     ...
31   } else {
32     ...
33   }
34 });

Fig. 2: A hard-to-test JavaScript code snippet.

DOM related code coverage. In order to unit test JavaScript code with DOM read/write operations, a DOM instance has to be provided as a test fixture in the exact structure expected by the code under test. Otherwise, the test case can terminate prematurely due to a null exception. Writing such DOM-based fixtures can be challenging due to the dynamic nature of JavaScript and the hierarchical structure of the DOM [29]. For example, to cover the if branch at line 29 in Figure 2, one needs to provide a DOM instance containing a #checkList element whose input children include a checked checkbox. To cover the else branch, a DOM instance with an unchecked checkbox child is required. If such fixtures are not provided, $("#checkList") does not find the expected element, and thus checkList.children causes a null exception and the test case terminates.

DOM related code coverage is defined as the fraction of covered over total DOM related statements. A DOM related statement is a statement that can affect or be affected by DOM interactions, such as a DOM API usage. To detect DOM related statements, TESTSCANNER extracts all DOM API usages in the code (e.g. getElementById, createElement, appendChild, addEventListener, $, and innerHTML) and their forward slices. Forward slicing is applied to the variables that were assigned a DOM element or attribute. For example, the forward slice of checkList at line 26 in Figure 2 is lines 27-34. A DOM API could be located in (1) a return statement of a function f, (2) a conditional statement, (3) a function call (as an argument), (4) an assignment statement, or (5) other parts within a scope. In case (1), all statements that call f are considered DOM related. In case (2), the whole conditional statement (the condition and its body) is considered DOM related. In case (3), the statements in the called function that use the DOM input are considered DOM related. In the other cases, the statement containing the DOM API is DOM related.
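A minimal sketch (not the authors' setup) of a Mocha test that provides the kind of DOM fixture needed to execute the DOM-dependent code of Figure 2 is shown below. It assumes the jsdom and jquery npm packages are available; the fixture mirrors the structure that lines 26-34 expect:

  const assert = require('assert');
  const { JSDOM } = require('jsdom');

  describe('checkList processing (Figure 2, lines 26-34)', function () {
    it('covers the if branch at line 29 with a checked checkbox fixture', function () {
      const dom = new JSDOM('<ul id="checkList"><input type="checkbox" checked></ul>');
      const $ = require('jquery')(dom.window);   // jQuery bound to the fixture window

      const checkList = $('#checkList');
      let checkedCount = 0;
      checkList.children('input').each(function () {
        if ($(this).is(':checked')) {             // the condition from line 29
          checkedCount++;
        }
      });
      assert.strictEqual(checkedCount, 1);
    });
  });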
Event-dependent callback coverage. The execution of some JavaScript code may require triggering an event, such as clicking on a particular DOM element. For instance, it is very common in client-side JavaScript code to have an (anonymous) function bound to an element's event, e.g. a click, which has to be simulated. The anonymous function in lines 17-24 is an event-dependent callback function. Such callback functions are only passed and invoked if the corresponding event is triggered. In order to trigger an event, testers can use methods such as jQuery's .trigger(event, data, ...) or .emit(event, data, ...) of the Node.js EventEmitter. Note that if an event needs to be triggered on a DOM element, a proper fixture is required, otherwise the callback function cannot be executed.

Event-dependent callback coverage is defined as the fraction of covered over total event-dependent callback functions. In order to detect event-dependent callbacks, our tool checks whether a callback function is passed to an event method such as bind, click, focus, hover, keypress, emit, addEventListener, onclick, onmouseover, or onload.

Asynchronous callback coverage. Callbacks are functions passed as an argument to another function, to be invoked either immediately (synchronous) or at some point in the future (asynchronous) after the enclosing function returns. Callbacks are particularly useful to perform non-blocking operations. Function showMsg in lines 10-13 is an asynchronous callback function as it is passed to the asynchronous setTimeout() API call. Testing asynchronous callbacks requires waiting until the callback is called, otherwise the test would probably finish unsuccessfully before the callback is invoked. For instance, QUnit's asyncTest allows tests to wait for asynchronous callbacks to be called.

Asynchronous callback coverage is defined as the fraction of covered over total asynchronous callback functions. Similar to a study of callbacks in JavaScript [21], if a callback argument is passed into a known deferring API call, we count it as an asynchronous callback. TESTSCANNER detects a set of asynchronous APIs including network calls (e.g. XMLHTTPRequest.open), DOM events (e.g. onclick), timers (setImmediate, setTimeout, setInterval, and process.nextTick), and I/O (e.g. APIs of fs, http, and net).
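To make these two challenges concrete, the following Mocha sketch shows one test that waits for an asynchronous callback via done() and one that explicitly emits an event so that its handler runs. The emitter, delay, and handlers are illustrative, not taken from the studied subjects:

  const assert = require('assert');
  const EventEmitter = require('events');

  describe('asynchronous and event-dependent callbacks', function () {
    it('covers a callback scheduled with setTimeout', function (done) {
      let called = false;
      setTimeout(function showMsg() {   // asynchronous callback under test
        called = true;
        assert.strictEqual(called, true);
        done();                         // the test finishes only after the callback ran
      }, 10);
    });

    it('covers an event-dependent callback by emitting the event', function () {
      const button = new EventEmitter();
      let clicks = 0;
      button.on('click', function () { clicks++; });  // event-dependent callback
      button.emit('click');                           // trigger the event explicitly
      assert.strictEqual(clicks, 1);
    });
  });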
Closure function coverage. Closures are nested functions that make it possible to create a hidden scope that shields variables and functions from the global scope in JavaScript. A closure function, i.e., the inner function, has access to all parameters and variables (except for this and arguments) of the outer function, even after the outer function has returned [20]. The anonymous function in lines 2-5 is an instance of a closure.

Such hidden functions cannot be called directly in a test case and thus testing them is challenging. In fact, writing a unit test for a closure function without code modification is impossible. Simple solutions such as making them public or putting the test code inside the closure are not good software engineering practices. One approach to test such private functions is to add code inside the closure that stores references to its local variables and functions inside objects and returns them to the outer scope [2]. Closure function coverage is defined as the fraction of covered over total closure functions.

Average number of function calls per test. Some code functionalities depend on the execution of a sequence of function calls. For instance, in a shopping application, one needs to add items to the cart prior to checking out. We perform a correlation analysis between the average number of unique function calls per test and code coverage. We also investigate whether JavaScript unit tests are mostly written at the single-function level or whether they execute sequences of function calls.

As a hedged sketch of the closure-testing approach cited above (returning references to closure internals to the outer scope [2]), the hypothetical module below exposes its private function through a test-only hook; the names counter and __testHooks are illustrative:

  var counter = (function () {
    var count = 0;                    // private state in the closure
    function increment(by) {          // private function, normally unreachable from tests
      count += by;
      return count;
    }
    return {
      tick: function () { return increment(1); },
      // Exposed purely for testing; gives tests a handle on the hidden scope.
      __testHooks: { increment: increment, reset: function () { count = 0; } }
    };
  })();

  // A test can now exercise the private function directly:
  // assert.strictEqual(counter.__testHooks.increment(5), 5);

III. RESULTS

A. Prevalence of Tests (RQ1)

The stacked bar charts in Figure 3(a) depict the percentage of JavaScript tests per system category (Table I), per client/server side, and in aggregate; the height of each bar indicates the percentage of subjects in that category. In total, among the 373 studied subjects, 83 (i.e., 22%) do not have JavaScript tests. The majority (78%) of subjects have at least one test case.

Finding 1: 22% of the subject systems that we studied do not have any JavaScript test, and 78% have at least one test case.

As shown in Figure 3(b), amongst subjects with tests, the majority of tests are written in Mocha (38%), Jasmine (19%), and QUnit (18%). 6% do not follow any particular framework and have their own tests. Less commonly used frameworks are Tap (5%), Tape (4%), Nodeunit (3%), Vows (3%), and others (4%), including Jest, Evidence.js, Doh, CasperJS, Ava, UTest, TAD, and Lab. We also observe that 3 repositories have tests written in two testing frameworks: 2 projects (server and client-server) with Nodeunit+Mocha tests, and one (client-server) with Jasmine+QUnit tests.

Finding 2: The most prevalently used test frameworks for JavaScript unit testing are Mocha (38%), Jasmine (19%), and QUnit (18%).

Fig. 3: Distribution of JavaScript tests: (a) distribution within all subjects; (b) testing frameworks distribution.

Fig. 4 panels: (a) number of stars (quartiles 1-4K, 4K-5.6K, 5.6K-8.9K, 8.9K-92K); (b) number of watchers (1-151, 151-262, 262-444, 444-6K); (c) number of commits (1-251, 254-701, 710-1.8K, 1.8K-27.6K); (d) number of contributors (1-19, 19-46, 47-102, 102-1.4K).

Fig. 4: Percentage of subjects with tests per quartile with respect to popularity (number of stars and watchers) and maturity (number of commits and contributors).

We also investigate the prevalence of UI tests and observe that only 12 projects (i.e., 3%) among all 373 have UI tests, of which 9 are written using Webdriverio and Selenium webdriver, and 3 use CasperJS. 7 of these projects are client and server side, 3 are client-side, and 2 are server-side. One of these subjects does not have any JavaScript unit test.

Finding 3: Only 3% of the studied repositories have functional UI tests.

Almost all (95%) of the purely server-side JavaScript projects have tests, while this holds for 61% of client-side and 76% of client&server-side ones. Note that the number of subjects in each category is not very different (i.e., 128 client-side, 130 server-side, and 115 client and server-side). Interestingly, the distribution of test frameworks looks very similar for client-side and client-server side projects.

As shown in Figure 3(a), all subject systems in categories C6 (parsers and compilers), C12 (I/O), C13 (package and build managers), C14 (storage), C19 (MVC frameworks), and C17 (CLI and shell) have JavaScript unit tests. Projects in all of these categories, except for C19, are mainly server-side, as depicted in Figure 1. In contrast, many of the subjects in categories C1 (UI components), C3 (web apps), C8 (touch and drag&drop), and C18 (multimedia), which are mainly client-side, do not have tests. Thus we can deduce that JavaScript tests are written more often for server-side code than for client-side, or client and server-side, code.

Finding 4: While almost all subjects (95%) in the server-side category have tests, about 40% of subjects in the client-side and client-server side categories do not have tests.

We believe the higher prevalence of tests for server-side code can be attributed to (1) the difficulties in testing client-side code, such as writing proper DOM fixtures or triggering events on DOM elements, and (2) the use of time-saving test scripts in most Node.js based projects, such as the npm test script that is included by default when initializing a new package.json file. This pattern is advocated in the Node.js community [16], and thus many server-side JavaScript projects, such as NPM modules, have test code.

(a) Statement coverage. (b) Branch coverage. (c) Function coverage.

Fig. 5: Boxplots of the code coverage of the executed JavaScript tests. Mean values are shown with (*).

We also consider how the popularity (number of stars and watchers) and maturity (number of commits and contributors) of subject systems relate to the prevalence of unit tests. Figure 4 shows the percentage of subjects with tests in each quartile. As popularity and maturity increase, the percentage of subjects with tests increases as well.

Fig. 6: Average number of assertions per test.

B. Quality of Tests (RQ2)

Code coverage. Calculating code coverage requires executing tests on a properly deployed project. In our study, however, we faced a number of projects with failures in building, deploying, or running tests. We tried to resolve such problems by quick changes in build/task configuration files or by retrieving a later version (i.e., some days after fetching the previous release). In most cases the build failure was due to errors in dependent packages or their absence. We could finally calculate coverage for 231 out of 290 (about 80%) subjects with tests. We could not properly deploy or run tests for 44 subject systems (41 with test run failures, freezes, or breaks, and 3 with build and deployment errors), and could not get coverage reports for 15 projects with complex test configurations.

Boxplots in Figure 5 show that in total the tests have a median of 83% statement coverage, 84% function coverage, and 69% branch coverage. Tests for server-side code have higher coverage in all aspects compared to those for client-side code.

We narrow down our coverage analysis to the different subject categories. As depicted in Table II, subjects in categories C6 (parsers and compilers), C10 (network and async), C12 (I/O), C13 (package and build managers), C14 (storage), C15 (testing frameworks), and C19 (MVC frameworks) on average have higher code coverage. Projects in these categories are mainly server-side. In contrast, subjects in categories C2 (visualization), C3 (web apps), C8 (touch and drag&drop), C11 (game engines), C17 (CLI and shell), and C18 (multimedia) have lower code coverage. Note that the subjects in these categories are mainly client-side.

Finding 5: The studied JavaScript tests have a median of 83% statement coverage, 84% function coverage, and 69% branch coverage. Tests for server-side code have higher coverage in all aspects compared to those for client-side code.

Table II also depicts the achieved coverage per testing framework. Tests written in Tape, Tap, and Mocha generally have higher code coverage. The majority of server-side JavaScript projects are tested using these frameworks. On the other hand, tests written in QUnit, which is used more often for the client-side than the server-side, generally have lower code coverage. Developers that use their own style of testing without popular frameworks write tests with the poorest coverage.

Finding 6: Tests written in the Tape, Tap, and Mocha frameworks generally have higher coverage compared to those written in QUnit or Nodeunit and those written without using any test framework.

Average number of assertions per test. Figure 6 depicts boxplots of the average number of assertions per test case. While the median values are very similar (about 2.2) in all cases, server-side code has a slightly higher mean value (3.16) compared to client-side (2.71). As shown in Table II, subjects in categories C3 (web apps), C11 (game engines), C15 (testing frameworks), C17 (CLI and shell), C18 (multimedia), and C19 (MVC frameworks) on average have a higher average number of assertions per test compared to the others. Interestingly, among these categories only C15 and C19 also have high code coverage, while it is low for the rest.

Finding 7: The studied test suites have a median of 2.19 and a mean of 2.96 for the average number of assertions per test. These values do not differ much between server-side and client-side code.
Also results shown in Table II indicate that tests written in QUnit, Tape, Nodeunit, other frameworks (e.g. Jest, CasperJS, TABLE II: Test quality metrics average values. Statement Branch Function Ave # Test Test coverage coverage coverage assertions code commit per test ratio ratio C1 77% 57% 76% 2.83 0.41 0.16 C2 67% 52% 65% 2.72 0.28 0.14 C3 60% 38% 58% 3.75 0.88 0.14 C4 79% 68% 78% 2.50 0.58 0.24 C5 75% 63% 75% 2.53 0.52 0.21 C6 87% 79% 88% 2.53 0.47 0.24 C7 80% 67% 72% 2.51 0.46 0.22 C8 64% 47% 60% 2.04 0.35 0.12 C9 73% 58% 69% 2.67 0.49 0.23 C10 91% 79% 90% 2.73 0.72 0.24 C11 64% 45% 57% 3.41 0.18 0.11 C12 90% 77% 89% 2.36 0.59 0.20

Subject category C13 86% 67% 84% 2.27 0.59 0.18 C14 88% 77% 87% 2.74 0.62 0.26 C15 81% 69% 79% 5.79 0.59 0.25 C16 78% 67% 79% 1.67 0.49 0.29 C17 67% 54% 63% 8.32 0.47 0.21 C18 60% 31% 62% 4.42 0.31 0.16 C19 81% 67% 80% 3.58 0.53 0.21 Mocha 82% 70% 79% 2.39 0.49 0.20 Jasmine 74% 60% 75% 1.93 0.41 0.21 QUnit 71% 54% 71% 3.93 0.41 0.16 Own test 61% 41% 58% 5.99 0.30 0.16 Tap 89% 80% 89% 1.56 0.58 0.21 Tape 93% 81% 94% 2.93 0.70 0.18 Others 80% 65% 77% 5.60 0.46 0.24

Testing framework Nodeunit 74% 63% 72% 6.20 0.57 0.24 Vows 74% 66% 72% 1.92 0.55 0.27 Client 70% 53% 70% 2.71 0.36 0.16 Server 85% 74% 83% 3.16 0.58 0.23 C&S 72% 56% 70% 2.9 0.4 0.18 Total 78% 64% 76% 2.96 0.46 0.2

Also, the results shown in Table II indicate that tests written in QUnit, Tape, Nodeunit, other frameworks (e.g. Jest, CasperJS, and UTest), and those written without using a framework have on average more assertions per test. The majority of server-side JavaScript projects are tested using these frameworks. Again we observe that only for tests written in the Tape framework is code coverage also high, while it is low for the rest.

Test code ratio. Figure 7 shows the test to total (production and test) code ratio comparison. The median and mean of this ratio are about 0.6 for server-side projects and about 0.35 for client-side ones. As shown in Table II, on average the subjects with a higher test code ratio belong to categories C3, C4, C5, C10, C12, C13, C14, C15, and C19, while those in C2, C8, C11, and C18 have a lower test code ratio. Also, tests written in Tap, Tape, Nodeunit, and Vows have a higher test code ratio, while tests written without using any framework have a lower test code ratio.

Fig. 7: Test to total code ratio.

We further study the relationship between test code ratio and total code coverage (average of statement, branch, and function coverage) through Spearman's correlation analysis (the non-parametric Spearman's correlation coefficient measures the monotonic relationship between two continuous random variables and does not require the data to be normally distributed). The result shows that there exists a moderate to strong correlation (ρ = 0.68, p = 0) between test code ratio and code coverage.

Finding 8: Tests for server-side code have a higher test code ratio (median and mean of about 0.6) compared to client-side code (median and mean of about 0.35). Also, there exists a moderate to strong correlation (ρ = 0.68, p = 0) between test code ratio and code coverage.

Test commit ratio. Figure 8 depicts the test to total (production and test) commit ratio comparison. The median and mean of this ratio are about 0.25 for server-side projects and about 0.15 for client-side ones. As shown in Table II, on average the subjects with a higher test commit ratio belong to categories C4, C6, C9, C10, C14, C15, and C16, while those in C1, C2, C3, C8, C11, and C18 have a lower test commit ratio.

Fig. 8: Test to total commits ratio.

Subject category C13 84% 71% 74% 60% 86% 85% 2.86 0.49 C14 87% 66% 36% 89% 88% – 2.98 0.62 C15 79% 70% 39% 62% 81% 58% 2.16 0.59 C16 79% 40% 5% 43% 78% 48% 2.69 0.39 C17 63% 7% 5% 56% 67% – 2.42 0.65 C18 62% – 0% 89% 60% 40% 2.19 0.86 C19 81% 61% 47% 76% 82% 62% 2.92 0.53 Mocha 79% 50% 34% 71% 82% 58% 3.62 0.56 Jasmine 75% 65% 34% 69% 74% 62% 2.28 0.71 QUnit 71% 53% 28% 76% 71% 68% 3.35 0.66 Own test 58% 45% 26% 66% 61% 51% 1.78 0.63 Tap 89% 68% 87% 94% 89% – 2.52 0.24 Tape 94% 79% 65% 92% 93% 88% 3.19 0.22 Others 77% 33% 30% 66% 80% 79% 2.14 0.48

Testing framework Nodeunit 72% 53% 63% 74% 74% 52% 4.08 0.62 Vows 72% 60% 38% 79% 74% 0% 1.60 0.6 Client 70% 46% 25% 69% 70% 66% 2.96 0.68 Server 83% 64% 48% 82% 85% 67% 3.19 0.45 C&S 70% 48% 29% 69% 72% 57% 2.93 0.69 Total 76% 53% 36% 74% 78% 63% 3.05 0.57

C11, and C18 have lower test commit ratio. Also tests written f, making f called could possibly cover c as well. As in Nodeunit, Vows, and other frameworks (e.g. Jest, CasperJS, described in Section II-B3, we calculate the ratio of uncovered and UTest) have higher test commit ratio while tests written in statements that fall within uncovered functions over the total QUnit or without using any framework have lower test commit number of uncovered statements. ratio. Table III shows average values for this ratio (USUF). The Similar to the correlation analysis for test code ratio, we mean value of USUF ratio is 0.57 in total, 0.45 for server- study the relationship between test commit ratio and total code side projects, and about 0.7 for client-side ones. This indicate coverage. The result indicates that there exists a moderate to that the majority of uncovered statements in client-side code low correlation (ρ = 0.49, p = 0) between test commit ratio belong to uncovered functions, and thus code coverage could and code coverage. be increased to a high extent if the enclosing function could Finding 9: While test commit ratio is relatively high for be called during test execution. server-side projects (median and mean of about 0.25), it is Finding 10: A large portion of uncovered statements fall moderate in total and relatively low for client-side projects in uncovered functions for client-side code (about 70%) (median and mean of about 0.15). Also there exists a compared to server-side code (45%). moderate to low correlation (ρ = 0.49, p = 0) between test commit ratio and code coverage. Hard-to-test-function coverage. We measure coverage for hard-to-test functions as defined in Section II-B3. While the average function coverage in total is 76%, the average C. (Un)covered Code (RQ3) event-dependent callback coverage is 36% and the average As explained earlier in Section II-B3, one possible root asynchronous callback coverage is 53%. The average value of cause for uncovered code is that the responsible test code closure function coverage in total is 74% and for server-side was not executed. In our evaluation, however, we observed subjects is 82% while it is 69% for client-side ones. that for almost all the studied subjects, test code had very Finding 11: On average, JavaScript tests have low coverage high coverage meaning that almost all statements in test code for event-dependent callbacks (36%) and asynchronous call- were executed properly. Thus the test code coverage does not backs (53%). Average values for client-side code are even contribute in the low coverage of production code. worse (25% and 46% respectively). The average, closure Uncovered statement in uncovered function (USUF) ratio. function coverage is 74%. If an uncovered code c belongs to an uncovered function We measure the impact of tests with event triggering meth- Node.js community [16]. To assist developers with testing ods on event-dependent callback coverage, and writing async their JavaScript code, we believe that it is worthwhile for the tests on asynchronous callback coverage through correlation research community to invest on developing test generation analysis. The results show that there exists a weak correlation techniques in particular for the client-side code, such as [33], (ρ = 0.22) between number of event triggers and event- [29], [32]. 
dependent callback coverage, and a very weak correlation (ρ = For RQ2, the results indicate that in general, tests written 0.1) between number of asynchronous tests and asynchronous for mainly client-side subjects in categories C2 (visualization), callback coverage. C8 (touch and drag&drop), C11 (game engines), and C18 Finding 12: There is no strong correlation between number (multimedia) have lower quality. Compared to the client-side of event triggers and event-dependent callback coverage. projects, tests written for the server-side have higher quality Also number of asynchronous tests and asynchronous call- in terms of code coverage, test code ratio, and test commit back coverage are not strongly correlated. ratio. The branch coverage in particular for client-side code is low, which can be ascribed to the challenges in writing tests This was contrary to our expectation for higher correlations, for DOM related branches. We investigate reasons behind the however, we observed that in some cases asynchronous tests code coverage difference in Section III-C. The higher values and tests that trigger events were written to merely target spe- for test code ratio and test commit ratio can also be due to the cific parts and functionalities of the production code without fact that writing tests for server-side code is easier compared covering most asynchronous or event-dependent callbacks. to client-side. DOM related code coverage. On average, JavaScript tests Developers and testers could possibly increase code cover- have a moderately low coverage of 63% for DOM-related age of their tests by using existing JavaScript test generator code. We also study the relationship of existence of DOM tools, such as Kudzu [41], ARTEMIS [19], JALANGI [42], fixtures and DOM related code coverage through correlation SymJS [27], JSEFT [32], and CONFIX [29]. Tests written in analysis. The result shows that there exists a correlation of ρ Mocha, Tap, Tape, and Nodeunit generally have higher test = 0.4, p = 0 between having DOM fixtures in tests and DOM quality compared to other frameworks and tests that do not related code coverage. Similar to the cases for event-dependent use any testing framework. In fact developers that do not write and async callbacks, we also observed that DOM fixtures were their test by leveraging an existing testing framework write mainly written for executing a subset of DOM related code. low quality tests almost in all aspects. Thus we recommend JavaScript developers community to use a well-maintained and Finding 13: On average, JavaScript tests lack proper cov- mature testing framework to write their tests. erage for DOM-related code (63%). Also there exists a As far as RQ3 is concerned, our study shows that JavaScript moderately low correlation (ρ = 0.4) between having DOM tests lack proper coverage for event-dependent callbacks, asyn- fixtures in tests and DOM related code coverage. chronous callbacks, and DOM-related code. Since these parts of code are hard to test they can be error prone and thus Average number of function calls per test. As explained requires effective targeted tests. For instance a recent empirical in Section II-B3, we investigate number of unique function study [36] reveals that the majority of reported JavaScript bugs calls per test. The average number of function calls per test and the highest impact faults are DOM-related. 
has a mean value of about 3 in total and also across server- It is expected that using event triggering methods in tests, in- side and client-side code. We further perform a correlation crease coverage for event-dependent callbacks, asynchronous analysis between the average number of function calls per test callbacks, and DOM-related statements. However, our results and total code coverage. The result shows that there exists a do not show a strong correlation to support this. Our manual weak correlation (ρ = 0.13, p = 0) between average number analysis revealed that tests with event triggering methods, of function calls per test and code coverage. async behaviours, and DOM fixtures are mainly written to Finding 14: On average, there are about 3 function calls cover only particular instances of event-dependent callbacks, to production code per test case. The average number of asynchronous callbacks, or DOM-related code. This again can function calls per test is not strongly correlated with code imply difficulties in writing tests with high coverage for such coverage. hard-to-test code. We believe that there is a research potential in this regard D. Discussion for proposing test generation techniques tailored to such uncovered parts. While most current test generation tools for Implications. Our findings regarding RQ1 indicate that the JavaScript produce tests at single function level, in practice majority (78%) of studied JavaScript projects and in particular developers often write tests that invoke about 3 functions per popular and trending ones have at least one test case. This test on average. It might also worth for researchers to develop indicates that JavaScript testing is getting attention, however, test generation tools that produce tests with a sequence of it seems that developers have less tendency to write tests for function calls per test case. client-side code as they do for the server-side code. Possible Finally, we observed that UI tests are much less prevalent reasons could be difficulties in writing proper DOM fixtures in the studied JavaScript projects. Our investigation of the or triggering events on DOM elements. We also think that coverage report did not show a significant coverage increase the high percentage of test for server-side JavaScript can on the uncovered event-dependent callbacks or DOM-related be ascribed to the testing pattern that is advocated in the code between UI and unit tests. Since UI tests do not need DOM fixture generation, they should be able to trigger more vulnerabilities in JavaScript have also been studied on remote of the UI events, compared to code level unit tests. It would be JavaScript inclusions [35], [47], cross-site scripting (XSS) interesting to further investigate this in JavaScript applications [46], and privacy violating information flows [25]. Milani Fard with large UI tests. et al. [28] studied code smells in JavaScript code. Nguyen et Test effectiveness. Another test quality metric that is interest- al. [34] performed usage patterns mining in JavaScript web ing to investigate is test effectiveness. An ideal effective test applications. suite should fail if there is a defect in the code. Mutation score, Researchers also studied test cases and mining test suites i.e., the percentage of killed mutants over total non-equivalent in the past. Inozemtseva et al. [24] found that code coverage mutants, is often used as an estimate of defect detection does not directly imply the test suite effectiveness. 
Zhang et capability of a test suite. In fact it has been shown that there al. [49] analyzed test assertions and showed that existence of exists a significant correlation between mutant detection and assertions is strongly correlated with test suite effectiveness. real fault detection [26]. In this work, however, we did not Vahabzadeh et al. [45] studied bugs in test code. Milani Fard consider mutation score as a quality metric as it was too costly et al. proposed Testilizer [30] that mines information from to generate mutants for each subject and execute the tests existing test cases to generate new tests. Zaidman et al. [48] on each of them. We believe that it is worthwhile to study investigated co-evolution of production and test code. the effectiveness of JavaScript tests using mutation testing These work, however, did not study JavaScript tests. Related techniques, such as Mutandis [31], which guides mutation to our work, Mirshokraie et al. [31] presented a JavaScript generation towards parts of the code that are likely to affect mutation testing approach and as part of their evaluation, the program output. This can help to find out which aspects assessed mutation score for test suites of two JavaScript of code are more error-prone and not well-tested. Apart from libraries. To the best of our knowledge, our work is the first test quality evaluation based on mutation score, studying (large scale) study on JavaScript tests and in particular their JavaScript bug reports [37] and investigating bug locations, quality and shortcomings. can give us new insights for developing more effective test V. CONCLUSIONSAND FUTURE WORK generation tools. JavaScript is heavily used to build responsive client-side Threats to validity. With respect to reproducibility of the web applications as well as server-side projects. While some results, our tool and list of the studied subjects are publicly JavaScript features are known to be hard to test, no empirical available [18]. Regarding the generalizability of the results to study was done earlier towards measuring the quality and other JavaScript projects, we believe that the studied set of coverage of JavaScript tests. This work presents the first empir- subjects is representative of real-world JavaScript projects as ical study of JavaScript tests to characterize their prevalence, they differ in domain (category), size (SLOC), maturity (num- quality metrics, and shortcomings. ber of commits and contributors), and popularity (number of We found that a considerable number of JavaScript projects stars and watchers). With regards to the subject categorization, do not have any tests and this is in particular for projects with we used some existing categories proposed by JSter Catalog client-sideJavaScript code. On the other hand, almost all purely [14] and GitHub Showcases [13]. server-side JavaScript projects have tests and the quality of There might be case that TESTSCANNER cannot detect those tests are higher compared to client-side tests. On average, a desired pattern in the code as it performs complex static JavaScript tests lack proper coverage for event-dependent code analysis for detecting DOM-related statements, event- callbacks, asynchronous callbacks, and DOM-related code. dependent callbacks, and asynchronous APIs. 
To mitigate this The results of this study can be used to improve JavaScript threat, we made a second pass of manual investigation through test generation tools in producing more effective test cases that such code patterns using grep with regular expressions in target hard-to-test portions of the code. We also plan to evalu- command line and manually validated random cases. Such ate effectiveness of JavaScript test by measuring their mutation a textual search within JavaScript files through grep was score, which reveals the quality of written assertions. Another especially done for a number of projects with parsing errors in possible direction could be designing automated JavaScript their code for which TESTSCANNER cannot generate a report code refactoring techniques towards making the code more or the report would be incomplete. Since our tool statically testable and maintainable. analyzes test code to compute the number of function calls per test, it may not capture the correct number of calls that ACKNOWLEDGMENT happen during execution. While dynamic analysis could help This work was supported by the National Science and with this regard, it can not be used for the unexecuted code Engineering Research Council of Canada (NSERC) through and thus is not helpful to analyze uncovered code. its Strategic Project Grants programme and Alexander Graham IV. RELATED WORK Bell Canada Graduate Scholarship.

There are number of previous empirical studies on REFERENCES JavaScript. Ratanaworabhan et al. [38] and Richards et al. [40] [1] Examples of hard to test JavaScript. https://www.pluralsight.com/blog/ studied JavaScript’s dynamic behavior and Richards et al. [39] software-development/6-examples-of-hard-to-test-. analyzed security issues in JavaScript projects. Ocariza et al. [2] How to unit test private functions in JavaScript. https://philipwalton. [37] performed study to characterize root causes of client- com/articles/how-to-unit-test-private-functions-in-javascript/. [3] Istanbul - a JS code coverage tool written in JS. https://github.com/ side JavaScript bugs. Gallaba et al. [21] studied the use of gotwarlost/istanbul. callback in client and server-side JavaScript code. Security [4] Jasmine. https://github.com/pivotal/jasmine. [5] Jscover. http://tntim96.github.io/JSCover/. Testing, Verification and Validation (ICST). IEEE Computer Society, [6] Mocha. https://mochajs.org/. 2013. [7] Mozilla Rhino. https://github.com/mozilla/rhino. [32] S. Mirshokraie, A. Mesbah, and K. Pattabiraman. Jseft: Automated [8] Nodeunit. https://github.com/caolan/nodeunit. JavaScript unit test generation. In Proceedings of the International [9] QUnit. http://qunitjs.com/. Conference on Software Testing, Verification and Validation (ICST), page [10] Which JavaScript test library should 10 pages. IEEE Computer Society, 2015. you use? http://www.techtalkdc.com/ [33] S. Mirshokraie, A. Mesbah, and K. Pattabiraman. Atrina: Inferring which-javascript-test-library-should-you-use-qunit-vs-jasmine-vs-mocha/. unit oracles from GUI test cases. In Proceedings of the International [11] Writing testable code in JavaScript: A brief overview. https://www. Conference on Software Testing, Verification, and Validation (ICST), toptal.com/javascript/writing-testable-code-in-javascript. page 11 pages. IEEE Computer Society, 2016. [12] Writing testable JavaScript. http://www.adequatelygood.com/ [34] H. V. Nguyen, H. A. Nguyen, A. T. Nguyen, and T. N. Nguyen. Writing-Testable-JavaScript.. Mining interprocedural, data-oriented usage patterns in JavaScript web [13] Github Showcases. https://github.com/showcases, 2014. applications. In Proceedings of the 36th International Conference on [14] JSter JavaScript Libraries Catalog. http://jster.net/catalog, 2014. Software Engineering, pages 791–802. ACM, 2014. [15] Most depended-upon NMP packages. https://www.npmjs.com/browse/ [35] N. Nikiforakis, L. Invernizzi, A. Kapravelos, S. Van Acker, W. Joosen, depended, 2014. C. Kruegel, F. Piessens, and G. Vigna. You are what you include: large- [16] Testing and deploying with ordered npm run scale evaluation of remote JavaScript inclusions. In Proceedings of the scripts. http://blog.npmjs.org/post/127671403050/ 2012 ACM conference on Computer and communications security, pages testing-and-deploying-with-ordered-npm-run-scripts, 2015. 736–747. ACM, 2012. [17] SLOC (source lines of code) counter. https://github.com/flosse/sloc/, [36] F. Ocariza, K. Bajaj, K. Pattabiraman, and A. Mesbah. An empirical 2016. study of client-side JavaScript bugs. In Proceedings of the Interna- [18] TestScanner. https://github.com/saltlab/testscanner, 2016. tional Symposium on Empirical Software Engineering and Measurement [19] S. Artzi, J. Dolby, S. Jensen, A. Møller, and F. Tip. A framework for (ESEM), pages 55–64. IEEE Computer Society, 2013. automated testing of JavaScript web applications. In Proceedings of the [37] F. Ocariza, K. Bajaj, K. Pattabiraman, and A. 
Mesbah. A study of causes International Conference on Software Engineering (ICSE), pages 571– and consequences of client-side JavaScript bugs. IEEE Transactions on 580. ACM, 2011. Software Engineering (TSE), page 17 pages, 2017. [20] D. Crockford. JavaScript: the good parts. O’Reilly Media, Incorporated, [38] P. Ratanaworabhan, B. Livshits, and B. G. Zorn. JSMeter: Comparing 2008. the behavior of JavaScript benchmarks with real web applications. [21] K. Gallaba, A. Mesbah, and I. Beschastnikh. Don’t call us, we’ll In Proceedings of the 2010 USENIX Conference on Web Application call you: Characterizing callbacks in JavaScript. In Proceedings of the Development, WebApps’10, pages 3–3, Berkeley, CA, USA, 2010. ACM/IEEE International Symposium on Empirical Software Engineering USENIX Association. and Measurement (ESEM), pages 247–256. IEEE Computer Society, [39] G. Richards, C. Hammer, B. Burg, and J. Vitek. The eval that men do. 2015. In ECOOP 2011–Object-Oriented Programming, pages 52–78. Springer, [22] GitHut. A small place to discover languages in GitHub. http://githut.info, 2011. 2016. [40] G. Richards, S. Lebresne, B. Burg, and J. Vitek. An analysis of [23] P. Heidegger and P. Thiemann. Contract-driven testing of Javascript the dynamic behavior of JavaScript programs. In Conference on code. In Proceedings of the 48th International Conference on Objects, Programming Language Design and Implementation (PLDI), pages 1– Models, Components, Patterns, TOOLS’10, pages 154–172. Springer- 12. ACM, 2010. Verlag, 2010. [41] P. Saxena, D. Akhawe, S. Hanna, F. Mao, S. McCamant, and D. Song. [24] L. Inozemtseva and R. Holmes. Coverage is not strongly correlated with A symbolic execution framework for JavaScript. In Proceedings of the test suite effectiveness. In Proceedings of the International Conference Symposium on Security and Privacy, pages 513–528. IEEE Computer on Software Engineering (ICSE), 2014. Society, 2010. [25] D. Jang, R. Jhala, S. Lerner, and H. Shacham. An empirical study of [42] K. Sen, S. Kalasapur, T. Brutch, and S. Gibbs. Jalangi: A selective privacy-violating information flows in JavaScript web applications. In record-replay and dynamic analysis framework for JavaScript. In Proceedings of the 17th ACM conference on Computer and communi- Proceedings of the 9th Joint Meeting on Foundations of Software cations security, pages 270–283. ACM, 2010. Engineering, ESEC/FSE, pages 488–498. ACM, 2013. [26] R. Just, D. Jalali, L. Inozemtseva, M. D. Ernst, R. Holmes, and G. Fraser. [43] Stack Overflow. 2016 Developer Survey. http://stackoverflow.com/ Are mutants a valid substitute for real faults in software testing? In research/developer-survey-2016, 2016. Proceedings of the ACM SIGSOFT International Symposium on the [44] M. E. Trostler. Testable JavaScript. O’Reilly Media, Incorporated, 2013. Foundations of Software Engineering (FSE), FSE 2014, pages 654–665, [45] A. Vahabzadeh, A. Milani Fard, and A. Mesbah. An empirical study of New York, NY, USA, 2014. ACM. bugs in test code. In Proceedings of the International Conference on [27] G. Li, E. Andreasen, and I. Ghosh. SymJS: Automatic symbolic testing Software Maintenance and Evolution (ICSME), pages 101–110. IEEE of JavaScript web applications. In Proceedings of the ACM SIGSOFT Computer Society, 2015. International Symposium on the Foundations of Software Engineering [46] J. Weinberger, P. Saxena, D. Akhawe, M. Finifter, R. Shin, and D. Song. (FSE), page 11 pages. ACM, 2014. 
An empirical analysis of XSS sanitization in web application frame- [28] A. Milani Fard and A. Mesbah. JSNose: Detecting JavaScript code works. Electrical Engineering and Computer Sciences University of smells. In Proceedings of the International Conference on Source Code California at Berkeley, Technical Report, pages 1–17, 2011. Analysis and Manipulation (SCAM), pages 116–125. IEEE Computer [47] C. Yue and H. Wang. Characterizing insecure JavaScript practices on the Society, 2013. web. In Proceedings of the International World Wide Web Conference [29] A. Milani Fard, A. Mesbah, and E. Wohlstadter. Generating fixtures for (WWW), pages 961–970. ACM, 2009. JavaScript unit testing. In Proceedings of the IEEE/ACM International [48] A. Zaidman, B. van Rompaey, S. Demeyer, and A. van Deursen. Mining Conference on Automated Software Engineering (ASE), pages 190–200. software repositories to study co-evolution of production and test code. IEEE Computer Society, 2015. In Proceedings of the International Conference on Software Testing, [30] A. Milani Fard, M. Mirzaaghaei, and A. Mesbah. Leveraging existing Verification and Validation (ICST), pages 220–229, 2008. tests in automated test generation for web applications. In Proceedings [49] Y. Zhang and A. Mesbah. Assertions are strongly correlated with test of the IEEE/ACM International Conference on Automated Software suite effectiveness. In Proceedings of the joint meeting of the European Engineering (ASE), pages 67–78. ACM, 2014. Software Engineering Conference and the ACM SIGSOFT Symposium [31] S. Mirshokraie, A. Mesbah, and K. Pattabiraman. Efficient JavaScript on the Foundations of Software Engineering (ESEC/FSE), pages 214– mutation testing. In Proc. of the International Conference on Software 224. ACM, 2015.