
Automated Test Cases Prioritization for Self-driving Cars in Virtual Environments

CHRISTIAN BIRCHLER, Zurich University of Applied Science, Switzerland
SAJAD KHATIRI, Zurich University of Applied Science & Software Institute - USI, Lugano, Switzerland
POURIA DERAKHSHANFAR, Delft University of Technology, Netherlands
SEBASTIANO PANICHELLA, Zurich University of Applied Science, Switzerland
ANNIBALE PANICHELLA, Delft University of Technology, Netherlands

Testing with simulation environments helps to identify critical failing scenarios in emerging autonomous systems such as self-driving cars (SDCs) and is safer than in-field operational tests. However, these tests are very expensive and are too many to be run frequently within limited time constraints. In this paper, we investigate test case prioritization techniques to increase the ability to detect SDC regression faults with virtual tests earlier. Our approach, called SDC-Prioritizer, prioritizes virtual tests for SDCs according to static features of the roads used within the driving scenarios. These features are collected without running the tests and do not require past execution results. SDC-Prioritizer utilizes meta-heuristics to prioritize the test cases using diversity metrics (black-box heuristics) computed on these static features. Our empirical study conducted in the SDC domain shows that SDC-Prioritizer doubles the number of safety-critical failures that virtual tests can detect at the same level of execution time compared to baselines: random and greedy-based test case orderings. Furthermore, this meta-heuristic search performs statistically better than both baselines in terms of detecting safety-critical failures. SDC-Prioritizer effectively prioritizes test cases for SDCs with a large improvement in fault detection, while its overhead (up to 0.34% of the test execution cost) is negligible.

CCS Concepts: • Software and its engineering → Search-based software engineering; Software testing and debugging.

Additional Key Words and Phrases: Autonomous Systems, Software Simulation, Test Case Prioritization

ACM Reference Format: Christian Birchler, Sajad Khatiri, Pouria Derakhshanfar, Sebastiano Panichella, and Annibale Panichella. 2018. Automated Test Cases Prioritization for Self-driving Cars in Virtual Environments. Proc. ACM Meas. Anal. Comput. Syst. 37, 4, Article 111 (August 2018), 22 pages. https://doi.org/10.1145/1122445.1122456

1 INTRODUCTION
Self-driving cars (SDCs) are autonomous systems that collect, analyze, and leverage sensor data from the surrounding environment to control physical actuators at run-time [3, 13]. Testing automation for SDCs is vital to

Authors’ addresses: Christian Birchler, Zurich University of Applied Science, Switzerland, [email protected]; Sajad Khatiri, Zurich University of Applied Science & Software Institute - USI, Lugano, Switzerland, [email protected]; Pouria Derakhshanfar, Delft University of Technology, Netherlands, [email protected]; Sebastiano Panichella, Zurich University of Applied Science, Switzerland, [email protected]; Annibale Panichella, Delft University of Technology, Netherlands, [email protected].

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]. © 2018 Association for Computing Machinery. 2476-1249/2018/8-ART111 $15.00 https://doi.org/10.1145/1122445.1122456

ensure their safety and reliability [42, 43], but it presents several limitations and drawbacks: (i) the limited ability to repeat tests under the same conditions due to ever-changing environmental factors [43]; (ii) the difficulty of testing the systems in safety-critical scenarios (to avoid irreversible damages caused by dreadful outcomes) [36, 40, 63]; (iii) not being able to guarantee the system’s reliability in its operational design domain due to a lack of testing under a wide range of execution conditions [42]. The usage of virtual simulation environments addresses several of the challenges above for SDC testing practices [1, 14, 15, 23]. Hence, simulation environments are used in industry in multiple development stages of Cyber-physical Systems (CPSs) [60], including model (MiL), software (SiL), and hardware in the loop (HiL). As a consequence, multiple open-source and commercial simulation environments have been developed for SDCs, which can be more effective and safer than traditional in-field testing methods [4]. Adequate testing for SDCs requires writing (either manually or assisted by generation tools [2, 30]) a very large number of driving scenarios (test cases) to assess that the system behaves correctly in many possible critical and corner cases. The long running time of simulation-based tests and the large size of the test suites make regression testing particularly challenging for SDCs [28, 67]. In particular, regression testing requires running the test suite before new software releases, to assess that the applied software changes do not impact the behavior of the unchanged parts [52, 68]. The goal of this paper is to investigate black-box test case prioritization (TCP) techniques for SDCs. TCP methods sort (prioritize) the test cases with the aim of running the fault-revealing tests as early as possible [68]. While various black-box heuristics have been proposed for traditional systems and CPSs, they cannot be applied to SDCs as is. Black-box approaches for “traditional” systems sort the tests based on their diversity, computed on the values of the input parameters [44] and the sequence of method calls [17]. However, SDC simulation scenarios (e.g., with road shape, weather conditions) do not consist of sequences of method calls as in traditional tests [2, 30]. Approaches targeting CPSs measure test distance based on signals [9] and fault-detection capability [12]. However, this data is unknown up-front without running the SDC tests. The main challenges to address when designing black-box TCP methods for SDCs concern (i) the definition of features that can characterize SDC safety-critical scenarios in virtual tests; and (ii) the design of optimization algorithms that successfully prioritize the test cases based on the selected features. Therefore, to address these challenges, we formulated the following research questions:

• RQ1: What features to consider for prioritizing safety-critical tests in SDCs in virtual environments? We identify 16 static features for driving scenarios in SDC virtual tests, such as the length of the road, the number of left and right turns, etc. These features are extracted from the test scenarios prior to their execution. Hence, we introduce SDC-Prioritizer, a TCP approach based on Genetic Algorithms (GA) that prioritizes test cases of SDCs by leveraging these features.

• RQ2: What is the cost-effectiveness of SDC-Prioritizer compared to baseline approaches?

To answer RQ2, we conducted an empirical study with three different datasets (with BeamNG.research [14]) composed of test scenarios that target the lane-keeping features of SDCs. In this context, fault-revealing tests are virtual test scenarios in which a self-driving car would not respect the lane tracking safety requirement [31]. We targeted BeamNG.research [14] (detailed in Section 2) as a reference simulation environment, which has been recently used in the Search-Based Software Testing (SBST) tool competition1 [53]. The test scenarios for this environment have been produced with AsFault [30], an open-source project that generates test cases to assess SDC behavior (detailed in Section 2).

1 https://sbst21.github.io/tools/


By comparing SDC-Prioritizer with two baselines (namely, random search and the greedy algorithm) on these three benchmarks, we analyze the performance of our technique in terms of its ability to detect more faults while incurring a lower test execution cost. Finally, we assess whether SDC-Prioritizer can be used in practical settings, i.e., whether it adds too large a computational overhead to the regression testing process:

• RQ3: What is the overhead introduced by SDC-Prioritizer?
The results of our empirical study show that SDC-Prioritizer reduces the time required to identify more safety-critical scenarios (on average 17%). It also shows that our approach leads to an increase in detected faults (about 100%) in the first 20% of the test execution time compared to random test case orders (“default” baselines for search-based approaches [60, 68]) and the greedy algorithm for TCP. Furthermore, SDC-Prioritizer does not introduce significant computational overhead in the SDC simulation process, which is of critical importance to SDC development in industrial settings. The contributions of this paper are summarized as follows: (1) We introduce static features that can be used to characterize safe and unsafe test scenarios prior to their execution in the SDC domain. (2) We introduce SDC-Prioritizer, a black-box TCP approach that leverages Genetic Algorithms to achieve cost-effective regression testing with SDC tests in virtual environments. (3) A comprehensive and publicly available replication package available on Zenodo [19], including all data used to run the experiments as well as the prototype of SDC-Prioritizer, to help other researchers reproduce the study results.
Paper Structure. In Section 2 we summarize the related work, while in Section 3 we outline the approach we have designed and implemented to answer our research questions. In Section 4 we present our methodology and the empirical studies performed to answer our research questions. In Section 5 we report the study results, while in Section 6 we detail the threats to validity of our study. Finally, Section 7 concludes our study, outlining future directions.

2 BACKGROUND AND RELATED WORK
This section discusses the literature concerning (i) test prioritization approaches in traditional systems; and (ii) studies closely related to test prioritization practices in the context of CPSs. Finally, the section describes the background on the SDC virtual environment adopted in this study.

2.1 Test Prioritization
Approaches aiming at reducing the cost of regression testing can be classified into three main categories [69]: test suite minimization [55], test case selection [18], and test case prioritization [56]. Test suite minimization approaches tackle the regression problem by removing test cases that are redundant according to selected testing criteria (e.g., branch coverage). Test case selection aims to select a subset of the test suite according to the software changes, coverage criteria, and execution cost. Test case prioritization, which is the main focus of our paper, sorts the test cases to maximize some desired properties (e.g., code coverage, requirement coverage) that lead to detecting regression faults as early as possible. A complete overview of regression testing approaches can be found in the survey by Yoo and Harman [69].
2.1.1 Prioritization heuristics. Approaches proposed in the literature to guide the prioritization of the test cases can be grouped into white-box and black-box heuristics [69]. White-box test case prioritization uses past coverage data (e.g., branch, line, and function coverage) and iteratively selects the test cases that contribute to maximizing the chosen code coverage metrics.


Black-box prioritization techniques rely on diversity metrics and prioritize the most diverse test cases within the test suites (e.g., [5, 27, 44]). Widely-used diversity metrics include the input and output set diameter [27], or the Levenshtein distance computed on the input data [44] and method sequences [17]. Further heuristics include topic modeling [64] or models of the system [37]. Miranda et al. [51] proposed fast methods to speed up the pair-wise distance computation, namely shingling and locality-sensitive hashing. Recently, Henard et al. [38] empirically compared many white-box and black-box prioritization techniques. Their results showed a large overlap between the regression faults that can be detected by the two categories of techniques and that black-box techniques are highly recommended when the source code is not available [38], e.g., in the case of third-party components. Cyber-physical systems (including SDCs) are typical instances of systems with many third-party components [60]. Prioritization heuristics for CPSs differ from those used for traditional software [8]. We elaborate in more detail on the related work on test case prioritization for CPSs in Section 2.2.

2.1.2 Optimization algorithms. Given a set of heuristics (either white-box or black-box), optimization algorithms are applied to find a test case order that optimizes the chosen heuristics. As shown by Yoo and Harman [69], test case prioritization (and regression testing in general) is inherently a multi-objective problem because test quality (e.g., code coverage, input diversity) and execution resources are conflicting in nature. The challenge is choosing balanced trade-offs that favor lower execution cost over higher code coverage or test diversity depending on the time constraints and resource availability (e.g., in continuous delivery or integration servers). Cost-cognizant greedy algorithms are well-known deterministic algorithms introduced for the set-cover problem and adapted to regression testing [18]. The greedy algorithm first selects the test case with the most code coverage (white-box) or the most diverse one (black-box). Then, the algorithm iteratively selects the test case that increases coverage the most or that is the most diverse w.r.t. previously selected test cases [69]. Meta-heuristics have been shown to be very competitive, sometimes outperforming greedy algorithms [45, 48, 52, 64]. Marchetto et al. [48] used multi-objective genetic algorithms to optimize trade-offs between cumulative code coverage, cumulative requirement coverage, and execution cost. Besides, genetic algorithms have been widely used to optimize test case diversity [64] for black-box TCP. This paper uses both greedy and genetic algorithms to prioritize simulation-based test cases for self-driving cars. This is because both types of algorithms have been shown to outperform their counterparts in different domains and programs [12, 45].

2.2 Regression Testing for CPSs
Regression testing is particularly critical for CPSs, which are characterized by interactions with simulation and hardware environments. Testing with simulation environments is a de facto standard for CPSs, and it is typically performed at three different levels [49]: MiL, SiL, and HiL. During model in the loop (MiL), the controller (cars) and the environments (e.g., roads) are both represented by models, and testing aims to assess the correctness of the control algorithms. During software in the loop (SiL), the controller model is replaced by its actual code (software), and this testing phase aims to assess the correctness of the software and its conformance to the model used in the MiL. Finally, during hardware in the loop (HiL), the controller is fully deployed while the simulation is performed with real-time computers that simulate the physical signals. The testing phase for the HiL aims to assess the integration of hardware and software in more realistic environments [49]. Regression testing for CPSs is more challenging as the execution time of the test cases is much longer due to the simulation [12]. Hence, researchers have proposed different regression testing techniques that are specific to CPSs. Shin et al. [61] proposed a bi-objective approach based on genetic algorithms to prioritize acceptance tests for a satellite system. Their approach prioritizes the test cases according to the hardware damage risks they can expose (first objective) while maximizing the number of test cases that can be executed within a given time

budget (second objective). Arrieta et al. [12] used both greedy algorithms and meta-heuristics to prioritize test cases for CPS product lines and with different test levels. In further studies, Arrieta et al. [10] focused on multiple objectives to optimize for both test case generation and test case prioritization for CPSs. The objectives include requirement coverage, test case similarity, and test execution times. While test similarity for non-CPS systems is computed based on the lexicographic similarity of the method calls and test input, Arrieta et al. measured the similarity between the test cases based on the signal values for all the states in the simulation-based test case. Test case similarity computed at the signal level has also been investigated in the context of test case selection for CPSs [9, 11].

Our paper differs from the papers above w.r.t. the application domain and the optimization objectives. In particular, we focus on prioritizing simulation-based test cases to assess the lane-keeping features of self-driving cars. Instead, prior work focused on different domains, such as satellites [60], electric windows [9], industrial tanks [10, 12], and cruise controllers [10]. In our context, test cases consist of driving test scenarios with virtual roads (e.g., see Figure 1) and aim at assessing whether the simulated cars violate the lane-keeping requirements. Another important difference is related to the objectives (or heuristics) to optimize for regression testing. Prior work for CPSs prioritizes the test cases based on fault-detection capabilities [12] and diversity measured on simulation signals [9–11]. However, the fault-detection capability of the test cases is unknown a priori (i.e., without running the tests). Signal analysis requires knowing the states of the simulated objects in each simulated time step, which is also unknown before the actual simulation. Furthermore, a driving scenario (in our context) is not characterized by signals but only by the initial state of the car and the actual characteristics (e.g., shape) of the roads. Hence, we define features and diversity metrics that consider only the (static) characteristics of the roads that are used for the simulation. Unlike fault-detection capability and signals, our features can be derived from the driving scenario before the actual test execution.

2.3 Background on SDC Simulation
2.3.1 Main simulation approaches. Simulation environments have been developed to support developers in various stages of design and validation. In the SDC domain, developers rely mainly on basic simulation models [35, 62], rigid-body [46, 70], and soft-body simulations [29, 54]. Basic simulation models, such as MATLAB/Simulink models [35, 62], implement fundamental signals but target mostly non-real-time executions and generally lack photo-realism. Consequently, while they are utilized for model-in-the-loop simulations and Hardware/Software co-design, they are rarely used for integration and system-level software testing. Rigid-body simulations approximate the physics of static bodies (or entities), i.e., by modeling them as undeformable bodies. Basic simulation bodies consist of three-dimensional objects such as cylinders, boxes, and convex meshes [2]. Soft-body simulations can simulate deformable and breakable objects and fluids; hence, they can be used to model a wide range of simulation scenarios. Specifically, the finite element method (FEM) is the main approach for solid body simulations, while the finite volume method (FVM) and finite difference method (FDM) are the main strategies for simulating fluids [50].
Rigid-body vs. soft-body simulations. Both rigid- and soft-body simulations can be effectively combined with powerful rendering engines to implement photo-realistic simulations [14, 15, 23, 66]. However, soft-body simulations can simulate a wider variety of physical phenomena compared to rigid-body simulations. Soft-body simulations are a better fit for implementing safety-critical scenarios (e.g., car incidents [29]), in which a high simulation accuracy is of key importance. In the following, we describe the soft-body environment we used in our research investigation, i.e., BeamNG [14].


2.3.2 BeamNG & AsFault. Creating adequate test scenario suites for SDCs is a hard and laborious task. To tackle this issue, Gambi et al. [32] developed a tool called AsFault [30] to automatically generate driving scenarios for testing SDCs. AsFault combines procedural content generation and search-based testing in order to automatically create challenging virtual scenarios for testing the lane-keeping behavior of SDC software. AsFault generates virtual roads which can cause the ego-car (the simulated car controlled by the SDC software under test) to drive off the lane. The virtual roads are generated inside a driving simulator called BeamNG [14], which can generate photo-realistic, but synthetic, images of roads. Given such characteristics, BeamNG [14] has also been used as the main simulation platform in the 2021 edition of the SBST tool competition [53]. To evaluate the criticality of generated test cases, the road networks are instantiated in a driving simulation, during which the ego-car is instructed to reach a target location following a navigation path selected by AsFault. During the simulation, AsFault traces the position of the ego-car at regular intervals such that it can identify Out of Bound Episodes (OBEs), i.e., lane departures. An out of bound incident is defined as “the case when the car went more than two meters out of the lane center”. In our experiments, we use this information to label test scenarios as safe (causing no OBEs) or unsafe (causing at least one OBE). Figure 1 illustrates a sample test scenario generated and executed by AsFault [31]. It includes the start and target points for the ego-car on the map, the whole road network, the selected driving path (colored in yellow), and the detected OBE locations during execution of the scenario by the ego-car.
2.3.3 SDC Software Use-cases. AsFault supports two SDC software systems as test subjects while generating test cases, which we use to generate our test suites.
• BeamNG.AI.2 BeamNG.research ships with a driving AI that we refer to as BeamNG.AI. BeamNG.AI can be parameterized with an “aggression” factor which controls the amount of risk the driver takes in order to reach the destination faster. BeamNG.research developers state that low aggression factors result in smooth driving whereas high aggression factors lead the car to edgy driving and might cut corners [32].
• Driver.AI.3 Driver.AI is a trajectory planner shipped with AsFault. AsFault leverages an extension of Driver.AI which monitors the quality of its predictions at run-time. If Driver.AI’s driving predictions are not satisfactory (the ego-car goes out of bound), control is passed over to BeamNG.AI until Driver.AI’s prediction quality rises back to acceptable values and the ego-car regains the center of the lane.

3 APPROACH
This section describes the investigated test scenario features and the prioritization strategies introduced by SDC-Prioritizer and a greedy algorithm in the SDC domain.

3.1 SDC Road Features
In the context of SDCs, we target the definition of features (or metrics) that characterize SDC tests in virtual environments according to the following requirements: (1) the features can be extracted before the actual execution of the virtual tests; and (2) the features can characterize (or identify) safe and unsafe scenarios without executing them. In the following, we describe how the SDC features have been designed and measured considering BeamNG as the targeted SDC virtual environment. In the context of BeamNG, it is possible to compute static features concerning the actual road characteristics of SDC virtual tests. Specifically, as illustrated in Figure 1, each virtual test scenario generated by AsFault (virtual roads) consists of multiple nodes and their connections (i.e., road segments) forming a so-called road network, along with the start and destination points and the driving path of the ego-car. This allows us to compute what we

2 https://wiki.beamng.com/Enabling_AI_Controlled_Vehicles#AI_Modes
3 https://github.com/alessiogambi/AsFault/blob/asfault-deap/src/asfault/drivers.py


Fig. 1. Sample driving scenarios generated by AsFault [31]
call Road Features, i.e., features or characteristics of the road that will be used during the simulation within the BeamNG virtual environment. From the road data reported by AsFault, we extract various features for each test scenario (as described in the following paragraph), and we investigate ways to leverage these features to determine the criticality of the test scenarios (as described in Section 4).
Road Features extraction. To extract the features corresponding to each of the generated test scenarios, we leverage the JSON file generated as output by AsFault. These files, as explained before, consist of multiple nodes and their connections, and form a road network, with the start and destination points and the driving path of the ego-car. Hence, we extract two sets of road features: the general road characteristics and the road segment statistics. The general road characteristics are attributes that refer to the road as a whole, e.g., the direct distance and road length between the start and destination points, and the total number of turns to the left or right. For each road segment (see Figure 1), we can extract individual metrics such as the road angle and pivot radius. For the segment statistics features, we apply aggregation functions (e.g., minimum, maximum, average) on these individual segment metrics for all road segments in the scenario path; a sketch of this extraction step is shown below. Table 1 reports the features extracted from the original fields in the AsFault JSON (i.e., features F1-F16), specifying the description, type, and expected range of values for each feature. In the next sections, we describe how the designed features are used as inputs to test case prioritization strategies.
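To make the extraction step concrete, the sketch below computes the two groups of features from a parsed scenario. It is an illustrative reconstruction only: the field names used here (segments, angle, pivot_off, start, end, length) are assumptions, since the exact AsFault JSON schema is not reproduced in this paper.

```python
# Illustrative sketch only: the real AsFault JSON schema differs, so the field
# names used here (segments, angle, pivot_off, start, end, length) are assumptions.
import math
import statistics


def extract_road_features(scenario):
    """Compute general road characteristics and segment statistics (cf. Table 1)."""
    segments = scenario["segments"]            # hypothetical list of road segments
    angles = [abs(s["angle"]) for s in segments if s["angle"] != 0]
    pivots = [s["pivot_off"] for s in segments if s["angle"] != 0]
    start, end = scenario["start"], scenario["end"]   # (x, y) coordinates

    features = {
        "direct_distance": math.dist(start, end),                      # F1
        "road_distance": sum(s["length"] for s in segments),           # F2
        "num_l_turns": sum(1 for s in segments if s["angle"] > 0),     # F3
        "num_r_turns": sum(1 for s in segments if s["angle"] < 0),     # F4
        "num_straights": sum(1 for s in segments if s["angle"] == 0),  # F5
        "total_angle": sum(angles),                                    # F6
    }
    # Segment statistics: aggregation functions over per-segment metrics (F7-F16).
    for name, values in (("angle", angles), ("pivot_off", pivots)):
        features[f"median_{name}"] = statistics.median(values)
        features[f"std_{name}"] = statistics.pstdev(values)
        features[f"max_{name}"] = max(values)
        features[f"min_{name}"] = min(values)
        features[f"mean_{name}"] = statistics.mean(values)
    return features
```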

3.2 Black-box Genetic Algorithm
Several prior studies have utilized evolutionary algorithms (particularly genetic algorithms) for test prioritization to reduce regression testing costs in different types of systems [45]. A typical Genetic Algorithm (GA) starts with generating a population of randomly generated individuals (box 1 in Figure 2). Each individual can be described as a sequence of parameters, called the chromosome, which encodes a potential solution to a given problem. This encoding can be performed in many forms (such as strings, binary vectors, etc.). After generating the first population, the algorithm computes one or more fitness values according to one or more fitness functions (depending on whether the problem is single- or multi-objective) (box 2 in Figure 2). Then, in the Selection phase (box 3 in Figure 2), a subset of individuals is selected according to their fitness values to be used as parents for generating the next population of individuals. Next, two genetic operators are applied to generate the next population from the selected parents: Crossover and Mutation. The former (box 4 in Figure 2) combines two parents to produce new individuals (the offspring). The latter (box 5 in Figure 2) alters one or

Proc. ACM Meas. Anal. Comput. Syst., Vol. 37, No. 4, Article 111. Publication date: August 2018. 111:8 • Christian Birchler, Sajad Khatiri, Pouria Derakhshanfar, Sebastiano Panichella, and Annibale Panichella

Table 1. Road Characteristics Features

ID | Feature | Description | Type | Range
F1 | Direct Distance | Euclidean distance between start and finish | float | [0-490]
F2 | Road Distance | Total length of the road | float | [56-3318]
F3 | Num. Left Turns | Number of left turns on the test track | int | [0-18]
F4 | Num. Right Turns | Number of right turns on the test track | int | [0-17]
F5 | Num. Straight | Number of straight segments on the test track | int | [0-11]
F6 | Total Angle | Total angle turned in road segments on the test track | int | [105-6420]
F7 | Median Angle | Median of angle turned in road segment on the test track | float | [30-330]
F8 | Std Angle | Standard deviation of angle turned in road segment on the test track | int | [0-150]
F9 | Max Angle | The maximum angle turned in road segment on the test track | int | [60-345]
F10 | Min Angle | The minimum angle turned in road segment on the test track | int | [15-285]
F11 | Mean Angle | The average angle turned in road segment on the test track | float | [5-47]
F12 | Median Pivot Off | Median of radius of road segment on the test track | float | [7-47]
F13 | Std Pivot Off | Standard deviation of radius of road segment on the test track | float | [0-23]
F14 | Max Pivot Off | The maximum radius of road segment on the test track | int | [7-47]
F15 | Min Pivot Off | The minimum radius of road segment on the test track | int | [2-47]
F16 | Mean Pivot Off | The average radius of road segment on the test track | float | [7-47]

Fig. 2. An overview of the Genetic Algorithm: (1) population initialization, (2) fitness evaluation, (3) selection, (4) crossover, (5) mutation, (6) reinsertion, until the search finishes.

more elements in the parents to generate a new individual. Finally, the newly generated individuals are saved in a new population (box 6 in Figure 2). The process of generating a new population of individuals from the previous one continues until either all of the search objectives are fulfilled or the algorithm reaches the maximum number of generations (iterations). This section introduces a black-box genetic algorithm, called SDC-Prioritizer, to prioritize the most diverse tests (according to their corresponding feature vectors) for self-driving cars. The detailed information regarding the encoding, operators, and fitness function used in SDC-Prioritizer is described as follows:


3.2.1 Encoding. Since the solution for test prioritization is an ordered sequence of tests, SDC-Prioritizer uses a permutation encoding. Assuming that, in our problem, we seek to order the execution of N tests, our approach encodes each chromosome as an N-sized array containing elements that indicate the position of a test in the order. For example, for N = 4, the chromosome ⟨3, 1, 4, 2⟩ represents one possible execution order of the four tests.

3.2.2 Partially-Mapped Crossover (PMX). In the crossover, an offspring $o$ is formed from two selected parents $p_1$ and $p_2$, both of size $N$, as follows: (i) select a random position $c$ in $p_1$ as the cut point; (ii) the first $c$ elements of $p_1$ become the first $c$ elements of $o$; (iii) extract the $N - c$ elements of $p_2$ that are not in $o$ yet and put them as the last $N - c$ elements of $o$.
3.2.3 SWAP Mutation. This mutation operator randomly selects two positions in a parent $p$ and swaps the index of the two genes (test case indexes in the order) to generate a new offspring. This procedure can be repeated one or more times according to the given mutation probability.
3.2.4 Selection. The selection is made using the roulette wheel selection [34], which assigns a selection probability to each of the individuals according to their fitness values (calculated by a fitness function). Assuming that our problem is a maximization problem, the selection probability of an individual $p_i$ is calculated as follows:

$$P(p_i) = \frac{f_i}{\sum_{j=1}^{N} f_j} \qquad (1)$$

where $N$ is the number of individuals in the population and $f_i$ is the fitness value of $p_i$. After allocating a selection probability to each individual, the algorithm randomly selects some individuals according to their selection chance. An individual with a lower fitness value has a lower allocated selection probability and thereby has a lower chance of being selected as a parent for the next generation.
3.2.5 Fitness function. Our goal is to (1) promote the diversity of the selected test cases and (2) minimize the execution cost. Hence, the ultimate goal is to run the most diverse tests within the given time constraints. We therefore define a fitness function that incorporates both test diversity and execution cost. More specifically, let $\tau = \langle t_1, \dots, t_n \rangle$ be a given test case ordering; its “fitness” (quality) is measured using the following equation:

$$\max f(\tau) = \sum_{i=2}^{n} \frac{diversity(t_i)}{cost(t_i) \times i} = \sum_{i=2}^{n} \frac{distance(t_i, t_{i-1})}{cost(t_i) \times i} \qquad (2)$$

where $n$ is the number of test cases; $t_i$ is the $i$-th test in the ordering $\tau$; $cost(t_i)$ is the execution cost (simulation time) of the test case $t_i$; and $distance(t_i, t_{i-1})$ measures the Euclidean distance between the test cases $t_i$ and $t_{i-1}$. In other words, each test case in position $i$ positively contributes to the overall fitness (to be maximized) based on its distance to the prior test $t_{i-1}$ in the order $\tau$. Since we want to have as many diverse tests as possible in the same amount of time, the diversity score of each test $t_i$ is divided by its execution cost (to be minimized) and its position $i$ in $\tau$. The factor $i$ in the denominator of Equation 2 promotes solutions where test cases with the best diversity-cost ratio are prioritized early, i.e., they appear early within the order $\tau$. The distance between two tests $t_i$ and $t_j$ is measured using the Euclidean distance computed on the feature vectors described in Table 1. It is important to highlight that the different features have different ranges and scales, as reported in Table 1. Hence, the distance values computed using the Euclidean distance might be biased toward the features with larger ranges. To remove this potential bias, we normalized the features using z-score normalization, which is a well-known method to address outliers and to re-scale a set of features with different


ranges and scales [33]. The z-score normalization re-scales each feature using the formula $(x - \mu)/\sigma$, where $x$ is the feature to re-scale, $\mu$ is its arithmetic mean, and $\sigma$ is the corresponding standard deviation [33]. The execution cost of each test case $t_i$ is estimated based on the past execution cost gathered from previous test runs, as recommended in the literature [25, 68]. This estimation is accurate for SDCs since the cost of running simulation-based tests is proportional to the length of the road and the cost of rendering the simulation, which are fixed simulation elements.
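To make Equation 2 concrete, the sketch below evaluates one candidate ordering on z-score-normalized feature vectors. It is a minimal illustration under our own assumptions (feature matrix layout, cost vector, 0-based indexing), not the SDC-Prioritizer implementation.

```python
# Minimal sketch of the fitness function in Equation 2, assuming `features`
# is an (n_tests x n_features) array and `costs` holds past execution times.
import numpy as np


def zscore(features):
    """Z-score normalization to avoid bias toward features with larger ranges."""
    return (features - features.mean(axis=0)) / features.std(axis=0)


def fitness(order, features, costs):
    """Cost-weighted diversity of a test ordering (higher is better)."""
    norm = zscore(features)
    total = 0.0
    for i in range(1, len(order)):            # i = 2..n in the paper's 1-based notation
        current, previous = order[i], order[i - 1]
        distance = np.linalg.norm(norm[current] - norm[previous])   # Euclidean distance
        total += distance / (costs[current] * (i + 1))              # divided by cost and position
    return total


# Example usage with synthetic data.
rng = np.random.default_rng(42)
features = rng.random((5, 16))    # 5 tests, 16 road features
costs = rng.uniform(30, 120, 5)   # estimated simulation times in seconds
print(fitness([0, 1, 2, 3, 4], features, costs))
```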

3.3 Black-box Greedy Algorithm
Greedy algorithms are well-known deterministic algorithms that iteratively build a solution (a test ordering) through greedy steps. Greedy algorithms have been widely used in regression testing for both white- and black-box test case prioritization [64, 68]. Hence, we adapt the greedy algorithm to our context using the set of features we have designed for SDCs (see Section 3.1). The greedy algorithm first computes the pairwise distance among all test cases in the given test suite. Similarly to the GA, the distance between two test cases $t_i$ and $t_j$ is computed as the Euclidean distance between the corresponding feature vectors. These features are normalized up-front using the z-score normalization, as done for the GA. Then, the greedy algorithm first selects the two most dissimilar test cases (according to their Euclidean distances) within the test suite. These tests are the first two test cases in the produced test order $\tau$. Then, the algorithm iteratively finds the test case (among the non-selected ones) with the largest average (mean) distance to the (already selected) test cases in $\tau$. This selection step corresponds to the greedy heuristic; if multiple tests have the same average dissimilarity to $\tau$, the tie is broken by randomly choosing one of the equally distant test cases. This process is repeated until all test cases are prioritized.
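A compact sketch of this greedy procedure is shown below. It is an illustrative reconstruction based on the description above (not the original implementation), and it assumes the feature vectors have already been z-score normalized.

```python
# Sketch of the black-box greedy prioritization described above; assumes
# `norm` is an (n_tests x n_features) array of z-score-normalized features.
import numpy as np


def greedy_prioritize(norm, rng=None):
    if rng is None:
        rng = np.random.default_rng()
    n = len(norm)
    # Pairwise Euclidean distances between all feature vectors.
    dist = np.linalg.norm(norm[:, None, :] - norm[None, :, :], axis=-1)
    # Start with the two most dissimilar test cases.
    first, second = np.unravel_index(np.argmax(dist), dist.shape)
    order, remaining = [first, second], set(range(n)) - {first, second}
    while remaining:
        candidates = sorted(remaining)
        # Mean distance of each remaining test to the already selected ones.
        mean_dist = dist[np.ix_(candidates, order)].mean(axis=1)
        best = np.flatnonzero(mean_dist == mean_dist.max())
        chosen = candidates[rng.choice(best)]   # break ties randomly
        order.append(chosen)
        remaining.remove(chosen)
    return order


# Example usage on a small synthetic, normalized feature matrix.
features = np.random.default_rng(0).random((6, 10))
print(greedy_prioritize((features - features.mean(0)) / features.std(0)))
```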

4 STUDY DESIGN
Study design overview. Our empirical study is steered by the following research questions:

• RQ1: What features to consider for prioritizing safety-critical tests in SDCs in virtual environments?
• RQ2: What is the cost-effectiveness of SDC-Prioritizer compared to baseline approaches?
• RQ3: What is the overhead introduced by SDC-Prioritizer?
In Section 3.1, we introduced multiple static features of virtual driving scenarios (see Table 1), some of which might be collinear or not useful for prioritizing test cases in a cost-effective way. Hence, our first research question (RQ1) aims to determine which features to consider, by leveraging statistical methods based on collinearity analysis [22, 71]. Our second research question (RQ2) aims to assess the extent to which test case orders produced by SDC-Prioritizer can detect more faults (effectiveness) and with lower execution cost (efficiency) with respect to a naive random search. Specifically, as elaborated in detail later, a random search is a critical baseline for search-based solutions since it is a “sanity check” that assesses whether more “sophisticated” techniques are needed for a given domain [60]. In RQ2, we also compare the internal search algorithms discussed in Section 3, namely the greedy algorithm and the genetic algorithm. With our last research question (RQ3), we want to measure the overhead required to prioritize SDC test cases in virtual environments with SDC-Prioritizer. This is an important aspect to investigate since a critical constraint in regression testing is that the cost of prioritizing test cases should be smaller than the time needed to run the test suite [68]. Therefore, fast approaches are fundamental from a practical point of view to enable rapid and continuous test iterations during SDC development [51].

4.1 Benchmark Datasets
The benchmark used in our study consists of three experiments performed on corresponding datasets. For each experiment, virtual test scenarios are generated and labeled as safe or unsafe by AsFault. As described in Table 2, the first experiment leverages a dataset (referred to as BeamNG.AI.AF1) that includes 1’178 virtual test scenarios


Table 2. Datasets composition

Dataset | Unsafe | Safe | Total | Running Time
BeamNG.AI.AF1 | 312 (26%) | 866 (74%) | 1’178 | 16h
BeamNG.AI.AF1.5 | 2’543 (45%) | 3’095 (55%) | 5’638 | 28h
Driver.AI | 1’045 (19%) | 4’585 (81%) | 5’630 | 106h

generated with respect to BeamNG.AI with an aggression factor set to 1. Since this is a cautious driving setup for BeamNG.AI, this dataset includes mostly safe scenarios, with about 26% of the scenarios being unsafe (causing OBEs). For the second experiment we created a new dataset (referred to as BeamNG.AI.AF1.5) where we configured BeamNG.AI to drive in a more aggressive driving style. This resulted in 5’638 test scenarios, among which 45% are unsafe. To increase the level of reliability and applicability of our results, we used another SDC driving AI, namely Driver.AI, to generate the dataset of our last experiment. This last experiment was needed because using test scenarios with Driver.AI allows drawing a direct comparison with BeamNG.AI and investigating whether the features we investigate are limited to BeamNG.AI or can be applied to other driving AIs. Thus, we used AsFault to re-run the test scenarios in BeamNG.AI.AF1.5 with Driver.AI, resulting in more cautious driving with only 19% of the scenarios being unsafe.

4.2 Analysis Method

4.2.1 RQ1: Feature Analysis. In a real scenario, we cannot determine a test’s safety without executing it. Hence, we do not include the feature that indicates whether a test is safe or unsafe in this research question. So, to answer our first research question, we analyze the orthogonality of the other 16 features introduced in Section 3.1. In particular, we use Principal Component Analysis (PCA) to statistically assess whether all features are useful for test case prioritization or whether certain features are multicollinear. A group of features is said to be collinear if they are linearly related and are implicit measures of the same phenomenon (road characteristics in our case). Addressing data collinearity is vital to avoid the distance measurements being skewed toward the collinear features [22]. Besides, distance metrics (including the Euclidean distance) might not truly represent the extent to which the data points (test cases) are diverse when using a large number of features [24]. PCA is a well-founded, analytical, and established technique that allows us to identify the orthogonal dimensions (principal components) in the data and measure the contributions of the different features to such components. Features that contribute to the same principal components are collinear and can be removed via dimensionality reduction. In particular, the PCA decomposes each dataset $M_{m \times n}$ (e.g., BeamNG.AI.AF1) into two matrices: $M_{m \times n} \approx S \ast V$. In this equation, $m$ is the number of test cases; $n$ is the number of original features; $k$ is the number of principal components; and $S$ denotes the features-to-component score matrix. More specifically, $S$ contains each feature’s scores (contributions) to the latent components identified by the PCA. In an ideal dataset with zero collinearity, the features should exclusively contribute to different principal components. PCA can be used not only to detect but also to alleviate collinearity via dimensionality reduction [24]. In particular, a lower-dimensional matrix can be obtained by choosing the top $h < k$ principal components and reconstructing the matrix as $M'_{m \times h} \approx S \ast V$. Notice that $M'$ will contain new (non-collinear) features that are built as combinations of the old ones. This process is widely known in machine learning as feature extraction [33]. To answer RQ1, we use PCA to detect (eventual) multicollinearity among the different road features. In case multicollinearity is detected, we use PCA for dimensionality reduction and feature extraction by selecting the top $k$ principal components corresponding to 98% of the original data variance, as recommended in the

literature [33]. The selected, relevant features in RQ1 (discussed in Section 5.1) are then considered to investigate RQ2 and RQ3.
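For illustration, the following sketch shows how this dimensionality-reduction step can be performed with scikit-learn, retaining the components that explain 98% of the variance. The input matrix here is synthetic and the snippet is not the authors' analysis script.

```python
# Illustrative dimensionality-reduction step (not the authors' script): keep the
# principal components explaining 98% of the variance of the z-scored features.
import numpy as np
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)
features = rng.random((1178, 16))          # placeholder for the 16 road features

scaled = StandardScaler().fit_transform(features)   # z-score normalization
pca = PCA(n_components=0.98)                         # retain 98% of the variance
reduced = pca.fit_transform(scaled)                  # the lower-dimensional M'

print(reduced.shape)                                 # (1178, h) with h <= 16
print(pca.explained_variance_ratio_.cumsum())        # cumulative importance of components
```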

4.2.2 RQ2: Cost-effectiveness of SDC-Prioritizer Compared to Baseline Approaches. To assess the effectiveness of the test case prioritization techniques introduced in this study, we look at the rate of fault detection (i.e., how fast faults are detected during the test execution process). Hence, a better technique provides a test execution order that detects more faults while executing fewer tests. To indicate the rate of fault detection in our evaluation, we use a well-known metric in test case prioritization, called the Cost-cognizant Average Percentage of Fault Detection (APFD_c) [25, 26, 41, 47, 57]. In this metric, a higher APFD_c means a higher fault detection rate. Since there is no technique for measuring fault severity in the SDC domain, we consider the same severity for all of the faults. Hence, in our case, APFD_c can be formally defined as follows:

$$APFD_c = \frac{\sum_{i=1}^{m} \left( \sum_{j=TF_i}^{n} t_j - \frac{1}{2} t_{TF_i} \right)}{\sum_{j=1}^{n} t_j \times m} \qquad (3)$$

where $T$ is the list of tests that need to be sorted for execution; $t_j$ is the execution time required to run the test positioned as the $j$-th test; $n$ and $m$ are the number of tests and faults, respectively; and $TF_i$ is the position, in the given test permutation, of the test that detects fault $i$. To draw a statistical comparison between SDC-Prioritizer, random search, and the greedy algorithm, we use the Vargha-Delaney $\hat{A}_{12}$ statistic [65] to assess the effect size of the differences between the APFD_c values achieved by these approaches. A value $\hat{A}_{12} > 0.5$ for a pair of factors (A, B) indicates that A has a higher fault detection rate, and vice versa. Furthermore, to examine whether the differences are statistically significant, we use the non-parametric Wilcoxon Rank Sum test, with $\alpha = 0.05$ for the Type I error.
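As an illustration of Equation 3 (assuming uniform fault severities), the following minimal sketch computes APFD_c for a toy permutation. The data structures are our own assumptions, not the study's evaluation harness.

```python
# Minimal sketch of APFD_c (Equation 3) with uniform fault severities.
def apfd_c(order, exec_times, faults_detected_by):
    """order: test indexes in execution order;
    exec_times[t]: execution cost of test t;
    faults_detected_by[f]: set of tests that detect fault f."""
    times_in_order = [exec_times[t] for t in order]
    total_cost = sum(times_in_order)
    m = len(faults_detected_by)
    numerator = 0.0
    for tests in faults_detected_by.values():
        # TF_i: 0-based position of the first test in the order that detects fault i.
        tf = min(order.index(t) for t in tests)
        numerator += sum(times_in_order[tf:]) - 0.5 * times_in_order[tf]
    return numerator / (total_cost * m)


# Toy example: 4 tests, 2 faults.
exec_times = {0: 60.0, 1: 45.0, 2: 90.0, 3: 30.0}
faults = {"f1": {2}, "f2": {1, 3}}
print(apfd_c([2, 3, 0, 1], exec_times, faults))
```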

4.2.3 RQ3: Overhead Introduced by SDC-Prioritizer. For RQ3, we monitor the running time needed by SDC-Prioritizer and the greedy algorithm to prioritize the test cases. This analysis aims to verify whether the extra overhead introduced by SDC-Prioritizer, on average, leads to a disruption in the testing process or is negligible compared to the total time needed to run the entire test suite. To have a more reliable estimation of the running time, we run SDC-Prioritizer 30 times using the parameter values discussed in Section 4.2.4. Then, we measure the overhead of the different algorithms as the average running time over the 30 runs.
4.2.4 Parameter setting. We used the default parameter values of the genetic algorithm as used in previous studies on TCP (e.g., [26, 52, 64]). In particular, we use the following parameter values (a sketch of the crossover and mutation operators with these settings is shown at the end of this subsection):
• Population size: we used a pool of 100 test permutations.
• Crossover operator: we used the partially-mapped crossover (PMX) for permutation problems (see Section 3.2) with a crossover probability $p_c = 0.80$. This corresponds to the default value in Matlab and it is in line with the recommended range $0.45 \le p_c \le 0.95$ [16, 20].
• Mutation operator: we used the SWAP mutation operator with a mutation probability $p_m = 1/n$, where $n$ is the number of test cases to prioritize. This choice is in line with recommendations from previous studies [16, 59] that showed that $p_m$ values proportional to the chromosome length produce better results.
• Stopping criterion: the search ends after 4000 generations. We opted for a larger number of generations compared to prior studies in test case prioritization (e.g., [12, 21, 45]) since the test suites in our benchmark are much larger than those used in prior TCP studies for traditional software (e.g., the programs in the SIR dataset [39]).
Notice that we did not fine-tune the parameters but opted for the default values. This choice is motivated by recent studies showing that default values are a reasonable choice in search-based software engineering [7, 58]. Indeed, parameter tuning is a quite laborious and expensive process that does not assure better performance


Table 3. Results of the Principal Component Analysis (PCA) for BeamNG.AI.AF1. Values in boldface indicate the features that contribute the most to the main components (Cs) extracted by PCA.

Features C1 C2 C3 C4 C5 C6 C7 C8 C9 C10 C11 C12 C13 C14 C15 C16
F1 -0.331 0.249 0.874 -0.068 0.218 -0.068 -0.074 -0.032 0.037 -0.003 -0.004 -0.001 0.003 0.017 0.007 -0.001
F2 0.223 -0.129 0.187 0.315 0.013 -0.049 0.810 -0.148 0.275 -0.008 0.106 -0.019 0.000 0.179 0.004 0.002
F3 0.421 -0.033 0.189 0.127 -0.098 -0.213 -0.137 0.014 -0.233 0.040 0.063 -0.224 0.002 -0.076 -0.329 -0.687
F4 0.242 -0.333 0.199 0.103 -0.113 -0.193 -0.131 -0.003 -0.171 -0.113 -0.138 0.580 0.072 0.008 0.552 -0.075
F5 0.196 -0.144 -0.121 0.059 0.956 -0.012 -0.059 0.002 -0.063 -0.002 -0.005 0.011 0.013 -0.031 0.004 0.002
F6 0.388 0.797 -0.089 -0.016 0.034 0.016 0.035 0.041 0.084 -0.318 -0.080 0.293 0.028 0.028 -0.005 0.006
F7 0.437 -0.041 0.200 0.134 -0.107 -0.224 -0.147 0.018 -0.243 0.063 0.067 -0.217 0.013 -0.086 -0.169 0.718
F8 0.151 0.299 -0.034 -0.021 0.012 0.020 0.008 -0.059 -0.046 0.534 0.092 -0.384 -0.034 0.006 0.656 -0.081
F9 0.119 -0.073 0.025 0.199 -0.020 0.090 -0.448 0.015 0.419 -0.036 0.235 -0.058 0.056 0.702 0.005 0.002
F10 0.147 -0.007 0.055 0.122 -0.021 0.126 -0.170 -0.119 0.484 0.556 -0.089 0.367 -0.155 -0.372 -0.230 0.015
F11 -0.065 0.107 -0.056 -0.163 0.026 -0.069 0.157 -0.041 -0.425 0.512 -0.085 0.344 0.150 0.508 -0.273 0.024
F12 -0.273 0.159 -0.121 0.656 0.007 -0.076 -0.115 -0.414 -0.246 -0.053 -0.170 0.004 -0.412 0.060 0.002 0.004
F13 -0.234 0.127 -0.089 0.486 0.005 -0.091 -0.035 0.168 -0.013 0.084 0.225 0.057 0.736 -0.212 -0.003 -0.009
F14 0.061 -0.009 0.111 0.096 -0.017 0.633 0.028 0.085 -0.328 -0.023 0.610 0.196 -0.198 -0.085 0.002 -0.002
F15 0.029 -0.018 0.096 0.300 -0.009 0.322 0.060 0.698 -0.052 0.078 -0.516 -0.104 -0.108 0.094 0.006 -0.001
F16 -0.165 0.072 -0.109 0.033 0.036 -0.559 0.054 0.512 0.081 0.079 0.399 0.151 -0.426 -0.005 0.013 -0.003
Importance 31.35% 25.72% 13.76% 9.00% 6.14% 3.96% 3.32% 2.14% 1.71% 1.10% 0.54% 0.48% 0.41% 0.22% 0.12% 0.01%

Table 4. Results of the Principal Component Analysis (PCA) for BeamNG.AI.AF1.5. Values in boldface indicate the features that contribute the most to the main components (Cs) extracted by PCA.

Features C1 C2 C3 C4 C5 C6 C7 C8 C9 C10 C11 C12 C13 C14 C15 C16
F1 -0.3013 0.1697 0.9106 -0.0966 0.1742 -0.0920 -0.0470 -0.0146 0.0191 -0.0007 -0.0079 0.0009 -0.0012 0.0171 -0.0009 0.0005
F2 0.2283 -0.0926 0.1635 0.3187 -0.0731 -0.0344 0.7700 -0.1680 0.3680 0.0025 0.0910 0.0113 0.0069 0.1987 0.0034 0.0055
F3 0.4457 0.0213 0.1532 0.0835 -0.1044 -0.2147 -0.0823 0.0475 -0.2461 0.0502 0.0436 -0.2968 0.0148 -0.0031 -0.3152 -0.6722
F4 0.3084 -0.3444 0.1755 0.0794 -0.1146 -0.2337 -0.0905 0.0292 -0.2168 -0.1006 -0.0110 0.6109 0.0538 -0.1288 0.4669 -0.0667
F5 0.1994 -0.0987 -0.0950 0.0410 0.9622 -0.0499 0.0662 0.0004 -0.0797 -0.0093 -0.0120 -0.0006 0.0081 -0.0209 0.0028 -0.0021
F6 0.3007 0.8304 -0.0566 -0.0104 0.0210 0.0450 0.0081 0.0032 0.0601 -0.3113 -0.0037 0.3348 0.0019 -0.0334 -0.0194 0.0092
F7 0.4437 0.0118 0.1536 0.0829 -0.1033 -0.2171 -0.0836 0.0501 -0.2489 0.0641 0.0356 -0.2888 0.0202 -0.0315 -0.1391 0.7306
F8 0.1117 0.2956 -0.0191 -0.0040 0.0119 0.0229 -0.0105 -0.0115 0.0170 0.4787 -0.0104 -0.3483 -0.0090 0.0480 0.7329 -0.0916
F9 0.1514 -0.0713 0.0231 0.1558 0.0425 -0.0111 -0.5247 -0.0747 0.4051 0.0470 0.1824 0.0978 0.1140 0.6640 -0.0328 0.0174
F10 0.1366 0.0091 0.0453 0.1147 0.0176 0.0329 -0.1839 -0.1109 0.4188 0.5555 0.0046 0.2402 -0.1757 -0.5312 -0.2520 0.0159
F11 -0.0962 0.1302 -0.0554 -0.1244 -0.0040 -0.0033 0.2201 0.0609 -0.4133 0.5867 -0.0663 0.3854 0.1786 0.3766 -0.2459 0.0294
F12 -0.2875 0.1305 -0.0771 0.6761 0.0135 -0.1822 -0.1126 -0.3885 -0.2939 -0.0136 -0.1116 0.0198 -0.3706 0.0589 0.0060 0.0017
F13 -0.2112 0.0925 -0.0489 0.4256 0.0189 -0.1034 -0.0374 0.1325 0.0158 0.0081 0.1811 -0.0445 0.7980 -0.2478 0.0013 -0.0070
F14 0.0913 -0.0493 0.1334 0.1354 0.0043 0.6999 -0.0222 -0.0839 -0.2883 0.0017 0.6017 0.0295 -0.0778 -0.0576 0.0177 0.0008
F15 0.0551 -0.0323 0.1066 0.3956 -0.0009 0.3609 -0.0222 0.6912 0.0343 0.0069 -0.4418 0.0097 -0.1319 0.0746 -0.0047 -0.0006
F16 -0.1854 0.0752 -0.1024 0.0329 0.0233 -0.4151 0.0758 0.5394 0.0562 0.0419 0.5899 0.0507 -0.3506 0.0126 0.0164 0.0010
Importance 30.96% 24.23% 13.80% 9.35% 5.74% 5.18% 3.38% 2.41% 1.81% 1.21% 0.70% 0.52% 0.34% 0.23% 0.10% 0.01%

when using meta-heuristics. Our initial experiments confirm this finding as default values already provide good results in our case.
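The following sketch illustrates the PMX-style crossover and SWAP mutation described in Section 3.2, instantiated with the default parameter values above ($p_c = 0.80$, $p_m = 1/n$). It is a simplified re-implementation following the operator descriptions in this paper, not the actual SDC-Prioritizer prototype; the helper names are our own.

```python
# Simplified sketch of the PMX-style crossover and SWAP mutation from Section 3.2,
# using the default parameters of Section 4.2.4 (p_c = 0.80, p_m = 1/n).
import random


def pmx_crossover(p1, p2):
    """Keep the first c genes of p1, then append p2's remaining genes in order."""
    c = random.randint(1, len(p1) - 1)          # random cut point
    head = p1[:c]
    tail = [t for t in p2 if t not in head]     # the N - c tests missing from the head
    return head + tail


def swap_mutation(individual, p_m):
    """Swap each position with a random one with probability p_m."""
    mutant = list(individual)
    for i in range(len(mutant)):
        if random.random() < p_m:
            j = random.randrange(len(mutant))
            mutant[i], mutant[j] = mutant[j], mutant[i]
    return mutant


random.seed(1)
parent1, parent2 = [0, 1, 2, 3, 4], [4, 2, 0, 1, 3]
child = pmx_crossover(parent1, parent2) if random.random() < 0.80 else list(parent1)
print(swap_mutation(child, p_m=1.0 / len(child)))
```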

5 RESULTS
This section reports, for each research question, the obtained results and main findings.

5.1 RQ1: SDC Features Analysis
Tables 3, 4, and 5 show the results of the PCA for the datasets BeamNG.AI.AF1, BeamNG.AI.AF1.5, and Driver.AI, respectively. As we can observe, for each dataset, PCA identifies 16 (independent) principal components, whose relative importance is reported in the last (bottom) row of the corresponding table. As these rows indicate, the importance of the components is similar across all of the tables (i.e., datasets): the first component (C1) covers about 30% of the variance in the data (importance), followed by the second component (C2) with about 25%, and so on. Moreover, in all of the datasets, the last six principal components are negligible as they contribute less than 1% of the total variance.


Table 5. Results of the Principal Component Analysis (PCA) for Driver.AI. Values in boldface indicate the features that contribute the most to the main components (Cs) extracted by PCA.

Features C1 C2 C3 C4 C5 C6 C7 C8 C9 C10 C11 C12 C13 C14 C15 C16
F1 -0.1570 0.2939 0.8788 -0.0933 0.3088 -0.0926 -0.0499 -0.0153 -0.0294 0.0084 -0.0092 0.0064 0.0011 0.0173 -0.0012 -0.0003
F2 0.1532 -0.1709 0.1659 0.3213 -0.0573 -0.0128 0.7938 -0.0402 -0.3669 0.0512 0.0751 0.0055 0.0020 0.1961 -0.0046 0.0027
F3 0.3838 -0.1879 0.1669 0.1319 -0.1174 -0.2410 -0.0898 -0.0012 0.2592 -0.0025 0.0384 -0.2417 0.0110 -0.0283 -0.3206 -0.6793
F4 0.0943 -0.3614 0.1681 0.1273 -0.1083 -0.2190 -0.0707 -0.0137 0.1842 -0.1474 -0.0088 0.5874 0.0922 -0.1185 0.5677 -0.0751
F5 0.1414 -0.2220 -0.2185 0.1112 0.9269 -0.0774 0.0262 0.0017 0.0606 -0.0221 -0.0078 -0.0069 0.0142 -0.0191 0.0005 -0.0009
F6 0.6742 0.5909 -0.0893 -0.0497 0.0323 0.0720 0.0043 0.0011 -0.1245 -0.2761 -0.0155 0.2941 0.0234 -0.0214 -0.0070 0.0069
F7 0.3858 -0.1979 0.1719 0.1352 -0.1190 -0.2465 -0.0922 -0.0011 0.2679 0.0111 0.0378 -0.2360 0.0154 -0.0419 -0.1552 0.7250
F8 0.2334 0.2013 -0.0285 -0.0177 0.0139 0.0190 0.0112 0.0044 0.0838 0.4907 0.0118 -0.4344 -0.0506 0.0610 0.6736 -0.0807
F9 0.0801 -0.1131 0.0146 0.1994 -0.0055 -0.0443 -0.5054 -0.0193 -0.4072 0.1097 0.1743 0.0810 0.1024 0.6764 -0.0111 0.0062
F10 0.1119 -0.0437 0.0299 0.1272 0.0011 0.0117 -0.1629 -0.0436 -0.3222 0.6446 0.0343 0.3028 -0.1618 -0.5037 -0.2181 0.0148
F11 -0.0180 0.1361 -0.0565 -0.1477 0.0191 0.0074 0.2188 0.0212 0.5247 0.4732 -0.0530 0.3907 0.1923 0.4074 -0.2220 0.0220
F12 -0.2307 0.3422 -0.1052 0.6554 -0.0080 -0.1726 -0.0473 -0.4003 0.2179 -0.0509 -0.0995 0.0373 -0.3643 0.0513 0.0027 0.0011
F13 -0.1630 0.2366 -0.0680 0.3908 -0.0040 -0.0959 -0.0200 0.1901 -0.0124 0.0239 0.1577 -0.0874 0.7919 -0.2322 0.0001 -0.0037
F14 0.0615 -0.0875 0.1406 0.2016 0.0269 0.7212 -0.0486 -0.0572 0.2615 -0.0544 0.5681 0.0246 -0.0673 -0.0476 0.0085 -0.0011
F15 0.0280 -0.0266 0.0891 0.3459 -0.0095 0.3006 -0.0711 0.6949 0.0570 -0.0001 -0.5096 0.0150 -0.1592 0.0626 -0.0064 0.0009
F16 -0.1191 0.1603 -0.1018 -0.0265 0.0076 -0.4015 0.0712 0.5591 0.0349 -0.0167 0.5824 0.0687 -0.3527 0.0128 0.0091 0.0003
Importance 29.97% 25.43% 12.33% 9.34% 7.09% 5.25% 3.60% 2.32% 1.67% 1.18% 0.77% 0.49% 0.38% 0.24% 0.10% 0.01%

Looking at the scores achieved by the different features, we can observe that they contribute to different (orthogonal) latent components. Hence, the features capture different characteristics of the road segments in the test scenarios. Individual features exclusively capture certain components. For example, in Table 3, C2 (which corresponds to 26% of the variance) is fully captured by feature F6 (i.e., number of turns) with a score greater than 79%. Similar observations can be made for other components: C3 (14% of importance) is captured by F1 (direct end-to-end distance) with an 87% score; C5 (6% of importance) is exclusively related to F5 (number of straight segments) with a 96% score; and so on. Similar results can also be observed in Tables 4 and 5. Looking closely at C1, C9, and C10 in Table 3 (or C1 and C4 in Table 4, and C8 and C10 in Table 5), we can observe that there are at least two features that contribute to them equally. In other words, some road features show some degree of collinearity. Finally, features F3 (number of left turns) and F7 (median angle of turns in the road) both contribute about 40% to the first component (C1), which is the most important component according to PCA. Therefore, we can conclude that the designed road features show some level of multi-collinearity, which is limited to few features and few latent components. Hence, we use PCA for dimensionality reduction and feature extraction as described in Section 4.2.1. In particular, we select the top $h = 10$ principal components as they correspond to (cumulatively) 98% of the original data variance. According to the PCA tables, the last six components are negligible as together they account for less than 2% of the data variance in all of the datasets. Given the results above, we used the lower-dimensional $M'$ matrix produced by the PCA with $h = 10$ to compute the Euclidean distances and the fitness function used by SDC-Prioritizer and greedy-based test prioritization in RQ2 and RQ3.
Finding 1. The designed road features show some level of multi-collinearity. The first ten principal components produced by PCA allowed the identification of ten meta-features, representing 98% of the original datasets' variance, to consider for experimenting with prioritization strategies (i.e., RQ2).

5.2 RQ2: Cost-effectiveness of SDC-Prioritizer Compared to Baseline Approaches
This section compares SDC-Prioritizer against random and greedy-based test prioritization in terms of APFD_c. Since both SDC-Prioritizer and the greedy-based approach require the features to measure the distance between two tests, to perform an unbiased evaluation we use the first 10 principal components produced by PCA (detailed in Section 5.1) for both configurations. Figure 3 depicts the APFD_c values achieved by SDC-Prioritizer and each of the baselines. As we can see in this figure, on average, SDC-Prioritizer reaches about 17% higher APFD_c than the random and greedy approaches in all of the datasets.
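Since APFD_c is central to this comparison, the following sketch shows one way to compute a cost-cognizant APFD score for a given test ordering, following the standard formulation from the regression-testing literature with fault severities assumed to be uniform. It is an illustrative sketch under those assumptions, not the exact evaluation script used in the study, and the names (`apfd_c`, `faults`, etc.) are hypothetical.

```python
import numpy as np

def apfd_c(order, costs, faults):
    """Cost-cognizant APFD for one test ordering.

    order  : list of test indices in execution order
    costs  : mapping test index -> execution cost
    faults : mapping fault id -> set of test indices that detect it

    Fault severities are assumed uniform (all equal to 1), which reduces
    the standard cost-cognizant formula to the version below.
    """
    costs_in_order = np.array([costs[t] for t in order], dtype=float)
    total_cost = costs_in_order.sum()
    numerator = 0.0
    for detecting in faults.values():
        # Position (0-based) of the first test that detects this fault.
        first = min(i for i, t in enumerate(order) if t in detecting)
        # Cost from the detecting test to the end of the suite,
        # minus half the cost of the detecting test itself.
        numerator += costs_in_order[first:].sum() - 0.5 * costs_in_order[first]
    return numerator / (total_cost * len(faults))

# Toy usage: 4 tests, 2 faults.
order = [2, 0, 3, 1]
costs = {0: 10.0, 1: 5.0, 2: 8.0, 3: 12.0}
faults = {"f1": {0, 2}, "f2": {1}}
print(round(apfd_c(order, costs, faults), 3))
```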


[Figure 3: boxplots of the APFD_c scores achieved by the greedy, random, and SDC-Prioritizer approaches, shown per dataset (BeamNG.AI.AF1, BeamNG.AI.AF1.5, Driver.AI) and for all datasets combined; the y-axis (APFD_c) ranges from 0.50 to 0.70.]

Fig. 3. APFD_c achieved by test prioritization with SDC-Prioritizer (using ten features), random, and the greedy approach. The diamond (⋄) denotes the arithmetic mean, and the bold line (—) is the median.

(a) BeamNG.AI.AF1 (b) BeamNG.AI.AF1.5 (c) Driver.AI
Fig. 4. Cost-effectiveness curves produced by the different TCP methods. Each curve depicts the cumulative number of detected faults against the cumulative test execution cost yielded by the test case prioritizations.

It is also worth mentioning that the minimum APFD_c achieved by SDC-Prioritizer is at least 10% higher than the maximum scores achieved by the two baselines in all of the datasets. Moreover, as reported in Table 6, SDC-Prioritizer significantly (p-values < 1.0e-10) outperforms both random and greedy test prioritization in terms of APFD_c (all Â12 values are higher than 0.5). The magnitude of the difference (effect size) is large in all datasets.


Table 6. Comparison of the APFD_c scores achieved by SDC-Prioritizer and the baselines for each of the datasets used in this study. p-values of the Wilcoxon tests, Vargha-Delaney estimates (Â12), and magnitudes are reported.

Dataset         | Vs. Random (Â12, p, Magn.) | Vs. Greedy (Â12, p, Magn.)
BeamNG.AI.AF1   | 1.0, <0.01, large          | 1.0, <0.01, large
BeamNG.AI.AF1.5 | 1.0, <0.01, large          | 1.0, <0.01, large
Driver.AI       | 1.0, <0.01, large          | 1.0, <0.01, large
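To make the statistical comparison in Table 6 concrete, the sketch below computes an unpaired Wilcoxon (Mann-Whitney) p-value and the Vargha-Delaney Â12 effect size for two sets of APFD_c scores. The choice of the unpaired test variant, the SciPy implementation, and the synthetic scores are assumptions for illustration; the study's exact statistical setup is described only at a high level here.

```python
import numpy as np
from scipy.stats import mannwhitneyu

def vargha_delaney_a12(x, y):
    """A12: probability that a random value from x exceeds one from y
    (ties counted as 0.5). Values above 0.5 favour x."""
    x, y = np.asarray(x, dtype=float), np.asarray(y, dtype=float)
    greater = (x[:, None] > y[None, :]).sum()
    ties = (x[:, None] == y[None, :]).sum()
    return (greater + 0.5 * ties) / (len(x) * len(y))

# Hypothetical APFD_c scores from 30 repetitions per approach.
rng = np.random.default_rng(1)
sdc_prioritizer = rng.normal(0.67, 0.01, 30)
random_baseline = rng.normal(0.55, 0.02, 30)

stat, p_value = mannwhitneyu(sdc_prioritizer, random_baseline,
                             alternative="two-sided")
a12 = vargha_delaney_a12(sdc_prioritizer, random_baseline)
print(f"p = {p_value:.2e}, A12 = {a12:.2f}")
```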

To provide more insights into these results, we graphically compare the cumulative number of faults detected by the different approaches when running the test cases incrementally according to the prioritizations they produce. For each dataset, we took a closer look at the permutations generated by SDC-Prioritizer whose APFD_c value equals the median of the APFD_c values delivered by all of the SDC-Prioritizer runs on that dataset. Specifically, we sampled three such permutations, one for each of the datasets. For each dataset, we compare the sampled permutation against the best output of the random strategy (i.e., the random permutation with the highest APFD_c) and against the greedy strategy. For this comparison, we analyze the rate of fault occurrences during the execution of the tests according to the generated permutations. Figure 4 depicts this comparison for each dataset. As we can see from the figure, in all of the benchmarks, running the tests in the order generated by SDC-Prioritizer leads to a higher rate of fault occurrence in a shorter time. As a concrete example, the figure highlights the number of faults detected within the first 20% of the test execution time. In the BeamNG.AI.AF1 dataset (Figure 4a), the permutation generated by SDC-Prioritizer detects 150 faults in the first 20% of the test execution time; this value drops to less than half for the random (76) and greedy (53) approaches. Similarly, in the second dataset (Figure 4b), SDC-Prioritizer generates a permutation that detects 248 faults in the first 20% of the test execution, against 151 and 119 faults for the random and greedy approaches, respectively. The same trend is observed for Driver.AI (Figure 4c), where the sampled permutation from SDC-Prioritizer detects 398 faults, i.e., 167 and 193 more faults than random search and the greedy algorithm, respectively. Interestingly, Figure 4 shows that the greedy approach, which always generates the same permutation, is counter-productive in terms of fault detection rate in all of the datasets.
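The kind of comparison drawn in Figure 4 can be reproduced with a few lines of code: given a permutation, its per-test costs, and the set of failing tests, one can count the faults exposed within a fraction of the total execution budget. The sketch below is a minimal illustration under the simplifying assumption that each failing test reveals one distinct fault; the function name and data are hypothetical.

```python
def faults_within_budget(order, costs, failing, budget_fraction=0.2):
    """Count faults exposed within the first `budget_fraction` of the
    total execution cost when tests run in the given order.

    Simplifying assumption: every failing test reveals one distinct fault.
    """
    total = sum(costs[t] for t in order)
    budget = budget_fraction * total
    spent, detected = 0.0, 0
    for t in order:
        spent += costs[t]
        if spent > budget:      # the budget is exhausted before this test finishes
            break
        if t in failing:
            detected += 1
    return detected

# Toy usage with hypothetical data: 5 tests, 3 of which fail.
order = [3, 1, 0, 2, 4]
costs = {0: 30.0, 1: 10.0, 2: 25.0, 3: 15.0, 4: 20.0}
failing = {1, 2, 4}
print(faults_within_budget(order, costs, failing, budget_fraction=0.5))
```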

Finding 2. SDC-Prioritizer increases the APFD_c score by 17% on average compared with the random and greedy approaches. The improvement achieved by SDC-Prioritizer in terms of fault detection rate is statistically significant. The greedy approach is counter-productive in terms of fault detection rate in all of the datasets.

5.3 RQ3: Overhead of SDC-Prioritizer
Figure 5 illustrates the distribution of the time consumed by both SDC-Prioritizer and greedy test prioritization. As this figure shows, on average, SDC-Prioritizer requires about seven and a half minutes to finish the search process with 4,000 generations. In practice, this amount of time is negligible considering the 16 to 106 hours needed to run the entire set of tests, and SDC-Prioritizer does not negatively impact the performance (e.g., fault detection) of testing practices. In fact, the overall overhead accounts for a minimum of 0.20% (for Driver.AI) and a maximum of 0.34% (for BeamNG.AI.AF1) of the cost needed to run the entire test suites.

Finding 3. The time overhead introduced by SDC-Prioritizer is less than eight minutes and is imperceptible in an SDC simulation pipeline used by developers to test the SDCs' behavior in critical scenarios.


[Figure 5: boxplots of the prioritization time consumed by the greedy approach and SDC-Prioritizer, shown per dataset (BeamNG.AI.AF1, BeamNG.AI.AF1.5, Driver.AI) and for all datasets combined; the y-axis reports the consumed time in seconds, ranging from 0 to 800.]

Fig. 5. Running time of the different TCP approaches.

Figure 5 also shows (right side of the figure) that the average time required by the greedy test prioritization approach is about 50% lower than what SDC-Prioritizer needs. However, in one out of three datasets (Driver.AI), SDC-Prioritizer is, on average, faster than the greedy approach. Even though SDC-Prioritizer is overall slower than greedy (it needs about three minutes more), it delivers a much better test prioritization in terms of APFD_c score (as shown in Section 5.2).

Finding 4. On average, SDC-Prioritizer needs about three minutes more than the greedy test prioritization. However, this negligible extra overhead significantly increases the APFD_c values achieved by the generated test prioritization.

Finally, it is worth mentioning that SDC-Prioritizer consists of two main parts: (i) the pairwise computation of distances between every two tests (using the Euclidean distance), and (ii) running the genetic algorithm. The former is a one-time task (i.e., with one execution, we can run the genetic algorithm multiple times) with a time complexity of O(n²), where n is the number of tests. Since the genetic algorithm reuses the precomputed pairwise distances for fitness evaluation, the complexity of each fitness evaluation is O(n) (due to the search for the most diverse test); the mutation and crossover operators are also O(n). Hence, SDC-Prioritizer has an O(n²) one-time cost (for calculating the distances) and an O(n × m) cost for the whole search process, where n is the number of tests and m is the number of fitness evaluations. Based on this analysis, we can confirm that SDC-Prioritizer scales to large test sets. Notably, the test suites used in this study are much larger than those reported in prior studies on regression testing [56]. Our largest test suite (Driver.AI) contains 5,630 tests; on average, SDC-Prioritizer performed the test prioritization for this test suite in less than 12 minutes.
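To make the complexity argument concrete, the sketch below (with hypothetical names; not the authors' implementation) precomputes the O(n²) pairwise distance matrix once and then evaluates an O(n) diversity-style fitness for a candidate ordering by looking up, for each test, its distance to the previously scheduled test. The specific fitness shape is an assumption used only to show where the n² and n factors come from; the real fitness used by SDC-Prioritizer may differ.

```python
import numpy as np
from scipy.spatial.distance import pdist, squareform

def precompute_distances(features):
    """One-time O(n^2) step: pairwise Euclidean distances between all
    tests, computed on the (PCA-reduced) feature matrix."""
    return squareform(pdist(features, metric="euclidean"))

def diversity_fitness(permutation, dist):
    """O(n) fitness sketch: reward orderings in which each test is far
    (diverse) from the test executed just before it. A single evaluation
    only needs n-1 lookups into the precomputed matrix."""
    return sum(dist[permutation[i - 1], permutation[i]]
               for i in range(1, len(permutation)))

# Toy usage: n tests with 10 PCA meta-features each.
rng = np.random.default_rng(2)
features = rng.normal(size=(50, 10))
dist = precompute_distances(features)      # O(n^2), done once
candidate = rng.permutation(50)            # one GA individual
print(diversity_fitness(candidate, dist))  # O(n) per evaluation
```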

6 THREATS TO VALIDITY
Threats to construct validity concern the relationship between theory and observation. In this case, threats can mainly be due to imprecision in simulation realism as well as in the automated classification of safe and unsafe scenarios. We mitigated both threats by leveraging BeamNG (used in this year's SBST tool competition [53]) as a soft-body simulation environment (which ensures a high simulation accuracy in safety-critical scenarios) and AsFault as a reference technological solution to generate and execute test cases [30], as detailed in Section 4.

Threats to internal validity may concern, as for previous work [32], the relationship between the technologies used to generate the scenarios and the realism of the simulation results. Specifically, we did not recreate all the elements that can be found on real roads (e.g., weather conditions, light conditions, etc.). However, to increase our internal validity, we used both BeamNG.AI and Driver.AI as test subjects. This allows us to assess the cost-effectiveness of our approach by experimenting with different driving styles and driving risk levels. Both BeamNG.AI and Driver.AI leverage a good knowledge of the roads, which means that they do not suffer from the limitations of vision-based lane-keeping systems.

Finally, threats to external validity concern the generalization of our findings. The number of test case scenarios in our study is larger than in previous studies [32], and we experimented with different AI engines. However, our results may not generalize to the universe of open-source CPS simulation environments used in other domains. Therefore, further studies considering more SDC data, other CPS domains, and different safety requirements are needed. To minimize potential external validity threats in our evaluation setting, we followed the guidelines by Arcuri et al. [6]: we compared the results of SDC-Prioritizer with randomized algorithms (the baseline approaches described in Section 4) and repeated the experiment 30 times. Finally, we applied sound non-parametric statistical tests and effect-size statistics to analyse the achieved results.

7 CONCLUSIONS & FUTURE WORK
Regression testing for self-driving cars (SDCs) is particularly expensive due to the cost of running many driving scenarios (test cases) that interact with simulation engines. To improve the cost-effectiveness of regression testing, we introduced a black-box test case prioritization approach, called SDC-Prioritizer, that relies on a set of static road features suitably designed for SDCs. These features can be extracted from the driving scenarios prior to running the tests. SDC-Prioritizer then uses genetic algorithms (GAs) to prioritize the test cases based on their distances (diversity) computed over the proposed road features. We empirically investigated the performance of SDC-Prioritizer and compared it with two baseline approaches: random search and a greedy algorithm. Finally, we assessed whether SDC-Prioritizer introduces excessive computational overhead into the regression testing process.

Our results show that SDC-Prioritizer is more cost-effective than the baseline approaches. Specifically, the single solution provided by SDC-Prioritizer dominates the solutions provided by the baseline approaches in terms of test execution time and fault detection capability. Moreover, SDC-Prioritizer successfully prioritizes the test cases independently of which AI engine is used (i.e., Driver.AI or BeamNG.AI) and across different risk levels (i.e., different driving styles). Interestingly, looking at the running time, we observe that the overhead required by SDC-Prioritizer to prioritize the test scenarios is negligible with respect to the overall test execution cost.

As future work, we plan to replicate our study on further SDC AIs and additional SDC features. Moreover, we plan to perform new empirical studies on further CPS domains to investigate additional safety criteria concerning new types of faults, different from those investigated in this work. Finally, we want to investigate different meta-heuristics in addition to the GA used in this paper.

Acknowledgements & Credit Author Statement
We gratefully acknowledge the Horizon 2020 (EU Commission) support for the project COSMOS (DevOps for Complex Cyber-physical Systems), Project No. 957254-COSMOS.

REFERENCES
[1] 2020. NVIDIA DRIVE Constellation. https://developer.nvidia.com/drive/drive-constellation


[2] Raja Ben Abdessalem, Annibale Panichella, Shiva Nejati, Lionel C Briand, and Thomas Stifter. 2018. Testing autonomous cars for feature interaction failures using many-objective search. In IEEE/ACM International Conference on Automated Software Engineering (ASE). IEEE, 143–154. [3] Academies of Sciences. 2017. A 21st Century Cyber-Physical Systems Education. National Academies Press. [4] Afsoon Afzal, Deborah S. Katz, Claire Le Goues, and Christopher S. Timperley. 2020. A Study on the Challenges of Using Robotics Simulators for Testing. arXiv:2004.07368 [cs.RO] [5] Mustafa Al-Hajjaji, Thomas Thüm, Jens Meinicke, Malte Lochau, and Gunter Saake. 2014. Similarity-based prioritization in software product-line testing. In 18th International Software Product Line Conference, SPLC ’14, Florence, Italy, September 15-19, 2014, Stefania Gnesi, Alessandro Fantechi, Patrick Heymans, Julia Rubin, Krzysztof Czarnecki, and Deepak Dhungana (Eds.). ACM, 197–206. https: //doi.org/10.1145/2648511.2648532 [6] Andrea Arcuri and Lionel C. Briand. 2014. A Hitchhiker’s guide to statistical tests for assessing randomized algorithms in software engineering. Softw. Test. Verification Reliab. 24, 3 (2014), 219–250. https://doi.org/10.1002/stvr.1486 [7] Andrea Arcuri and Gordon Fraser. 2013. Parameter tuning or default values? An empirical investigation in search-based software engineering. Empirical Software Engineering 18, 3 (2013), 594–623. [8] Aitor Arrieta, Goiuria Sagardui, Leire Etxeberria, and Justyna Zander. 2016. Automatic generation of test system instances for configurable cyber-physical systems. Journal 25, 3 (Sept. 2016), 1041–1083. https://doi.org/10.1007/s11219-016-9341-7 [9] Aitor Arrieta, Shuai Wang, Ainhoa Arruabarrena, Urtzi Markiegi, Goiuria Sagardui, and Leire Etxeberria. 2018. Multi-objective black-box test case selection for cost-effectively testing simulation models. In Proceedings of the Genetic and Evolutionary Computation Conference. 1411–1418. [10] Aitor Arrieta, Shuai Wang, Urtzi Markiegi, Goiuria Sagardui, and Leire Etxeberria. 2018. Employing Multi-Objective Search to Enhance Reactive Test Case Generation and Prioritization for Testing Industrial Cyber-Physical Systems. IEEE Trans. Ind. Informatics 14, 3 (2018), 1055–1066. https://doi.org/10.1109/TII.2017.2788019 [11] Aitor Arrieta, Shuai Wang, Goiuria Sagardui, and Leire Etxeberria. 2016. Search-based test case selection of cyber-physical system product lines for simulation-based validation. In Proceedings of the 20th International Systems and Software Product Line Conference, SPLC 2016, Beijing, China, September 16-23, 2016, Hong Mei (Ed.). ACM, 297–306. https://doi.org/10.1145/2934466.2946046 [12] Aitor Arrieta, Shuai Wang, Goiuria Sagardui, and Leire Etxeberria. 2019. Search-Based test case prioritization for simulation-Based testing of cyber-Physical system product lines. J. Syst. Softw. 149 (2019), 1–34. https://doi.org/10.1016/j.jss.2018.09.055 [13] Radhakisan Baheti and Helen Gill. 2011. Cyber-physical systems. The impact of control technology 12, 1 (2011), 161–166. [14] BeamNG.research. [n.d.]. BeamNG GmbH, “BeamNG.research – BeamNG.”. https://beamng.gmbh/research/ [15] Elizabeth Bondi, Debadeepta Dey, Ashish Kapoor, Jim Piavis, Shital Shah, Fei Fang, Bistra Dilkina, Robert Hannaford, Arvind Iyer, Lucas Joppa, and Milind Tambe. 2018. AirSim-W: A Simulation Environment for Wildlife Conservation with UAVs. In Proceedings of the 1st ACM SIGCAS Conference on Computing and Sustainable Societies, COMPASS, Ellen W. Zegura (Ed.). 
ACM, 40:1–40:12. https: //doi.org/10.1145/3209811.3209880 [16] Lionel C Briand, Yvan Labiche, and Marwa Shousha. 2006. Using genetic algorithms for early schedulability analysis and stress testing in real-time systems. Genetic Programming and Evolvable Machines 7, 2 (2006), 145–170. [17] Jinfu Chen, Lili Zhu, Tsong Yueh Chen, Dave Towey, Fei-Ching Kuo, Rubing Huang, and Yuchi Guo. 2018. Test case prioritization for object-oriented software: An adaptive random sequence approach based on clustering. Journal of Systems and Software 135 (2018), 107–125. [18] Tsong Yueh Chen and Man Fai Lau. 1996. Dividing strategies for the optimization of a test suite. Inform. Process. Lett. 60, 3 (Nov. 1996), 135–141. [19] Birchler Christian, Khatiri Sajad, Derakhshanfar Pouria, Panichella Sebastiano, and Panichella Annibale. 2021. Dataset of the paper "Automated Test Cases Prioritization for Self-driving Cars in Virtual Environments". https://doi.org/10.5281/zenodo.5082778 [20] Helen G Cobb and John J Grefenstette. 1993. Genetic algorithms for tracking changing environments. Technical Report. Naval Research Lab Washington DC. [21] Dario Di Nucci, Annibale Panichella, Andy Zaidman, and Andrea De Lucia. 2018. A Test Case Prioritization Genetic Algorithm guided by the Hypervolume Indicator. IEEE Transactions on Software Engineering (2018). To appear. [22] Carsten F Dormann, Jane Elith, Sven Bacher, Carsten Buchmann, Gudrun Carl, Gabriel Carré, Jaime R García Marquéz, Bernd Gruber, Bruno Lafourcade, Pedro J Leitao, et al. 2013. Collinearity: a review of methods to deal with it and a simulation study evaluating their performance. Ecography 36, 1 (2013), 27–46. [23] Alexey Dosovitskiy, Germán Ros, Felipe Codevilla, Antonio M. López, and Vladlen Koltun. 2017. CARLA: An Open Urban Driving Simulator. In 1st Annual Conference on Robot Learning, CoRL 2017 (Proceedings of Machine Learning Research, Vol. 78). PMLR, 1–16. http://proceedings.mlr.press/v78/dosovitskiy17a.html [24] Mingjing Du, Shifei Ding, and Hongjie Jia. 2016. Study on density peaks clustering based on k-nearest neighbors and principal component analysis. Knowledge-Based Systems 99 (2016), 135–145.


[25] Sebastian Elbaum, Alexey Malishevsky, and Gregg Rothermel. 2001. Incorporating varying test costs and fault severities into test case prioritization. In Proceedings of the 23rd International Conference on Software Engineering. ICSE 2001. IEEE, 329–338. [26] Michael G Epitropakis, Shin Yoo, Mark Harman, and Edmund K Burke. 2015. Empirical evaluation of pareto efficient multi-objective regression test case prioritisation. In Proceedings of the 2015 International Symposium on Software Testing and Analysis. ACM, 234–245. [27] Robert Feldt, Simon Poulding, David Clark, and Shin Yoo. 2016. Test set diameter: Quantifying the diversity of sets of test cases. In 2016 IEEE International Conference on Software Testing, Verification and Validation (ICST). IEEE, 223–233. [28] Erik Flores-García, Goo-Young Kim, Jinho Yang, Magnus Wiktorsson, and Sang Do Noh. 2020. Analyzing the Characteristics of Digital Twin and Discrete Event Simulation in Cyber Physical Systems. In Advances in Production Management Systems. Towards Smart and Digital Manufacturing - IFIP WG 5.7 International Conference, APMS 2020, Novi Sad, Serbia, August 30 - September 3, 2020, Proceedings, Part II (IFIP Advances in Information and Communication Technology, Vol. 592), Bojan Lalic, Vidosav D. Majstorovic, Ugljesa Marjanovic, Gregor von Cieminski, and David Romero (Eds.). Springer, 238–244. https://doi.org/10.1007/978-3-030-57997-5_28 [29] Alessio Gambi, Tri Huynh, and Gordon Fraser. 2019. Generating effective test cases for self-driving cars from police reports. In Proceedings of the ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, ESEC/SIGSOFT FSE 2019, Tallinn, Estonia, August 26-30, 2019, Marlon Dumas, Dietmar Pfahl, Sven Apel, and Alessandra Russo (Eds.). ACM, 257–267. https://doi.org/10.1145/3338906.3338942 [30] Alessio Gambi, Marc Mueller, and Gordon Fraser. 2019. AsFault: Testing Self-Driving Car Software Using Search-Based Procedural Content Generation. In 2019 IEEE/ACM 41st International Conference on Software Engineering: Companion Proceedings (ICSE-Companion). IEEE. https://doi.org/10.1109/icse-companion.2019.00030 [31] Alessio Gambi, Marc Müller, and Gordon Fraser. 2019. Automatically testing self-driving cars with search-based procedural content generation. In Proceedings of the 28th ACM SIGSOFT International Symposium on Software Testing and Analysis, ISSTA 2019, Beijing, China, July 15-19, 2019, Dongmei Zhang and Anders Møller (Eds.). ACM, 318–328. https://doi.org/10.1145/3293882.3330566 [32] Alessio Gambi, Marc Müller, and Gordon Fraser. 2019. Automatically testing self-driving cars with search-based procedural content generation. In Proceedings of the 28th ACM SIGSOFT International Symposium on Software Testing and Analysis, ISSTA 2019, Beijing, China, July 15-19, 2019, Dongmei Zhang and Anders Møller (Eds.). ACM, 318–328. https://doi.org/10.1145/3293882.3330566 [33] Aurélien Géron. 2019. Hands-on machine learning with Scikit-Learn, Keras, and TensorFlow: Concepts, tools, and techniques to build intelligent systems. O’Reilly Media. [34] David E Goldberg. 2006. Genetic algorithms. Pearson Education India. [35] Carlos A. González, Mojtaba Varmazyar, Shiva Nejati, Lionel C. Briand, and Yago Isasi. 2018. Enabling Model Testing of Cyber-Physical Systems. In Proceedings of the 21th ACM/IEEE International Conference on Model Driven Engineering Languages and Systems (Copenhagen, Denmark) (MODELS ’18). 
Association for Computing Machinery, New York, NY, USA, 176–186. https://doi.org/10.1145/3239372.3239409 [36] The Guardian. 2018. Self-driving Uber kills Arizona woman in first fatal crash involving pedestrian. https://www.theguardian.com/ technology/2018/mar/19/uber-self-driving-car-kills-woman-arizona-tempe [37] Hadi Hemmati, Andrea Arcuri, and Lionel Briand. 2013. Achieving scalable model-based testing through test case diversity. ACM Transactions on Software Engineering and Methodology (TOSEM) 22, 1 (2013), 6. [38] Christopher Henard, Mike Papadakis, Mark Harman, Yue Jia, and Yves Le Traon. 2016. Comparing white-box and black-box test prioritization. In Software Engineering (ICSE), 2016 IEEE/ACM 38th International Conference on. IEEE, 523–534. [39] Sebastian G. Elbaum Hyunsook Do and Gregg Rothermel. 2005. Supporting Controlled Experimentation with Testing Techniques: An Infrastructure and its Potential Impact. Empirical Software Engineering 10 (2005), 405–435. [40] Félix Ingrand. 2019. Recent Trends in Formal Validation and Verification of Autonomous Robots Software. In 3rd IEEE International Conference on Robotic Computing, IRC 2019, Naples, Italy, February 25-27, 2019. 321–328. [41] Bo Jiang, Zhenyu Zhang, Wing Kwong Chan, and TH Tse. 2009. Adaptive random test case prioritization. In 2009 IEEE/ACM International Conference on Automated Software Engineering. IEEE, 233–244. [42] Nidhi Kalra and Susan Paddock. 2016. Driving to safety: How many miles of driving would it take to demonstrate autonomous vehicle reliability? Transportation Research Part A: Policy and Practice 94 (12 2016), 182–193. https://doi.org/10.1016/j.tra.2016.09.010 [43] Jiseob Kim, Sunil Chon, and Jihwan Park. 2019. Suggestion of Testing Method for Industrial Level Cyber-Physical System in Complex Environment. In 2019 IEEE International Conference on Software Testing, Verification and Validation Workshops (ICSTW). IEEE. https: //doi.org/10.1109/icstw.2019.00043 [44] Yves Ledru, Alexandre Petrenko, Sergiy Boroday, and Nadine Mandran. 2012. Prioritizing test cases with string distances. Automated Software Engineering 19, 1 (2012), 65–95. [45] Zheng Li, Mark Harman, and Robert M. Hierons. 2007. Search Algorithms for Regression Test Case Prioritization. IEEE Trans. Software Eng. 33, 4 (2007), 225–237. https://doi.org/10.1109/TSE.2007.38 [46] A. Loquercio, E. Kaufmann, R. Ranftl, A. Dosovitskiy, V. Koltun, and D. Scaramuzza. 2020. Deep Drone Racing: From Simulation to Reality With Domain Randomization. IEEE Transactions on Robotics 36, 1 (2020), 1–14. [47] Alexey G Malishevsky, Joseph R Ruthruff, Gregg Rothermel, and Sebastian Elbaum. 2006. Cost-cognizant test case prioritization. Technical Report. Technical Report TR-UNL-CSE-2006-0004, University of Nebraska-Lincoln.


[48] Alessandro Marchetto, Md Mahfuzul Islam, Waseem Asghar, Angelo Susi, and Giuseppe Scanniello. 2015. A multi-objective technique to prioritize test cases. IEEE Transactions on Software Engineering 42, 10 (2015), 918–940. [49] Reza Matinnejad, Shiva Nejati, Lionel Briand, Thomas Bruckmann, and Claude Poull. 2013. Automated model-in-the-loop testing of continuous controllers using search. In International Symposium on Search Based Software Engineering. Springer, 141–157. [50] Jaruwan Mesit and Ratan K. Guha. 2011. A general model for soft body simulation in motion. In Winter Simulation Conference, S. Jain, Roy R. Creasey Jr., Jan Himmelspach, K. Preston White, and Michael C. Fu 0001 (Eds.). IEEE, 2690–2702. http://dl.acm.org/citation.cfm? id=2431518 [51] Breno Miranda, Emilio Cruciani, Roberto Verdecchia, and Antonia Bertolino. 2018. Fast approaches to scalable similarity-based test case prioritization. In 2018 IEEE/ACM 40th International Conference on Software Engineering (ICSE). IEEE, 222–232. [52] Dario Di Nucci, Annibale Panichella, Andy Zaidman, and Andrea De Lucia. 2020. A Test Case Prioritization Genetic Algorithm Guided by the Hypervolume Indicator. IEEE Trans. Software Eng. 46, 6 (2020), 674–696. https://doi.org/10.1109/TSE.2018.2868082 [53] Sebastiano Panichella, Alessio Gambi, Fiorella Zampetti, and Vincenzo Riccio. 2021. SBST Tool Competition 2021. In International Conference on Software Engineering, Workshops, Madrid, Spain, 2021. ACM. [54] Vincenzo Riccio and Paolo Tonella. 2020. Model-based Exploration of the Frontier of Behaviours for Deep Learning System Testing. In Proceedings of the ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE ’20). Association for Computing Machinery, 13 pages. https://doi.org/10.1145/3368089.3409730 [55] Gregg Rothermel, Mary Jean Harrold, Jeffery Ostrin, and Christie Hong. 1998. An Empirical Study of the Effects of Minimization onthe Fault Detection Capabilities of Test Suites. In Proceedings of the International Conference on . IEEE CS Press, 34–44. [56] G. Rothermel, R.H. Untch, Chengyun Chu, and M.J. Harrold. 1999. Test case prioritization: an empirical study. In Software Maintenance, 1999. (ICSM ’99) Proceedings. IEEE International Conference on. 179–188. https://doi.org/10.1109/ICSM.1999.792604 [57] Gregg Rothermel, Roland H Untch, Chengyun Chu, and Mary Jean Harrold. 1999. Test case prioritization: An empirical study. In Proceedings IEEE International Conference on Software Maintenance-1999 (ICSM’99).’Software Maintenance for Business Change’(Cat. No. 99CB36360). IEEE, 179–188. [58] Abdel Salam Sayyad, Katerina Goseva-Popstojanova, Tim Menzies, and Hany Ammar. 2013. On parameter tuning in search based software engineering: A replicated empirical study. In Replication in Empirical Software Engineering Research (RESER), 2013 3rd International Workshop on. IEEE, 84–90. [59] J David Schaffer, Rich Caruana, Larry J Eshelman, and Rajarshi Das. 1989. A study of control parameters affecting online performance of genetic algorithms for function optimization. In Proceedings of the 3rd international conference on genetic algorithms. 51–60. [60] Seung Yeob Shin, Karim Chaouch, Shiva Nejati, Mehrdad Sabetzadeh, Lionel C. Briand, and Frank Zimmer. 2018. HITECS: A UML Profile and Analysis Framework for Hardware-in-the-Loop Testing of Cyber Physical Systems. In Proc. Int’l Conf. on Model Driven Engineering Languages and Systems (MODELS) (Copenhagen, Denmark). ACM, 357–367. 
[61] Seung Yeob Shin, Shiva Nejati, Mehrdad Sabetzadeh, Lionel C. Briand, and Frank Zimmer. 2018. Test case prioritization for acceptance testing of cyber physical systems: a multi-objective search-based approach. In Proceedings of the 27th ACM SIGSOFT International Symposium on Software Testing and Analysis, ISSTA 2018, Amsterdam, The Netherlands, July 16-21, 2018, Frank Tip and Eric Bodden (Eds.). ACM, 49–60. https://doi.org/10.1145/3213846.3213852 [62] Sebastian Sontges and Matthias Althoff. 2018. Computing the Drivable Area of Autonomous Road Vehicles in Dynamic Road Scenes. IEEE Trans. Intell. Transp. Syst. 19, 6 (2018), 1855–1866. https://doi.org/10.1109/TITS.2017.2742141 [63] The-Washington-Post. 2019. Uber’s radar detected Elaine Herzberg nearly 6 seconds before she was fatally struck, but “the system design did not include a consideration for jaywalking pedestrians” so it didn’t react as if she were a person. https://mobile.twitter.com/ faizsays/status/1191885955088519168 [64] Stephen W Thomas, Hadi Hemmati, Ahmed E Hassan, and Dorothea Blostein. 2014. Static test case prioritization using topic models. Empirical Software Engineering 19, 1 (2014), 182–212. [65] András Vargha and Harold D Delaney. 2000. A critique and improvement of the CL common language effect size statistics of McGraw and Wong. Journal of Educational and Behavioral Statistics 25, 2 (2000), 101–132. [66] Jiaxuan Xu, Qi Luo, Kecheng Xu, Xiangquan Xiao, Siyang Yu, Jiangtao Hu, Jinghao Miao, and Jingao Wang. 2019. An Automated Learning-Based Procedure for Large-scale Vehicle Dynamics Modeling on Baidu Apollo Platform. In 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS. IEEE, 5049–5056. https://doi.org/10.1109/IROS40897.2019.8968102 [67] Rajaa Vikhram Yohanandhan, Rajvikram Madurai Elavarasan, Premkumar Manoharan, and Lucian Mihet-Popa. 2020. Cyber-Physical Power System (CPPS): A Review on Modeling, Simulation, and Analysis With Cyber Security Applications. IEEE Access 8 (2020), 151019–151064. https://doi.org/10.1109/ACCESS.2020.3016826 [68] Shin Yoo and Mark Harman. 2010. Using hybrid algorithm for Pareto efficient multi-objective test suite minimisation. Journal of Systems and Software 83, 4 (2010), 689–701. [69] S. Yoo and M. Harman. 2012. Regression testing minimization, selection and prioritization: a survey. Software Testing, Verification and Reliability 22, 2 (March 2012), 67–120.


[70] Eleni Zapridou, Ezio Bartocci, and Panagiotis Katsaros. 2020. Runtime Verification of Autonomous Driving Systems in CARLA. In Runtime Verification, Jyotirmoy Deshmukh and Dejan Ničković (Eds.). Springer International Publishing, Cham, 172–183.
[71] Béla Újházi, Rudolf Ferenc, Denys Poshyvanyk, and Tibor Gyimóthy. 2010. New Conceptual Coupling and Cohesion Metrics for Object-Oriented Systems. In 2010 10th IEEE Working Conference on Source Code Analysis and Manipulation. 33–42. https://doi.org/10.1109/SCAM.2010.14
