Mutation Testing

Total Page:16

File Type:pdf, Size:1020Kb

Mutation Testing Assessing Test Quality David Schuler Dissertation zur Erlangung des Grades des Doktors der Ingenieurwissenschaften der Naturwissenschaftlich-Technischen Fakultaten¨ der Universitat¨ des Saarlandes Saarbrucken,¨ 2011 Day of Defense Dean Prof. Holger Hermanns Head of the Examination Board Prof. Dr. Reinhard Wilhelm Members of the Examination Board Prof. Dr. Andreas Zeller, Prof. Dr. Sebastian Hack Dr. Valentin Dallmeier ii Abstract When developing tests, one is interested in creating tests of good quality that thor- oughly test the program. This work shows how to assess test quality through mutation testing with impact metrics, and through checked coverage. Although there a different aspects that contribute to a test’s quality, the most impor- tant factor is its ability to reveal defects, because software testing is usually carried out with the aim to detect defects. For this purpose, a test has to provide inputs that execute the defective code under such conditions that it causes an infection. This infection has to propagate and result in a failure, which can be detected by a check of the test. In the past, the aspect of test input quality has been extensively studied while the quality of checks has received less attention. The traditional way of assessing the quality of a test suite’s checks is mutation test- ing. Mutation testing seeds artificial defects (mutations) into a program, and checks whether the tests detect them. While this technique effectively assesses the quality of checks, it also has two drawbacks. First, it places a huge demand on computing resources. Second, equivalent mutants, which are mutants that are semantically equiv- alent to the original program, dilute the quality of the results. In this work, we address both of these issues. We present the JAVALANCHE framework that applies several op- timizations to enable automated and efficient mutation testing for real-life programs. Furthermore, we address the problem of equivalent mutants by introducing impact met- rics to detect non-equivalent mutants. Impact metrics compare properties of tests suite runs on the original program with runs on mutated versions, and are based on abstrac- tions over program runs such as dynamic invariants, covered statements, and return values. The intention of these metrics is that mutations that have a graver influence on the program run are more likely to be non-equivalent. Moreover, we introduce checked coverage, an alternative approach to measure the quality of a test suite’s checks. Checked coverage determines the parts of the code that were not only executed, but that actually contribute to the results checked by the test suite, by computing dynamic backward slices from all explicit checks of the test suite. iv Zusammenfassung Diese Arbeit stellt dar, wie die Qualitat¨ von Software Tests durch mutationsbasiertes Testen in Verbindung mit Auswirkungsmaßen und durch Checked Coverage beurteilt werden kann. Obwohl unterschiedliche Faktoren die Qualitat¨ eines Tests beeinflussen, ist der wichtigste Aspekt die Fahigkeit¨ Fehler aufzudecken. Dazu muss ein Test Eingaben bereitstellen, die den fehlerhaften Teil des Programms so ausfuhren,¨ dass eine Infek- tion entsteht, d.h. der Programmzustand fehlerhaft wird. Diese Infektion muss sich so fortpflanzen, dass sie in einem fehlerhaften Ergebnis resultiert, welches dann von einer Test-Prufung¨ erkannt werden muss. Die herkommliche¨ Methode um die Qualitat¨ von Test-Prufungen¨ zu beurteilen ist mutationsbasiertes Testen. Hierbei werden kunstliche¨ Fehler (Mutationen) in ein Pro- gramm eingebaut und es wird uberpr¨ uft,¨ ob diese von den Tests erkannt werden. Ob- wohl diese Technik die Qualitat¨ von Test-Prufungen¨ beurteilen kann, weist sie zwei Nachteile auf. Erstens hat sie einen großen Bedarf an Rechenkapazitaten.¨ Zweitens verwassern¨ aquivalente¨ Mutationen, welche zwar die Syntax eines Programms andern,¨ jedoch nicht seine Semantik, die Qualitat¨ der Ergebnisse. In dieser Arbeit werden Losungen¨ fur¨ beide Probleme aufgezeigt. Wir prasentieren¨ JAVALANCHE, ein Sys- tem, das effizientes und automatisiertes mutationsbasiertes Testen fur¨ realistische Pro- gramme ermoglicht.¨ Des Weiteren wird das Problem von aquivalenten¨ Mutationen mit- tels Auswirkungsmaßen angegangen. Auswirkungsmaße vergleichen Eigenschaften zwischen einem normalen Programmlauf und einem Lauf eines mutierten Programms. Hierbei werden verschiedene Abstraktionen uber¨ den Programmlauf benutzt. Die zu- grunde liegende Idee ist, dass eine Mutation, die eine große Auswirkung auf den Pro- grammlauf hat, weniger wahrscheinlich aquivalent¨ ist. Daruber¨ hinaus stellen wir Checked Coverage vor, ein neuartiges Abdeckungsmaß, welches die Qualitat¨ von Test-Prufungen¨ misst. Checked Coverage bestimmt die Teile im Programmcode, die nicht nur ausgefuhrt,¨ sondern deren Resultate auch von den Tests uberpr¨ uft¨ werden. vi Acknowledgements First of all, I would like to thank my adviser Andreas Zeller for the guidance and support while working on my PhD. Many thanks also go to Sebastian Hack for being the second examiner of my thesis, and to During my time at the Software Engineering chair, I have been fortunate to work with great colleagues, of whom many helped me with proofreading parts of this thesis. Thank you Valentin Dallmeier, Gordon Fraser, Florian Groß, Clemens Hammacher, Kim Herzig, Yana Mileva, Jeremias Roßler,¨ and Kevin Streit. While working at this chair, I have been lucky to have great office mates. Thank you Christian Lindig, Rahul Premraj and Thomas Zimmermann. The research that I carried out would not have been possible without the work of our system administrators Kim Herzig, Sascha Just, Sebastian Hafner and Christian Holler who did a great job at maintaining the infras- tructure at our chair. For parts of this research, I collaborated with Bernhard Grun¨ and Jaechang Nam, whom I would like to thank. Furthermore, I would like to thank Arnika Marx and Anna Theobald for proofreading. I am deeply grateful to my friends and family. Especially, I would like to thank Anna, Conny, Heiner, Bastian, Julia, Luzie, Peter and Timo for cooking together and the good time we spent together. Finally, my biggest thanks go to my parents Sigrid and Werner for always supporting me. Contents 1 Introduction1 1.1 Thesis Structure.............................3 1.2 Publications...............................4 2 Background7 2.1 Software Testing............................7 2.2 Unit Tests................................9 2.3 Coverage Metrics............................ 11 2.3.1 Control Flow Criteria...................... 12 2.3.2 Data Flow Criteria....................... 14 2.3.3 Logic Coverage Criteria.................... 14 2.3.4 Summary Coverage Criteria.................. 15 2.4 Program Analysis............................ 16 2.4.1 Static Program Analysis.................... 16 2.4.2 Dynamic Program Analysis.................. 17 2.4.3 Execution Trace........................ 18 2.4.4 Program Analysis Techniques................. 19 2.4.5 Coverage Metrics........................ 19 2.4.6 Program Slicing........................ 19 2.4.7 Invariants............................ 22 2.4.8 Related Work.......................... 24 2.5 Summary................................ 25 3 Mutation Testing 27 3.1 Underlying Hypotheses......................... 30 3.1.1 Competent Programmer Hypothesis.............. 30 3.1.2 Coupling Effect......................... 31 ix CONTENTS 3.2 Costs of Mutation Testing........................ 31 3.3 Optimizations.............................. 32 3.3.1 Mutation Reduction Techniques................ 32 3.3.2 Mutant Sampling........................ 33 3.3.3 Selective Mutation....................... 33 3.3.4 Weak Mutation......................... 34 3.3.5 Mutation Schemata....................... 34 3.3.6 Coverage Data......................... 35 3.3.7 Parallelization......................... 35 3.4 Related Work.............................. 35 3.5 Summary................................ 36 4 The Javalanche Framework 37 4.1 Applying Javalanche.......................... 39 4.2 Subject Programs............................ 41 4.3 Mutation Testing Results........................ 43 4.4 Related Work.............................. 44 4.4.1 Further Uses of Javalanche................... 45 4.5 Summary................................ 47 5 Equivalent Mutants 49 5.1 Types of Mutations........................... 50 5.1.1 A Regular Mutation...................... 50 5.1.2 An Equivalent Mutation.................... 50 5.1.3 A Not Executed Mutation................... 51 5.2 Manual Classification.......................... 52 5.2.1 Percentage of Equivalent Mutants............... 52 5.2.2 Classification Time....................... 53 5.2.3 Mutation Operators....................... 53 5.2.4 Types of Equivalent Mutants.................. 54 5.2.5 Discussion........................... 56 5.3 Related Work.............................. 58 5.4 Summary................................ 59 6 Invariant Impact of Mutations 61 6.1 Learning Invariants........................... 62 6.2 Checking Invariants........................... 63 6.3 Classifying Mutations.......................... 64 6.4 Evaluation................................ 65 x CONTENTS 6.4.1 Evaluation Subjects....................... 65 6.4.2 Manual Classification of Impact Mutants........... 67 6.4.3 Invariant Impact and Tests................... 71 6.4.4 Ranking............................. 73 6.4.5 Invariant Impact of the Manually Classified Mutants..... 76 6.4.6 Discussion..........................
Recommended publications
  • ON SOFTWARE TESTING and SUBSUMING MUTANTS an Empirical Study
    ON SOFTWARE TESTING AND SUBSUMING MUTANTS An empirical study Master Degree Project in Computer Science Advanced level 15 credits Spring term 2014 András Márki Supervisor: Birgitta Lindström Examiner: Gunnar Mathiason Course: DV736A WECOA13: Web Computing Magister Summary Mutation testing is a powerful, but resource intense technique for asserting software quality. This report investigates two claims about one of the mutation operators on procedural logic, the relation operator replacement (ROR). The constrained ROR mutant operator is a type of constrained mutation, which targets to lower the number of mutants as a “do smarter” approach, making mutation testing more suitable for industrial use. The findings in the report shows that the hypothesis on subsumption is rejected if mutants are to be detected on function return values. The second hypothesis stating that a test case can only detect a single top-level mutant in a subsumption graph is also rejected. The report presents a comprehensive overview on the domain of mutation testing, displays examples of the masking behaviour previously not described in the field of mutation testing, and discusses the importance of the granularity where the mutants should be detected under execution. The contribution is based on literature survey and experiment. The empirical findings as well as the implications are discussed in this master dissertation. Keywords: Software Testing, Mutation Testing, Mutant Subsumption, Relation Operator Replacement, ROR, Empirical Study, Strong Mutation, Weak Mutation
    [Show full text]
  • Thesis Template
    Automated Testing: Requirements Propagation via Model Transformation in Embedded Software Nader Kesserwan A Thesis In The Concordia Institute For Information System Engineering Presented in Partial Fulfillment of the Requirements For the Degree of Doctor of Philosophy (Information and Systems Engineering) at Concordia University Montreal, Quebec, Canada March 2020 © Nader Kesserwan 2020 ABSTRACT Automated Testing: Requirements Propagation via Model Transformation in Embedded Software Nader Kesserwan, Ph.D. Concordia University, 2020 Testing is the most common activity to validate software systems and plays a key role in the software development process. In general, the software testing phase takes around 40-70% of the effort, time and cost. This area has been well researched over a long period of time. Unfortunately, while many researchers have found methods of reducing time and cost during the testing process, there are still a number of important related issues such as generating test cases from UCM scenarios and validate them need to be researched. As a result, ensuring that an embedded software behaves correctly is non-trivial, especially when testing with limited resources and seeking compliance with safety-critical software standard. It thus becomes imperative to adopt an approach or methodology based on tools and best engineering practices to improve the testing process. This research addresses the problem of testing embedded software with limited resources by the following. First, a reverse-engineering technique is exercised on legacy software tests aims to discover feasible transformation from test layer to test requirement layer. The feasibility of transforming the legacy test cases into an abstract model is shown, along with a forward engineering process to regenerate the test cases in selected test language.
    [Show full text]
  • Testing, Debugging & Verification
    Testing, debugging & verification Srinivas Pinisetty This course Introduction to techniques to get (some) certainty that your program does what it’s supposed to. Specification: An unambiguous description of what a function (program) should do. Bug: failure to meet specification. What is a Bug? Basic Terminology ● Defect (aka bug, fault) introduced into code by programmer (not always programmer's fault, if, e.g., requirements changed) ● Defect may cause infection of program state during execution (not all defects cause infection) ● Infected state propagates during execution (infected parts of states may be overwritten or corrected) ● Infection may cause a failure: an externally observable error (including, e.g., non-termination) Terminology ● Testing - Check for bugs ● Debugging – Relating a failure to a defect (systematically find source of failure) ● Specification - Describe what is a bug ● (Formal) Verification - Prove that there are no bugs Cost of certainty Formal Verification Property based testing Unit testing Man hours (*) Graph not based on data, only indication More certainty = more work Contract metaphor Supplier: (callee) Implementer of method Client: (caller) Implementer of calling method or user Contract: Requires (precondition): What the client must ensure Ensures (postcondition): What the supplier must ensure ● Testing ● Formal specification ○ Unit testing ○ Logic ■ Coverage criteria ■ Propositional logic ● Control-Flow based ■ Predicate Logic ● Logic Based ■ SAT ■ Extreme Testing ■ SMT ■ Mutation testing ○ Dafny ○ Input
    [Show full text]
  • Different Types of Testing
    Different Types of Testing Performance testing a. Performance testing is designed to test run time performance of software within the context of an integrated system. It is not until all systems elements are fully integrated and certified as free of defects the true performance of a system can be ascertained b. Performance tests are often coupled with stress testing and often require both hardware and software infrastructure. That is, it is necessary to measure resource utilization in an exacting fashion. External instrumentation can monitor intervals, log events. By instrument the system, the tester can uncover situations that lead to degradations and possible system failure Security testing If your site requires firewalls, encryption, user authentication, financial transactions, or access to databases with sensitive data, you may need to test these and also test your site's overall protection against unauthorized internal or external access Exploratory Testing Often taken to mean a creative, internal software test that is not based on formal test plans or test cases; testers may be learning the software as they test it Benefits Realization tests With the increased focus on the value of Business returns obtained from investments in information technology, this type of test or analysis is becoming more critical. The benefits realization test is a test or analysis conducted after an application is moved into production in order to determine whether the application is likely to deliver the original projected benefits. The analysis is usually conducted by the business user or client group who requested the project and results are reported back to executive management Mutation Testing Mutation testing is a method for determining if a set of test data or test cases is useful, by deliberately introducing various code changes ('bugs') and retesting with the original test data/cases to determine if the 'bugs' are detected.
    [Show full text]
  • Guidelines on Minimum Standards for Developer Verification of Software
    Guidelines on Minimum Standards for Developer Verification of Software Paul E. Black Barbara Guttman Vadim Okun Software and Systems Division Information Technology Laboratory July 2021 Abstract Executive Order (EO) 14028, Improving the Nation’s Cybersecurity, 12 May 2021, di- rects the National Institute of Standards and Technology (NIST) to recommend minimum standards for software testing within 60 days. This document describes eleven recommen- dations for software verification techniques as well as providing supplemental information about the techniques and references for further information. It recommends the following techniques: • Threat modeling to look for design-level security issues • Automated testing for consistency and to minimize human effort • Static code scanning to look for top bugs • Heuristic tools to look for possible hardcoded secrets • Use of built-in checks and protections • “Black box” test cases • Code-based structural test cases • Historical test cases • Fuzzing • Web app scanners, if applicable • Address included code (libraries, packages, services) The document does not address the totality of software verification, but instead recom- mends techniques that are broadly applicable and form the minimum standards. The document was developed by NIST in consultation with the National Security Agency. Additionally, we received input from numerous outside organizations through papers sub- mitted to a NIST workshop on the Executive Order held in early June, 2021 and discussion at the workshop as well as follow up with several of the submitters. Keywords software assurance; verification; testing; static analysis; fuzzing; code review; software security. Disclaimer Any mention of commercial products or reference to commercial organizations is for infor- mation only; it does not imply recommendation or endorsement by NIST, nor is it intended to imply that the products mentioned are necessarily the best available for the purpose.
    [Show full text]
  • Jia and Harman's an Analysis and Survey of the Development Of
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING 1 An Analysis and Survey of the Development of Mutation Testing Yue Jia Student Member, IEEE, and Mark Harman Member, IEEE Abstract— Mutation Testing is a fault–based software testing Besides using Mutation Testing at the software implementation technique that has been widely studied for over three decades. level, it has also been applied at the design level to test the The literature on Mutation Testing has contributed a set of specifications or models of a program. For example, at the design approaches, tools, developments and empirical results. This paper level Mutation Testing has been applied to Finite State Machines provides a comprehensive analysis and survey of Mutation Test- ing. The paper also presents the results of several development [20], [28], [88], [111], State charts [95], [231], [260], Estelle trend analyses. These analyses provide evidence that Mutation Specifications [222], [223], Petri Nets [86], Network protocols Testing techniques and tools are reaching a state of maturity [124], [202], [216], [238], Security Policies [139], [154], [165], and applicability, while the topic of Mutation Testing itself is the [166], [201] and Web Services [140], [142], [143], [193], [245], subject of increasing interest. [259]. Index Terms— mutation testing, survey Mutation Testing has been increasingly and widely studied since it was first proposed in the 1970s. There has been much research work on the various kinds of techniques seeking to I. INTRODUCTION turn Mutation Testing into a practical testing approach. However, Mutation Testing is a fault-based testing technique which pro- there is little survey work in the literature on Mutation Testing.
    [Show full text]
  • What Are We Really Testing in Mutation Testing for Machine
    What Are We Really Testing in Mutation Testing for Machine Learning? A Critical Reflection Annibale Panichella Cynthia C. S. Liem [email protected] [email protected] Delft University of Technology Delft University of Technology Delft, The Netherlands Delft, The Netherlands Abstract—Mutation testing is a well-established technique for niques, as extensively surveyed by Ben Braiek & Khom [5], assessing a test suite’s quality by injecting artificial faults into as well as Zhang et al. [6]. While this is a useful development, production code. In recent years, mutation testing has been in this work, we argue that current testing for ML approaches extended to machine learning (ML) systems, and deep learning (DL) in particular; researchers have proposed approaches, tools, are not sufficiently explicit about what is being tested exactly. and statistically sound heuristics to determine whether mutants In particular, in this paper, we focus on mutation testing, a in DL systems are killed or not. However, as we will argue well-established white-box testing technique, which recently in this work, questions can be raised to what extent currently has been applied in the context of DL. As we will show, used mutation testing techniques in DL are actually in line with several fundaments of classical mutation testing are currently the classical interpretation of mutation testing. We observe that ML model development resembles a test-driven development unclearly or illogically mapped to the ML context. After (TDD) process, in which a training algorithm (‘programmer’) illustrating this, we will conclude this paper with several generates a model (program) that fits the data points (test concrete action points for improvement.
    [Show full text]
  • Using Mutation Testing to Measure Behavioural Test Diversity
    Using mutation testing to measure behavioural test diversity Francisco Gomes de Oliveira Neto, Felix Dobslaw, Robert Feldt Chalmers and the University of Gothenburg Dept. of Computer Science and Engineering Gothenburg, Sweden [email protected], fdobslaw,[email protected] Abstract—Diversity has been proposed as a key criterion [7], or even over the entire set of tests altogether [8]. In to improve testing effectiveness and efficiency. It can be used other words, the goal is then, to obtain a test suite able to to optimise large test repositories but also to visualise test exercise distinct parts of the SUT to increase the fault detection maintenance issues and raise practitioners’ awareness about waste in test artefacts and processes. Even though these diversity- rate [5], [8]. However, the main focus in diversity-based testing based testing techniques aim to exercise diverse behavior in the has been on calculating diversity on, typically static, artefacts system under test (SUT), the diversity has mainly been measured while there has been little focus on considering the outcomes on and between artefacts (e.g., inputs, outputs or test scripts). of existing test cases, i.e., whether similar tests fail together Here, we introduce a family of measures to capture behavioural when executed against the same SUT. Many studies report that diversity (b-div) of test cases by comparing their executions and failure outcomes. Using failure information to capture the using the outcomes of test cases has proven to be an effective SUT behaviour has been shown to improve effectiveness of criteria to prioritise tests, but there are many challenges with history-based test prioritisation approaches.
    [Show full text]
  • ISTQB Glossary of Testing Terms 2.3X
    Standard glossary of terms used in Software Testing Version 2.3 (dd. March 28th, 2014) Produced by the ‘Glossary Working Party’ International Software Testing Qualifications Board Editor : Erik van Veenendaal (Bonaire) Copyright Notice This document may be copied in its entirety, or extracts made, if the source is acknowledged. Copyright © 2014, International Software Testing Qualifications Board (hereinafter called ISTQB®). 1 Acknowledgements This document was produced by the Glossary working group of the International Software Testing Qualifications Board (ISTQB). The team thanks the national boards for their suggestions and input. At the time the Glossary version 2.3 was completed the Glossary working group had the following members (alphabetic order): Armin Beer, Armin Born, Mette Bruhn-Pedersen, Josie Crawford, Ernst Dőring, George Fialkovitz, Matthias Hamburg, Bernard Homes, Ian Howles, Ozgur Kisir, Gustavo Marquez- Soza, Judy McKay (Vice-Chair) Avi Ofer, Ana Paiva, Andres Petterson, Juha Pomppu, Meile Posthuma. Lucjan Stapp and Erik van Veenendaal (Chair) The document was formally released by the General Assembly of the ISTQB on March, 28th, 2014 2 Change History Version 2.3 d.d. 03-28-2014 This new version has been developed to support the Foundation Extention Agile Tester syllabus. In addition a number of change request have been implemented in version 2.3 of the ISTQB Glossary. New terms added: Terms changed; - build verification test - acceptance criteria - burndown chart - accuracy testing - BVT - agile manifesto - content
    [Show full text]
  • Coverage-Guided Adversarial Generative Fuzzing Testing of Deep Learning Systems
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, VOL. XX, NO. X, XXXX 1 CAGFuzz: Coverage-Guided Adversarial Generative Fuzzing Testing of Deep Learning Systems Pengcheng Zhang, Member, IEEE, Qiyin Dai, Patrizio Pelliccione Abstract—Deep Learning systems (DL) based on Deep Neural Networks (DNNs) are increasingly being used in various aspects of our life, including unmanned vehicles, speech processing, intelligent robotics and etc. Due to the limited dataset and the dependence on manually labeled data, DNNs always fail to detect erroneous behaviors. This may lead to serious problems. Several approaches have been proposed to enhance adversarial examples for testing DL systems. However, they have the following two limitations. First, most of them do not consider the influence of small perturbations on adversarial examples. Some approaches take into account the perturbations, however, they design and generate adversarial examples based on special DNN models. This might hamper the reusability on the examples in other DNN models, thus reducing their generalizability. Second, they only use shallow feature constraints (e.g. pixel-level constraints) to judge the difference between the generated adversarial example and the original example. The deep feature constraints, which contain high-level semantic information - such as image object category and scene semantics, are completely neglected. To address these two problems, we propose CAGFuzz, a Coverage-guided Adversarial Generative Fuzzing testing approach for Deep Learning Systems, which generates adversarial examples for DNN models to discover their potential defects. First, we train an Adversarial Case Generator (AEG) based on general data sets. AEG only considers the data characteristics, and avoids low generalization ability. Second, we extract the deep features of the original and adversarial examples, and constrain the adversarial examples by cosine similarity to ensure that the semantic information of adversarial examples remains unchanged.
    [Show full text]
  • If You Can't Kill a Supermutant, You Have a Problem
    If You Can’t Kill a Supermutant, You Have a Problem Rahul Gopinath∗, Bjorn¨ Mathis†, Andreas Zeller‡ Saarland University Email: ∗[email protected], †[email protected] , ‡[email protected] Abstract—Quality of software test suites can be effectively Detected: yes and accurately measured using mutation analysis. Traditional mutation involves seeding first and sometimes higher order faults into the program, and evaluating each for detection. yes no However, traditional mutants are often heavily redundant, and it is desirable to produce the complete matrix of test cases vs mutants detected by each. Unfortunately, even the traditional yes yes mutation analysis has a heavy computational footprint due to the requirement of independent evaluation of each mutant by the complete test suite, and consequently the cost of evaluation of complete kill matrix is exorbitant. We present a novel approach of combinatorial evaluation of Fig. 1: Evaluation of supermutants. The filled dots indicate multiple mutants at the same time that can generate the complete mutations applied within the supermutants. The non detected mutant kill matrix with lower computational requirements. mutants are indicated by doubled borders. Our approach also has the potential to reduce the cost of execution of traditional mutation analysis especially for test suites with weak oracles such as machine-generated test suites, while at mutant kill matrix2 because each mutant needs to be evaluated the same time liable to only a linear increase in the time taken for mutation analysis in the worst case. by executing it against each test case. Note that the number of mutants that can be generated from a program is typically I.
    [Show full text]
  • White Box Testing Process
    Introduction to Dynamic Analysis Reference material • Introduction to dynamic analysis • Zhu, Hong, Patrick A. V. Hall, and John H. R. May, "Software Unit Test Coverage and Adequacy," ACM Computing Surveys, vol. 29, no.4, pp. 366-427, December, 1997 Common Definitions • Failure-- result that deviates from the expected or specified intent • Fault/defect-- a flaw that could cause a failure • Error -- erroneous belief that might have led to a flaw that could result in a failure • Static Analysis -- the static examination of a product or a representation of the product for the purpose of inferring properties or characteristics • Dynamic Analysis -- the execution of a product or representation of a product for the purpose of inferring properties or characteristics • Testing -- the (systematic) selection and subsequent "execution" of sample inputs from a product's input space in order to infer information about the product's behavior. • usually trying to uncover failures • the most common form of dynamic analysis • Debugging -- the search for the cause of a failure and subsequent repair Validation and Verification: V&V • Validation -- techniques for assessing the quality of a software product • Verification -- the use of analytic inference to (formally) prove that a product is consistent with a specification of its intent • the specification could be a selected property of interest or it could be a specification of all expected behaviors and qualities e.g., all deposit transactions for an individual will be completed before any withdrawal transaction
    [Show full text]