Dissertation Deep Pattern Mining for Program Repair

Total Page:16

File Type:pdf, Size:1020Kb

Dissertation Deep Pattern Mining for Program Repair PhD-FSTC-2019-79 The Faculty of Science, Technology and Communication Dissertation Presented on the 18/12/2019 in Luxembourg to obtain the degree of Docteur de l’Université du Luxembourg en Informatique by Kui Liu Born on 11th August 1988 in Jiangxi, China Deep Pattern Mining for Program Repair Dissertation Defense Committee Dr. Yves Le Traon, Dissertation Supervisor Professor, University of Luxembourg, Luxembourg Dr. Tegawendé Bissyandé, Chairman Assistant Professor, University of Luxembourg, Luxembourg Dr. Dongsun Kim, Vice Chairman Quality Assurance Engineer, Furiosa.ai, South Korea Dr. Jacques Klein, Expert Associate Professor, University of Luxembourg, Luxembourg Dr. Andreas Zeller Professor, Saarland University, Saarbrucken, Germany Dr. David Lo Associate Professor, Singapore Management University, Singapore Abstract Error-free software is a myth. Debugging thus accounts for a significant portion of software maintenance and absorbs a large part of software cost. In particular, the manual task of fixing bugs is tedious, error- prone and time-consuming. In the last decade, automatic bug-fixing, also referred to as automated program repair (APR) has boomed as a promising endeavor of software engineering towards alleviating developers’ burden. Several potentially promising techniques have been proposed making APR an increasingly prominent topic in both the research and practice communities. In production, APR will drastically reduce time-to-fix delays and limit downtime. In a development cycle, APR can help suggest changes to accelerate debugging. As an emergent domain, however, program repair has many open problems that the community is still exploring. Our work contributes to this momentum on two angles: the repair of programs for functionality bugs, and the repair of programs for method naming issues. The thesis starts with highlighting findings on key empirical studies that we have performed to inform future repair approaches. Then, we focus on template-based program repair scenarios and explore deep learning models for inferring accurate and relevant patterns. Finally, we integrate these patterns into APR pipelines, which yield the state of the art repair tools. The dissertation includes the following contributions: • Real-world Patch Study: Existing APR studies have shown that the state-of-the-art techniques in automated repair tend to generate patches only for a small number of bugs even with quality issues (e.g., incorrect behavior and nonsensical changes). To improve APR techniques, the community should deepen its knowledge on repair actions from real-world patches since most of the techniques rely on patches written by human developers. However, previous investigations on real-world patches are limited to statement level that is not sufficiently fine-grained to build this knowledge. This dissertation starts with deepening this knowledge via a systematic and fine-grained study of real-world Java program bug fixes. • Fault Localization Impact: Existing test-suite-based APR systems are highly dependent on the performance of the fault localization (FL) technique that is the process of the widely studied APR pipeline. However, APR systems generally focus on the patch generation, but tend to use similar but different strategies for fault localization. To assess the impact of FL on APR, we identify and investigate a practical bias caused by the FL step in a repair pipeline. We propose to highlight the different FL configurations used in the literature, and their impact on APR systems when applied to the real bugs. Then, we explore the performance variations that can be achieved by “tweaking” the FL step. • Fix Pattern Mining: Fix patterns (a.k.a. fix templates) have been studied in various APR scenarios. Particularly, fix patterns have been widely used in different APR systems. To date, fix pattern mining is mainly studied in three ways: manually summarization, transformation inferring and code change action statistics. In this dissertation, we explore mining fix patterns for static bugs leveraging deep learning and clustering algorithms. • Avatar: Fix pattern based patch generation is a promising direction in the APR community. Notably, it has been demonstrated to produce more acceptable and correct patches than the patches obtained with mutation operators through genetic programming. The performance of fix pattern based APR systems, however, depends on the fix ingredients mined from commit changes i in development histories. Unfortunately, collecting a reliable set of bug fixes in repositories can be challenging. We propose to investigate the possibility in an APR scenario of leveraging code changes that address violations by static bug detection tools. To that end, we build the Avatar APR system, which exploits fix patterns of static analysis violations as ingredients for patch generation. • TBar: Fix patterns are widely used in patch generation of APR, however, the repair performance of a single fix pattern is not studied. We revisit the performance of template-based APR to build comprehensive knowledge about the effectiveness of fix patterns, and to highlight the importance of complementary steps such as fault localization or donor code retrieval. To that end, we first investigate the literature to collect, summarize and label recurrently-used fix patterns. Based on the investigation, we build TBar, a straightforward APR tool that systematically attempts to apply these fix patterns to program bugs. We thoroughly evaluate TBar on the Defects4J benchmark. In particular, we assess the actual qualitative and quantitative diversity of fix patterns, as well as their effectiveness in yielding plausible or correct patches. • Debugging Method Names: Except the issues about semantic/static bugs in programs, we note that how to debug inconsistent method names automatically is important to improve program quality. In this dissertation, we propose a deep learning based approach to spotting and refactoring inconsistent method names in programs. ii To my dear wife Lie Ma. Acknowledgements This dissertation would not have been possible without the support of many people who in one way or another have contributed and extended their precious knowledge and experience in my PhD studies. It is my pleasure to express my gratitude to them. First of all, I would like to express my deepest thanks to my supervisor, Prof. Yves Le Traon, who has given me this great opportunity to come across continents to pursue my doctoral degree. He has always trusted and supported me with his great kindness throughout my whole PhD journey. Second, I am equally grateful to my daily advisers, Dr. Tegawendé Bissyandé and Dr. Dongsun Kim, who have introduced me into the world of automated program repair. Since then, working in this field is just joyful for me. I am particularly grateful to them as well as Dr. Jacques Klein, for their patience, valuable advice and flawless guidance. They have taught me how to perform research, how to write technical papers, and how to conduct fascinating presentations. Their dedicated guidance has made my PhD journey a fruitful and fulfilling experience. I am very happy for the friendship we have built up during the years. Third, I would like to extend my thanks to all my co-authors including Prof. Sin Yoo, Prof. Martin Monperrus, Prof. Suntae Kim, Dr. Li Li, Anil Koyuncu, Kisub Kim, Pingfan Kong, Taeyoung Kim, etc. for their valuable discussions and collaborations. I would like to thank all the members of my PhD defense committee, including chairman Dr. Tegawendé Bissyandé, Prof. David Lo, Prof. Andreas Zeller, my supervisor Prof. Yves Le Traon, and my daily adviser Dr. Dongsun Kim. It is my great honor to have them in my defense committee and I appreciate very much their efforts to examine my dissertation and evaluate my PhD work. I would like to also express my great thanks to all the friends that I have made in the Grand Duchy of Luxembourg for the memorable moments that we have had. More specifically, I would like to thank all the team members of SerVal at SnT for the great coffee breaks and interesting discussions. Finally, and more personally, I would like to express my deepest appreciation to my dear wife Lie Ma, for her everlasting support, encouragement, and love. Also, I can never overemphasize the importance of my mother in shaping who I am and giving me the gift of education. The last but not least, I would like to extend my thanks to my daughter Ruoshui Liu who gives me unexpected happiness and motivation for this dissertation. Without their support, this dissertation would not have been possible. Kui Liu University of Luxembourg December 2019 v Contents List of figures xi List of tables xv Contents xvi 1 Introduction 1 1.1 Motivation.........................................2 1.2 Challenges of Program Repair...............................3 1.2.1 Challenges in Fault Localization.........................3 1.2.2 APR-Inherited Challenges.............................4 1.2.3 Challenges of Debugging Method Names.....................5 1.3 Contributions........................................5 1.4 Roadmap..........................................6 2 Background 9 2.1 Automated Program Repair................................ 10 2.1.1 State-of-the-Art APR............................... 10 2.1.2 Fault Localizaiton in APR............................. 11 2.1.3 APR Assessment.................................. 12 2.2 Fix Pattern Inference.................................... 12 2.3 Debugging Method Names................................. 12 3 A Closer Look at Real-World
Recommended publications
  • APPLICATION of the DELTA DEBUGGING ALGORITHM to FINE-GRAINED AUTOMATED LOCALIZATION of REGRESSION FAULTS in JAVA PROGRAMS Master’S Thesis
    TALLINN UNIVERSITY OF TECHNOLOGY Faculty of Information Technology Marina Nekrassova 153070IAPM APPLICATION OF THE DELTA DEBUGGING ALGORITHM TO FINE-GRAINED AUTOMATED LOCALIZATION OF REGRESSION FAULTS IN JAVA PROGRAMS Master’s thesis Supervisor: Juhan Ernits PhD Tallinn 2018 TALLINNA TEHNIKAÜLIKOOL Infotehnoloogia teaduskond Marina Nekrassova 153070IAPM AUTOMATISEETITUD SILUMISE RAKENDAMINE VIGADE LOKALISEERIMISEKS JAVA RAKENDUSTES Magistritöö Juhendaja: Juhan Ernits PhD Tallinn 2018 Author’s declaration of originality I hereby certify that I am the sole author of this thesis. All the used materials, references to the literature and the work of others have been referred to. This thesis has not been presented for examination anywhere else. Author: Marina Nekrassova 08.01.2018 3 Abstract In software development, occasionally, in the course of software evolution, the functionality that previously worked as expected stops working. Such situation is typically denoted by the term regression. To detect regression faults as promptly as possible, many agile development teams rely nowadays on automated test suites and the practice of continuous integration (CI). Shortly after the faulty change is committed to the shared mainline, the CI build fails indicating the fact of code degradation. Once the regression fault is discovered, it needs to be localized and fixed in a timely manner. Fault localization remains mostly a manual process, but there have been attempts to automate it. One well-known technique for this purpose is delta debugging algorithm. It accepts as input a set of all changes between two program versions and a regression test that captures the fault, and outputs a minimized set containing only those changes that directly contribute to the fault (in other words, are failure-inducing).
    [Show full text]
  • Repairnator Patches Programs Automatically
    Repairnator patches programs automatically Ubiquity Volume 2019, Number July (2019), Pages 1-12 Martin Monperrus, Simon Urli, Thomas Durieux, Matias Martinez, Benoit Baudry, Lionel Seinturier DOI: 10.1145/3349589 Repairnator is a bot. It constantly monitors software bugs discovered during continuous integration of open-source software and tries to fix them automatically. If it succeeds in synthesizing a valid patch, Repairnator proposes the patch to the human developers, disguised under a fake human identity. To date, Repairnator has been able to produce patches that were accepted by the human developers and permanently merged into the code base. This is a milestone for human-competitiveness in software engineering research on automatic program repair. Programmers have always been unhappy that mistakes are so easy, and they need to devote much of their work to finding and repairing defects. Consequently, there has been considerable research on detecting bugs, preventing bugs, or even proving their absence under specific assumptions. Yet, regardless of how bugs are found, this vast research has assumed that a human developer would fix the bugs. A decade ago, a group of researchers took an alternative path: regardless of how a bug is detected, they envisioned that an algorithm would analyze it to synthesize a patch. The corresponding research field is called "automated program repair." This paradigm shift has triggered a research thrust with significant achievements [1]: Some past bugs in software repositories would have been amenable to automatic correction. In this following example, a patch fixes an if- statement with incorrect condition (a patch inserts, deletes, or modifies source code): - if (x < 10) + if (x ≤ 10) foo(); Armed with the novel mindset of repairing instead of detecting, and with opulent and cheap computing power, it is possible today to automatically explore large search spaces of possible patches.
    [Show full text]
  • Karampatsis2021.Pdf (1.617Mb)
    This thesis has been submitted in fulfilment of the requirements for a postgraduate degree (e.g. PhD, MPhil, DClinPsychol) at the University of Edinburgh. Please note the following terms and conditions of use: This work is protected by copyright and other intellectual property rights, which are retained by the thesis author, unless otherwise stated. A copy can be downloaded for personal non-commercial research or study, without prior permission or charge. This thesis cannot be reproduced or quoted extensively from without first obtaining permission in writing from the author. The content must not be changed in any way or sold commercially in any format or medium without the formal permission of the author. When referring to this work, full bibliographic details including the author, title, awarding institution and date of the thesis must be given. Scalable Deep Learning for Bug Detection Rafael-Michael Karampatsis I V N E R U S E I T H Y T O H F G E R D I N B U Doctor of Philosophy Institute for Language, Cognition and Computation School of Informatics University of Edinburgh 2020 Dedicated to my brother Elias and to my lovely parents for encouraging me to never stop pursuing my dreams Declaration I declare that this thesis has been composed by myself and that this work has not been submitted for any other degree or professional qualification. I confirm that the work submitted is my own, except where work which has formed part of jointly-authored publications has been included. My contribution and those of the other authors to this work have been explicitly indicated at the beginning of each chapter.
    [Show full text]
  • Bug Assignment: Insights on Methods, Data and Evaluation
    Bug Assignment: Insights on Methods, Data and Evaluation by Ali Sajedi Badashian A thesis submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy Department of Computing Science University of Alberta © Ali Sajedi Badashian, 2018 Abstract The bug-assignment problem is prevalently defined as ranking developers based on their competence to fix a given bug. Previous methods in the area used machine-learning or information-retrieval techniques and considered textual elements of bug reports as evidence of expertise of developers to give each of the developers a score and sort the developers for the given bug report. Despite the importance of the subject and the substantial attention it has received from researchers during last 15 years, still it is a challenging, time-consuming task in large software projects. Even there is still no unanimity on how to validate and comparatively evaluate bug-assignment methods and, often times, methods reported in the literature are not repro- ducible. In this thesis, we make the following contributions. 1) We investigate the effect of three important experimental-design parameters in the previous research; the evaluation metric(s) they report, their definition of who the real assignee is, and the community of developers they consider as candidate assignees. Supported by our experiment on a comprehensive data set of bugs we collected from Github, we propose a systematic framework for evaluation of bug-assignment research. Addressing those aspects supports better evaluation, enables replication of the study and promotes its usage in other research or industrial applications. 2) We propose a new bug-assignment approach relying on the set of Stack Overflow tags as the thesaurus of programming keywords.
    [Show full text]
  • Genetic Improvement
    Emergent and Self-Adaptive Systems: Theory and Practice Data Science Institute, University of Lancaster, 19-20th Oct 2017 Genetic Improvement W. B. Langdon Computer Science, University College, London A Comprehensive Survey, IEEE TEVC Starting in 2015 annual workshops on GI 17.10.2017 Genetic Improvement W. B. Langdon Computer Science, University College, London A Comprehensive Survey, IEEE TEVC Starting in 2015 annual workshops on GI Genetic Improvement of Programs • What is Genetic Improvement • What has Genetic Improvement done – Technology behind automatic bug fixing – Improvement of existing code: speedup, transplanting, program adaptation • Goals of Genetic Improvement – more automated/higher level programming • What can GI do for Emergent and Self-Adaptive Systems? W. B. Langdon, UCL 3 What is Genetic Improvement W. B. Langdon, UCL 4 Genetic Improvement Use GP to evolve a population of computer programs – Start with representation of human written code – Programs’ fitness is determined by running them – Better programs are selected to be parents – New generation of programs are created by randomly combining above average parents or by mutation. – Repeat generations until solution found. Free Free PDF E-book kindle Charles Darwin 1809-1882 Typical GI Evolutionary Cycle Auto-clean up W. B. Langdon, UCL 6 GI Automatic Coding • Genetic Improvement does not start from zero • Use existing system – Source of non-random code – Use existing code as test “Oracle”. (Program is its own functional specification) – Can always compare against previous version – Easier to tell if better than if closer to poorly defined goal functionality. • Testing scales (sort of). Hybrid with “proof” systems 7 What has Genetic Improvement done W.
    [Show full text]
  • Fixing Recurring Crash Bugs Via Analyzing Q&A Sites
    Fixing Recurring Crash Bugs via Analyzing Q&A Sites Qing Gao, Hansheng Zhang, Jie Wang, Yingfei Xiong, Lu Zhang, Hong Mei Key Laboratory of High Confidence Software Technologies (Peking University), MoE Institute of Software, School of Electronics Engineering and Computer Science, Peking University, Beijing, 100871, P. R. China fgaoqing11, zhanghs12, wangjie14, xiongyf04, zhanglu, [email protected] framework, which indicates that Q&A sites are more or less a Abstract—Recurring bugs are common in software systems, reliable source for obtaining fixes for a large portion of bugs. especially in client programs that depend on the same framework. As the first step of fixing recurring bugs via analyzing Existing research uses human-written templates, and is limited to Q&A sites, we focus on a specific class of bugs: crash bugs. certain types of bugs. In this paper, we propose a fully automatic Crash bugs are among the most severe bugs in real-world approach to fixing recurring crash bugs via analyzing Q&A sites. software systems, and a lot of research efforts have been put By extracting queries from crash traces and retrieving a list of Q&A pages, we analyze the pages and generate edit scripts. Then into handling crash bugs, including localizing the causes of we apply these scripts to target source code and filter out the crash bugs [5], keeping the system running under the presence incorrect patches. The empirical results show that our approach of crashes [6], and checking the correctness of fixes to crash is accurate in fixing real-world crash bugs, and can complement bugs [7].
    [Show full text]
  • State-Of-The-Art in Self-Healing and Patch Generation
    ICT PROJECT 258109 Monitoring Control for Remote Software Maintenance Project number: 258109 Project Acronym: FastFix Project Title: Monitoring Control for Remote Software Maintenance Call identifier: FP7-ICT-2009-5 Instrument: STREP Thematic Priority: Internet of Services, Software, Virtualisation Start date of the project: June 1, 2010 Duration: 30 months D6.1: State-of-the-art in Self-Healing and Patch Generation Lead contractor: Lero Benoit Gaudin, Emil Vassev, Mike Hinchey, Paddy Author(s): Nixon, Dennis Pagano, João Coelho Garcia, Nitesh Narayan. Submission date: December 2010 Dissemination level: Public Project co-founded by the European Commission under the Seventh Framework Programme © S2, TUM, LERO, INESC ID, TXT, PRODEVELOP Abstract: As software maintenance processes still rely on human expertise and manual intervention, they usually require important efforts in terms of time and costs. Research has been conducted in the past recent years in order to try to overcome this issue by considering systems that would maintain themselves. This concept gave birth to Autonomic Computing (AC), which aims to transfer and automate part of the maintenance processes to the system itself. Self-healing is one of the properties that AC seeks to equip system with. Self-healing aims for the system to automatically recover from faults, possibly fixing them through patch generation. This document introduces self-healing approaches as well as other research areas that are very related to this concept, such as fault-tolerance, Automatic Diagnosis and Automatic Repair. The latter contains works related to automatic patch generation. This document has been produced in the context of the FastFix Project. The FastFix project is part of the European Community’s Seventh Framework Program for research and development and is as such funded by the European Commission.
    [Show full text]
  • Obstacles in Fully Automatic Program Repair: a Survey S
    Obstacles in Fully Automatic Program Repair: A survey S. Amirhossein Mousavi, Donya Azizi Babani, Francesco Flammini Abstract The current article is an interdisciplinary attempt to decipher automatic program repair processes. The review is done by the manner typical to human science known as “diffraction”. We attempt to spot a gap in the literature of self-healing and self-repair operations and further investigate the approaches that would enable us to tackle the problems we face. As a conclusion, we suggest a shift in the current approach to automatic program repair operations in order to attain our goals. The emphasis of this review is to achieve “full automation.” Several obstacles are shortly mentioned in the current essay but the main shortage that is covered is the “overfitting” obstacle, and this particular problem is investigated in the stream that is related to “full automation” of the repair process. 1. Introduction In today’s world, security and dependability happen to become the most significant priority of system requirements whereby the system functions even in the presence of faults or cyber- attacks. An exhaustive review; “Epic failures: 11 infamous software bugs”, [1] illustrates most famous software system disasters caused by the system bug which has resulted in massive financial losses and even human lives losses. A bug or failure can cause drastic and unrecoverable losses in crucial and sensitive systems such as aviation systems, highways or railways traffic control systems etc. Debugging processes almost impose up to 50% of the development expenses of software production [2], [3]. However, by thoroughly investigating the existing literature at hand, one may encounter various definitions, explanations, and synonyms for bugs, fault, failure, defects, etc.
    [Show full text]
  • Tailoring Programs for Static Analysis Via Program Transformation
    Tailoring Programs for Static Analysis via Program Transformation Rijnard van Tonder Claire Le Goues School of Computer Science School of Computer Science Carnegie Mellon University Carnegie Mellon University USA USA [email protected] [email protected] ABSTRACT analyses in practice. Analyses must approximate, both theoreti- Static analysis is a proven technique for catching bugs during soft- cally [22, 25] and in the interest of practicality [9, 11]. Developers ware development. However, analysis tooling must approximate, are sensitive to whether tools integrate seamlessly into their exist- both theoretically and in the interest of practicality. False positives ing workflows [21]; at minimum analyses must be fast, surface few are a pervading manifestation of such approximations—tool con- false positives, and provide mechanisms to suppress warnings [10]. figuration and customization is therefore crucial for usability and The problem is that the choices that tools make toward these ends directing analysis behavior. To suppress false positives, develop- are broadly generic, and the divergence between tool assumptions ers readily disable bug checks or insert comments that suppress and program reality (i.e., language features or quality concerns) spurious bug reports. Existing work shows that these mechanisms can lead to unhelpful, overwhelming, or incorrect output [20]. fall short of developer needs and present a significant pain point Tool configuration and customization is crucial for usability and for using or adopting analyses. We draw on the insight that an directing analysis behavior. The inability to easily and selectively analysis user always has one notable ability to influence analysis disable analysis checks and suppress warnings presents a significant behavior regardless of analyzer options and implementation: modi- pain point for analysis users [10].
    [Show full text]
  • Automatic Patch Generation Learned from Human-Written Patches
    A Critical Review of “Automatic Patch Generation Learned from Human-Written Patches”: Essay on the Problem Statement and the Evaluation of Automatic Software Repair Martin Monperrus University of Lille & INRIA, France [email protected] The automatic detection of bugs has been a vast research field for decades, with a large spectrum of static and dy- ABSTRACT namic techniques. Active research on the automatic repair1 of bugs is more recent. A seminal line of research started in At ICSE’2013, there was the first session ever dedicated to 2009 with the GenProg system [37, 15], and at the 2013 In- automatic program repair. In this session, Kim et al. pre- ternational Conference on Software Engineering, there was sented PAR, a novel template-based approach for fixing Java the first session ever dedicated to automatic program repair. bugs. We strongly disagree with key points of this paper. The PAR system [19] was presented there, it is an ap- Our critical review has two goals. First, we aim at explain- proach for automatically fixing bugs of Java code. The re- ing why we disagree with Kim and colleagues and why the pair problem statement is the same as GenProg [15] “given a reasons behind this disagreement are important for research test suite with at least one failing test, generate a patch that on automatic software repair in general. Second, we aim makes all test cases passing”. PAR introduces a new tech- at contributing to the field with a clarification of the essen- nique to fix bugs, based on templates. Each of PAR’s ten tial ideas behind automatic software repair.
    [Show full text]
  • Leveraging Light-Weight Analyses to Aid Software Maintenance
    Leveraging Light-Weight Analyses to Aid Software Maintenance A Dissertation Presented to the Faculty of the School of Engineering and Applied Science University of Virginia In Partial Fulfillment of the requirements for the Degree Doctor of Philosophy (Computer Science) by Zachary P. Fry May 2014 c 2014 Zachary P. Fry Abstract While software systems have become a fundamental part of modern life, they require maintenance to continually function properly and to adapt to potential environment changes [1]. Software maintenance, a dominant cost in the software lifecycle [2], includes both adding new functionality and fixing existing problems, or “bugs,” in a system. Software bugs cost the world’s economy billions of dollars annually in terms of system down-time and the effort required to fix them [3]. This dissertation focuses specifically on corrective software maintenance — that is, the process of finding and fixing bugs. Traditionally, managing bugs has been a largely manual process [4]. This historically involved developers treating each defect as a unique maintenance concern, which results in a slow process and thus a high aggregate cost for finding and fixing bugs. Previous work has shown that bugs are often reported more rapidly than companies can address them, in practice [5]. Recently, automated techniques have helped to ease the human burden associated with maintenance activities. However, such techniques often suffer from a few key drawbacks. This thesis argues that automated maintenance tools often target narrowly scoped problems rather than more general ones. Such tools favor maximizing local, narrow success over wider applicability and potentially greater cost benefit. Additionally, this dissertation provides evidence that maintenance tools are traditionally evaluated in terms of functional correctness, while more practical concerns like ease-of-use and perceived relevance of results are often overlooked.
    [Show full text]
  • The Living Review on Automated Program Repair Martin Monperrus
    The Living Review on Automated Program Repair Martin Monperrus To cite this version: Martin Monperrus. The Living Review on Automated Program Repair. 2020. hal-01956501v2 HAL Id: hal-01956501 https://hal.archives-ouvertes.fr/hal-01956501v2 Preprint submitted on 15 Dec 2020 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. The Living Review on Automated Program Repair Martin Monperrus version of December 15, 2020, http://bit.ly/2CehUt5 Concept This paper is a living review on automatic program repair1. Compared to a traditional survey, a living review evolves over time. I use a concise bullet-list style meant to be easily accessible by the greatest number of readers, in particular students and practitioners. Within a section, all papers are ordered in a reverse chronological order, so as to easily get the research timeline. The references are sorted chronologically and years are explicitly stated inline to easily grasp the most recent references. Inclusion criteria The inclusion criteria are that the considered papers 1) must be about automatic repair with some kind of patch generation (runtime repair without patch generation is excluded2); 2) must contain a reasonable amount of material (at least 4 double- column pages); 3) are stored on an durable site (notable publisher, arXiv, Zenodo).
    [Show full text]