Automatic Bug-Finding Techniques for Large Software Projects

Total Page:16

File Type:pdf, Size:1020Kb

Automatic Bug-Finding Techniques for Large Software Projects Masaryk University Faculty}w¡¢£¤¥¦§¨ of Informatics!"#$%&'()+,-./012345<yA| Automatic Bug-finding Techniques for Large Software Projects Ph.D. Thesis Jiri Slaby Brno, 2013 Revision 159 Supervisor: prof. RNDr. Antonín Kučera, PhD. Co-supervisor: assoc. prof. RNDr. Jan Strejček, PhD. ii Copyright c Jiri Slaby, 2013 All rights reservedtož to ja iii iv Abstract This thesis studies the application of contemporary automatic bug-finding techniques to large software projects. As they tend to be complex systems, there is a call for efficient analyzers that can exercise such projects without generating too many false positives. To fulfill these requirements, this thesis aims at the static analysis with the primary focus at techniques like the abstract interpretation and symbolic execution. Since their introduction in 1970’s, their applicability is increasing thanks to current fast processors and further improvements of the techniques. The thesis summarizes the current state of the art of chosen techniques implemented in the current static analyzers. Further, we discuss some large code bases which can be used as a test-bed for the techniques. Finally, we choose the Linux kernel. On the top of the explored state of the art, we de- velop our own techniques. We present the framework Stanse which utilizes the abstract interpretation and has no issues with handling the Linux kernel. The framework also prunes false positives resulting from some checkers. The pruning method is based on the pattern matching, but it turns out to be weak. Hence we subsequently try to improve the situation by the symbolic execution. For that purpose we use an executor called Klee in our new tool Symbiotic. But since the symbolic execution requires a long computation time, we slice the code by a static slicer first. During our experiments, we cumulated knowledge about categories of bugs usually present in code such as the Linux kernel. We investigated it more and developed a database of known bugs in a fixed version of the kernel. It is publicly available and contains information about errors and false positives found by several bug-finding tools. This can help developers of new static analyzers to compare their brand new tool against the database and see how their tool stands with respect to false positives, missed errors and of course error hits. We also participated with Symbiotic in the Software Verification Com- petition. The tool had to be adapted to the competition needs. Since we participated only with a too preliminary version of the tool, the results were not so encouraging. Nonetheless, the tool was improved later and it gives good results now – we collect nearly all points in some competition categories. We plan to compete in the competition with our tuned tool again. v vi Acknowledgement I would like to thank my supervisors prof. Kučera and assoc. prof. Strejček for their patience with my ignorance during the nice past years spent at the university. I really enjoyed that time and was taught many new things which would stay unrevealed for me otherwise. I thank my colleagues for their ideas and inspiration, especially Marek with whom we started the studies and walked most of the path. Of course, my parents and family altogether deserve many thanks for their eternal support. Without them and their care, I would be only a poor man on this Earth. And last, but definitely not least, Zuzanka must not be missing on this list. Her love, patience, and care is much appreciated. I am really glad we met each other. vii viii Contents 1 Introduction 1 1.1 Motivation . .3 1.1.1 Requirements for Analyzers . .4 1.1.2 Picking the Project . .7 1.2 Background . .8 1.2.1 Classes of Reports . .9 1.2.2 Theoretical Properties . .9 1.2.3 Sensitivity of Analyzers . 11 1.2.4 Types of Analyses . 12 1.2.5 Other Views . 13 1.2.6 Code Parsing . 13 1.2.7 Pointer Analysis . 16 1.3 Thesis Goals . 20 1.4 Thesis Contribution . 21 1.5 Thesis Organization . 21 2 State of the Art 23 2.1 Pattern Matching . 24 2.2 Data-flow Analysis . 25 2.3 Abstract Interpretation . 29 2.4 Symbolic Execution . 35 2.4.1 Concolic Execution . 39 2.5 Program Slicing . 42 2.6 Model Checking . 45 2.7 Industrial Tools . 47 2.8 Summary of the Linux Kernel Checkers . 48 3 Stanse 51 3.1 Framework Functionality . 52 3.2 Checkers . 59 3.2.1 AutomatonChecker ................. 59 3.2.2 LockChecker ...................... 61 3.2.3 ThreadChecker .................... 61 ix 3.2.4 ReachabilityChecker ................ 61 3.3 Results on Linux Kernel . 62 3.3.1 Important Errors Found by Stanse .......... 64 3.4 Discussion . 66 3.5 Conclusions . 69 4 Symbiotic 71 4.1 Running Example . 72 4.2 Our Contribution . 74 4.3 Instrumentation . 76 4.4 Slicing . 79 4.5 Symbolic Execution . 81 4.6 Implementation . 81 4.6.1 Prepare . 83 4.6.2 Slicer . 84 4.6.3 Kleerer . 85 4.6.4 Symbolic Execution . 86 4.6.5 Stitching Everything Together . 86 4.7 Experimental Results . 88 4.8 Related Work . 89 4.9 Future Work . 90 4.10 Conclusion . 91 5 Building Bug-database 93 5.1 Database Structure . 95 5.2 Database Frontend . 95 5.3 Current Contents of the Database . 98 5.3.1 Sources of Bug-Reports . 98 5.3.2 Linux Kernel Error Types . 99 5.3.3 Bug-Reports in the Database . 101 5.4 Use of the Database . 102 5.5 Conclusion and Future Plans . 105 6 Participating SV-COMP 107 6.1 The Competition . 108 6.2 Our Participation . 110 6.3 Results . 111 6.4 Conclusion . 113 7 Conclusion 115 Bibliography 117 x A Author’s Contribution 143 A.1 List of Publications . 143 A.2 List of Software . 144 xi xii Chapter 1 Introduction In the contemporary world, we are surrounded by devices making our lives easier. It counts from modern lifts which speak to us by natural language to supercomputers helping to treat cancer such as IBM Watson [But12]. Since Moore’s law [Moo65] has proven to hold for the past four decades, most present devices are featuring quite fast processing units thanks to the increasing count of built-in transistors. This allows developers to implement many interesting but complex algorithms. However being written by human beings vastly, it may lead to errant behaviour of the implementations as they are only as strong as the weakest link in the imaginary chain. One example is the error that led to a crash of the Ariane 5 rocket that should launch a satellite to the orbit of the Earth [Dow97]. $370M were buried in the middle of nowhere due to a programmer’s omission to check for an overflow in the acceleration computation. So these errors obviously cause companies revenue losses. It is known the later in the development cycle an error is revealed, the more expensive it is to fix the error [BB01]. The situation becomes even worse when the software is deployed at the customer’s site or in a rocket flying 13 000 ft above the sea level already. Companies have been always desiring for cheap solutions perfectly fit- ting their business plans. And yet, solutions which are based on algorithms that will cause no harm when deployed, completely error-free at best. But they are out of luck as, in general, there cannot be a tool which would tell us whether some algorithm is error-free. If there were one, it would be in contradiction of the Halting problem [Tur36]. However while it is impossible in theory, it does not necessarily mean we should not at least try hard for non-general cases when dealing with algorithms implemented in software. There, the implementations have at least finite state space thanks to finite memory. Even though the limited state space is still huge, companies have been always using techniques helping to avoid and remove errors from their programs. Even some kind of standardization of used coding principles like dictated coding style may reduce the number of introduced errors [BB01]. 1 2 CHAPTER 1. INTRODUCTION The reason may be that a project management imposes a duty on develop- ers to write simpler code. For example, the management can ban complex constructs which are known to introduce errors. Or, for some environments like embedded systems, heap allocations can be disallowed. This results in weeding out one whole category of programming errors – memory leaks. Some companies also started adopting standards like ISO 9001:2000 [ISO00] or MISRA C [Bur13]. Their customers are assured that the internal man- agement of such a company conforms to a standardized behaviour that tries to increase quality of developed products. Standardization cannot by definition evade programming errors com- pletely. Since the early times of computer technology, programmers in com- panies, or testers if they were distinct persons, were forced to test programs. Testers were running the programs with diverse input, random inclusive, to make the program misbehave or crash in order to catch and fix program errors. This approach is called dynamic program testing [MSTW10]. It is used very often as it is relatively cheap and mostly very effective technique. Thanks to executing the code in real-time with real data, whenever the dy- namic testing reveals an error, it is guaranteed to be genuine (if we look aside from malfunction of broken hardware). But it indeed takes a huge amount of time to perform systematic testing of large projects. And yet, the dynamic testing needs obviously a runnable executable which is not usually available in the early phases of development.
Recommended publications
  • Undefined Behaviour in the C Language
    FAKULTA INFORMATIKY, MASARYKOVA UNIVERZITA Undefined Behaviour in the C Language BAKALÁŘSKÁ PRÁCE Tobiáš Kamenický Brno, květen 2015 Declaration Hereby I declare, that this paper is my original authorial work, which I have worked out by my own. All sources, references, and literature used or excerpted during elaboration of this work are properly cited and listed in complete reference to the due source. Vedoucí práce: RNDr. Adam Rambousek ii Acknowledgements I am very grateful to my supervisor Miroslav Franc for his guidance, invaluable help and feedback throughout the work on this thesis. iii Summary This bachelor’s thesis deals with the concept of undefined behavior and its aspects. It explains some specific undefined behaviors extracted from the C standard and provides each with a detailed description from the view of a programmer and a tester. It summarizes the possibilities to prevent and to test these undefined behaviors. To achieve that, some compilers and tools are introduced and further described. The thesis contains a set of example programs to ease the understanding of the discussed undefined behaviors. Keywords undefined behavior, C, testing, detection, secure coding, analysis tools, standard, programming language iv Table of Contents Declaration ................................................................................................................................ ii Acknowledgements .................................................................................................................. iii Summary .................................................................................................................................
    [Show full text]
  • Codesonar the SAST Platform for Devsecops
    DATASHEET CodeSonar The SAST Platform for DevSecOps Accelerate Application Security Software teams are under constant pressure to deliver more content with higher complexity, in shorter timeframes, with increased quality and security. Static Application Security Testing is a proven best practice to help software teams deliver the best code in the shortest timeframe. GrammaTech has been a leader in this field for over 15 years with CodeSonar delivering multi-language SAST capabilities for enterprises where software quality and software security matter. DevSecOps - Speed and Scale Language Support Software developers need rapid feedback on security CodeSonar supports many popular languages, including vulnerabilities in their code. CodeSonar can be integrated into C/C++, Java, C# and Android, as well as support for native software development environments, works unobtrusively to binaries in Intel, Arm and PowerPC instruction set architectures. the developer and provides rapid feedback. CodeSonar also supports OASIS SARIF, for exchange of information with other tools in the DevSecOps environment. Examples of Defects Detected • Buffer over- and underruns • Cast and conversion problems • Command injection • Copy-paste error • Concurrency • Ignored return value • Memory leak • Tainted data • Null pointer dereference • Dangerous function • Unused parameter / value And hundreds more Security Quality Privacy Broad coverage of security vulnerabilities, Integration into DevSecOps to improve Checkers that detect performance including OWASP Top10, SANS/CWE 25. quality of the code and developer impacts such as unnecessary test for Support for third party applications efficiency. Find code quality and nullness, creation of redundant objects or through byte code analysis. performance issues at speed. superfluous memory writes. Team Support Built In CodeSonar is designed to support large teams.
    [Show full text]
  • Grammatech Codesonar Product Datasheet
    GRAMMATECH CODESONAR PRODUCT DATASHEET Static Code Analysis and Static Application Security Testing Software Assurance Services Delivered by a senior software CodeSonar empowers teams to quickly analyze and validate the code – source and/or binary – engineer, the software assurance identifying serious defects or bugs that cause cyber vulnerabilities, system failures, poor services focus on automating reliability, or unsafe conditions. reporting on your software quality, creating an improvement plan and GrammaTech’s Software Assurance Services provide the benets of CodeSonar to your team in measuring your progress against that an accelerated timeline and ensures that you make the best use of SCA and SAST. plan. This provides your teams with reliable, fast, actionable data. GrammaTech will manage the static Enjoy the Benets of the Deepest Static Analysis analysis engine on your premises for you, such that your resources can Employ Sophisticated Algorithms Comply with Coding Standards focus on developing software. The following activities are covered: CodeSonar performs a unied dataow and CodeSonar supports compliance with standards symbolic execution analysis that examines the like MISRA C:2012, IS0-26262, DO-178B, Integration in your release process computation of the entire program. The US-CERT’s Build Security In, and MITRE’S CWE. Integration in check-in process approach does not rely on pattern matching or similar approximations. CodeSonar’s deeper Automatic assignment of defects analysis naturally nds defects with new or Reduction of parse errors unusual patterns. Review of warnings Analyze Millions of Lines of Code Optimization of conguration CodeSonar can perform a whole-program Improvement plan and tracking analysis on 10M+ lines of code.
    [Show full text]
  • Codesonar for C# Datasheet
    DATASHEET CodeSonar C# SAST when Safety and Security Matter Accelerate Application Security Software teams are under constant pressure to deliver more content with higher complexity, in shorter timeframes, with increased quality and security. Static Application Security Testing is a proven best practice to help software teams deliver the best code in the shortest timeframe. GrammaTech has been a leader in this field for over 15 years with CodeSonar delivering C# multi-language SAST capabilities for enterprises where software quality and software security matter. DevSecOps - Speed and Scale Abstract Interpretation Software developers need rapid feedback on security GrammaTech SAST tools use the concept of abstract vulnerabilities in their work artifacts. CodeSonar can be interpretation to statically examine all the paths through the integrated into software development environments, can work application and understand the values of variables and how they unobtrusively to the developer and provide rapid feedback. impact program state. Abstract interpretation gives CodeSonar for C# the highest scores in vulnerability benchmarks. Security Quality Privacy Broad coverage of security vulnerabilities, Integration into DevSecOps to improve Checkers that detect performance including OWASP Top10, SANS/CWE 25. quality of the code and developer impacts such as unnecessary test for Support for third party applications efficiency. Find code quality and nullness, creation of redundant objects or through byte code analysis. performance issues at speed. superfluous memory writes. Use Cases Enterprise customers are using C# in their internal applications, either in-house built, or built by a third-party. Static analysis is needed to to improve security and quality to drive business continuity. Mobile and Client customers are using C# on end-points, sometimes in an internet-of-things deployment, or to provide information to mobile users.
    [Show full text]
  • Software Vulnerabilities Principles, Exploitability, Detection and Mitigation
    Software vulnerabilities principles, exploitability, detection and mitigation Laurent Mounier and Marie-Laure Potet Verimag/Université Grenoble Alpes GDR Sécurité – Cyber in Saclay (Winter School in CyberSecurity) February 2021 Software vulnerabilities . are everywhere . and keep going . 2 / 35 Outline Software vulnerabilities (what & why ?) Programming languages (security) issues Exploiting a sofwtare vulnerability Software vulnerabilities mitigation Conclusion Example 1: password authentication Is this code “secure” ? boolean verify (char[] input, char[] passwd , byte len) { // No more than triesLeft attempts if (triesLeft < 0) return false ; // no authentication // Main comparison for (short i=0; i <= len; i++) if (input[i] != passwd[i]) { triesLeft-- ; return false ; // no authentication } // Comparison is successful triesLeft = maxTries ; return true ; // authentication is successful } functional property: verify(input; passwd; len) , input[0::len] = passwd[0::len] What do we want to protect ? Against what ? I confidentiality of passwd, information leakage ? I control-flow integrity of the code I no unexpected runtime behaviour, etc. 3 / 35 Example 2: make ‘python -c ’print "A"*5000’‘ run make with a long argument crash (in recent Ubuntu versions) Why do we need to bother about crashes (wrt. security) ? crash = consequence of an unexpected run-time error not trapped/foreseen by the programmer, nor by the compiler/interpreter ) some part of the execution: I may take place outside the program scope/semantics I but can be controled/exploited by an attacker (∼ “weird machine”) out of scope execution runtime error crash normal execution possibly exploitable ... security breach ! ,! may break all security properties ... from simple denial-of-service to arbitrary code execution Rk: may also happen silently (without any crash !) 4 / 35 Back to the context: computer system security what does security mean ? I a set of general security properties: CIA Confidentiality, Integrity, Availability (+ Non Repudiation + Anonymity + .
    [Show full text]
  • Code Review Guide
    CODE REVIEW GUIDE 2.0 RELEASE Project leaders: Larry Conklin and Gary Robinson Creative Commons (CC) Attribution Free Version at: https://www.owasp.org 1 F I 1 Forward - Eoin Keary Introduction How to use the Code Review Guide 7 8 10 2 Secure Code Review 11 Framework Specific Configuration: Jetty 16 2.1 Why does code have vulnerabilities? 12 Framework Specific Configuration: JBoss AS 17 2.2 What is secure code review? 13 Framework Specific Configuration: Oracle WebLogic 18 2.3 What is the difference between code review and secure code review? 13 Programmatic Configuration: JEE 18 2.4 Determining the scale of a secure source code review? 14 Microsoft IIS 20 2.5 We can’t hack ourselves secure 15 Framework Specific Configuration: Microsoft IIS 40 2.6 Coupling source code review and penetration testing 19 Programmatic Configuration: Microsoft IIS 43 2.7 Implicit advantages of code review to development practices 20 2.8 Technical aspects of secure code review 21 2.9 Code reviews and regulatory compliance 22 5 A1 3 Injection 51 Injection 52 Blind SQL Injection 53 Methodology 25 Parameterized SQL Queries 53 3.1 Factors to Consider when Developing a Code Review Process 25 Safe String Concatenation? 53 3.2 Integrating Code Reviews in the S-SDLC 26 Using Flexible Parameterized Statements 54 3.3 When to Code Review 27 PHP SQL Injection 55 3.4 Security Code Review for Agile and Waterfall Development 28 JAVA SQL Injection 56 3.5 A Risk Based Approach to Code Review 29 .NET Sql Injection 56 3.6 Code Review Preparation 31 Parameter collections 57 3.7 Code Review Discovery and Gathering the Information 32 3.8 Static Code Analysis 35 3.9 Application Threat Modeling 39 4.3.2.
    [Show full text]
  • Applications of SMT Solvers to Program Verification
    Nikolaj Bjørner Leonardo de Moura Applications of SMT solvers to Program Verification Rough notes for SSFT 2014 Prepared as part of a forthcoming revision of Daniel Kr¨oningand Ofer Strichman's book on Decision Procedures May 19, 2014 Springer Contents 1 Applications of SMT Solvers ............................... 5 1.1 Introduction . .5 1.2 From Programs to Logic . .6 1.2.1 An Imperative Programming Language Substrate. .6 1.2.2 Programs that are logic in disguise . .8 1.3 Test-case Generation using Dynamic Symbolic Execution . .9 1.3.1 The Application . .9 1.3.2 An Example . 10 1.3.3 Methodolgy. 11 1.3.4 Interfacing with SMT Solvers . 13 1.3.5 Industrial Adoption . 14 1.4 Symbolic Software Model checking . 15 1.4.1 The Application . 15 1.4.2 An Example . 15 1.4.3 Methodolgy. 16 1.4.4 Interfacing with SMT Solvers . 18 1.4.5 Industrial Adoption . 18 1.5 Static Analysis . 19 1.5.1 The Application . 19 1.5.2 An Example . 20 1.5.3 Methodolgy. 20 1.5.4 Interfacing with SMT Solvers . 22 1.5.5 Industrial Adoption . 22 1.6 Program Verification . 23 1.6.1 The Application . 23 1.6.2 An Example . 23 1.6.3 Methodolgy. 25 1.6.4 Bit-precise reasoning . 28 1.6.5 Industrial Adoption . 29 1.7 Bibliographical notes . 29 4 Contents References ..................................................... 31 1 Applications of SMT Solvers 1.1 Introduction A significant application domain for SMT solvers is in the analysis, verifi- cation, testing and construction of programs.
    [Show full text]
  • Report on the Third Static Analysis Tool Exposition (SATE 2010)
    Special Publication 500-283 Report on the Third Static Analysis Tool Exposition (SATE 2010) Editors: Vadim Okun Aurelien Delaitre Paul E. Black Software and Systems Division Information Technology Laboratory National Institute of Standards and Technology Gaithersburg, MD 20899 October 2011 U.S. Department of Commerce John E. Bryson, Secretary National Institute of Standards and Technology Patrick D. Gallagher, Under Secretary for Standards and Technology and Director Abstract: The NIST Software Assurance Metrics And Tool Evaluation (SAMATE) project conducted the third Static Analysis Tool Exposition (SATE) in 2010 to advance research in static analysis tools that find security defects in source code. The main goals of SATE were to enable empirical research based on large test sets, encourage improvements to tools, and promote broader and more rapid adoption of tools by objectively demonstrating their use on production software. Briefly, participating tool makers ran their tool on a set of programs. Researchers led by NIST performed a partial analysis of tool reports. The results and experiences were reported at the SATE 2010 Workshop in Gaithersburg, MD, in October, 2010. The tool reports and analysis were made publicly available in 2011. This special publication consists of the following three papers. “The Third Static Analysis Tool Exposition (SATE 2010),” by Vadim Okun, Aurelien Delaitre, and Paul E. Black, describes the SATE procedure and provides observations based on the data collected. The other two papers are written by participating tool makers. “Goanna Static Analysis at the NIST Static Analysis Tool Exposition,” by Mark Bradley, Ansgar Fehnker, Ralf Huuck, and Paul Steckler, introduces Goanna, which uses a combination of static analysis with model checking, and describes its SATE experience, tool results, and some of the lessons learned in the process.
    [Show full text]
  • Partner Directory Wind River Partner Program
    PARTNER DIRECTORY WIND RIVER PARTNER PROGRAM The Internet of Things (IoT), cloud computing, and Network Functions Virtualization are but some of the market forces at play today. These forces impact Wind River® customers in markets ranging from aerospace and defense to consumer, networking to automotive, and industrial to medical. The Wind River® edge-to-cloud portfolio of products is ideally suited to address the emerging needs of IoT, from the secure and managed intelligent devices at the edge to the gateway, into the critical network infrastructure, and up into the cloud. Wind River offers cross-architecture support. We are proud to partner with leading companies across various industries to help our mutual customers ease integration challenges; shorten development times; and provide greater functionality to their devices, systems, and networks for building IoT. With more than 200 members and still growing, Wind River has one of the embedded software industry’s largest ecosystems to complement its comprehensive portfolio. Please use this guide as a resource to identify companies that can help with your development across markets. For updates, browse our online Partner Directory. 2 | Partner Program Guide MARKET FOCUS For an alphabetical listing of all members of the *Clavister ..................................................37 Wind River Partner Program, please see the Cloudera ...................................................37 Partner Index on page 139. *Dell ..........................................................45 *EnterpriseWeb
    [Show full text]
  • Softwaretest- & Analysetools Für Produktivität Und Qualität
    Softwaretest- & Analysetools für Produktivität und Qualität Statische Codeanalyse Dynamische Codeanalyse Software-Komplexitätsmessung Code Coverage www.verifysoft.com GrammaTech CodeSonar® GrammaTech CodeSonar® – Wenn Softwarequalität zum Prinzip erhoben wird Statische Analyse von Quell- und Binärcode Da die statische Analyse eine Ausführung der zu analysierenden (Teil-) Applikation nicht erfordert, kann CodeSonar bereits früh im Entwicklungsprozess kritische Softwarefehler aufdecken. Risiken, hervorgerufen durch z.B. gefährliche Sicherheitslücken, nichtdetermistische Nebenläufigkeitsfehler und Speicherlecks können so minimiert und hohe Wartungskosten, verursacht durch schwer lesbaren Code, vermieden werden. void gyroDisp(const char *sdf){ char* sBuffer; sBuffer = (char*) malloc(strlen(sdf)) + 1; if (!sBuffer){ exit(EXIT_FAILURE); } strcpy(sBuffer, sdf); /* Format string for LCD */ createLCD453Format(sBuffer); free(sBuffer); } f Kritische Fehler aufdecken f Nebenläufigkeitsprobleme erkennen f Sicherheitsschwachstellen eliminieren f Reports zur Zertifizierung nach ISO 26262 / DO-178C f Codierrichtlinien überprüfen generieren 4 | www.verifysoft.com GrammaTech CodeSonar® Statische Quellcodeanalyse HINTERGRUNDWISSEN Als führendes Werkzeug zur statischen Quellcodeanalyse Unterstützte Programmiersprachen f C/C++ f Java f C# weist CodeSonar im Vergleich zu vielen anderen stati- schen Analyse-Tools nicht nur eine bessere Fehlererken- Unterstützte Betriebssysteme nung auf, es zeichnet sich zudem durch eine vergleichs- f Windows f OS X weise geringe
    [Show full text]
  • Combining Binary Analysis Techniques to Trigger Use-After-Free Josselin Feist
    Finding the needle in the heap : combining binary analysis techniques to trigger use-after-free Josselin Feist To cite this version: Josselin Feist. Finding the needle in the heap : combining binary analysis techniques to trigger use- after-free. Cryptography and Security [cs.CR]. Université Grenoble Alpes, 2017. English. NNT : 2017GREAM016. tel-01681707v2 HAL Id: tel-01681707 https://tel.archives-ouvertes.fr/tel-01681707v2 Submitted on 12 Jan 2018 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. THESE` Pour obtenir le grade de DOCTEUR DE LA COMMUNAUTE´ UNIVERSITE´ GRENOBLE ALPES Specialit´ e´ : Informatique Arretˆ e´ ministerial´ : 25 mai 2016 Present´ ee´ par Josselin Feist These` dirigee´ par Marie-Laure Potet and Laurent Mounier prepar´ ee´ au sein Verimag et de Ecole´ Doctorale Mathematiques,´ Sciences et technologies de l’information, Informatique Finding the Needle in the Heap: Combining Binary Analysis Tech- niques to Trigger Use-After-Free These` soutenue publiquement le 29 mars 2017, devant le jury compose´ de : PHILIPPE ELBAZ-VINCENT Professeur,
    [Show full text]
  • How Developers Engage with Static Analysis Tools in Different Contexts
    Noname manuscript No. (will be inserted by the editor) How Developers Engage with Static Analysis Tools in Different Contexts Carmine Vassallo · Sebastiano Panichella · Fabio Palomba · Sebastian Proksch · Harald C. Gall · Andy Zaidman This is an author-generated version. The final publication will be available at Springer. Abstract Automatic static analysis tools (ASATs) are instruments that sup- port code quality assessment by automatically detecting defects and design issues. Despite their popularity, they are characterized by (i) a high false positive rate and (ii) the low comprehensibility of the generated warnings. However, no prior studies have investigated the usage of ASATs in different development contexts (e.g., code reviews, regular development), nor how open source projects integrate ASATs into their workflows. These perspectives are paramount to improve the prioritization of the identified warnings. To shed light on the actual ASATs usage practices, in this paper we first survey 56 developers (66% from industry and 34% from open source projects) and inter- view 11 industrial experts leveraging ASATs in their workflow with the aim of understanding how they use ASATs in different contexts. Furthermore, to in- vestigate how ASATs are being used in the workflows of open source projects, we manually inspect the contribution guidelines of 176 open-source systems and extract the ASATs' configuration and build files from their corresponding GitHub repositories. Our study highlights that (i) 71% of developers do pay attention to different warning categories depending on the development con- text; (ii) 63% of our respondents rely on specific factors (e.g., team policies and composition) when prioritizing warnings to fix during their programming; and (iii) 66% of the projects define how to use specific ASATs, but only 37% enforce their usage for new contributions.
    [Show full text]