Defense Against Software Exploits El Habib Boudjema

Defense against software exploits El Habib Boudjema To cite this version: El Habib Boudjema. Defense against software exploits. Mathematical Software [cs.MS]. Université Paris-Est, 2018. English. NNT : 2018PESC1015. tel-01970951 HAL Id: tel-01970951 https://tel.archives-ouvertes.fr/tel-01970951 Submitted on 6 Jan 2019 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Universite´ Paris Est ECOLE´ DOCTORALE MSTIC THESE` DE DOCTORAT pour obtenir le titre de Docteur de l'UniversitéParis-Est Specialit´ e´ : Informatique défenduepar BOUDJEMA El habib Défense contre les attaques de logiciels soutenue le 04 mai 2018 devant le jury composéde : Prof. ZEADALLY Sherali Rapporteur University of Kentucky, USA Dr. SAUVERON Damien Rapporteur Universitéde Limoges, France Prof. BEN-OTHMAN Jalel Président UniversitéParis 13, France Prof. HABBAS Zineb Examinateur Universitéde Lorraine, France Prof. KHEDOUCI Hamamache Examinateur UniversitéLyon 1, France Dr. VERLAN Serghei Examinateur UniversitéParis-Est, France Prof. MOKDAD Lynda Directrice de thèse UniversitéParis-Est, France Dr. FAURE Christèle Co-directrice de thèse Saferiver, France Dr. DELEBARRE Véronique Invitée Dirigeante Saferiver, France i Thèsepréparéeà: Saferiver, Montrouge. France et Laboratoire LACL UniversitéParis-Est, Créteil.France Abstract Defense Against Software Exploits In the beginning of the third millennium, we are witnessing a new age. This new age is characterized by the shift from an industrial economy to an economy based on information technology. It is the Information Age. Today, we rely on software in practically every aspect of our life. Information technology is used by all economic actors: manufactures, governments, banks, universities, hospitals, retail stores, etc. A single software vulnerability can lead to devastating consequences and irreparable damage. The situation is worsened by the software becoming larger and more complex making the task of avoiding software flaws more and more difficult task. Automated tools finding those vulnerabilities rapidly before it is late, are becoming a basic need for software industry community. This thesis is investigating security vulnerabilities occurring in C language applications. We searched the sources of these vulnerabilities with a focus on C library functions calling. We dressed a list of property checks to detect code portions lead- ing to security vulnerabilities. Those properties give for a library function call the conditions making this call a source of a security vulnerability. When these conditions are met, the corresponding call must be reported as vulnerable. These checks were implemented in Carto-C tool and experimented on the Juliet test base and on real life application sources. We also investigated the detection of exploitable vulnerability at binary code level. We started by defining what exploitable vulnerability behavioral patterns are. The focus was on the most exploited vulnerability classes such as stack buffer overflow, heap buffer overflow and use-after-free. Af- ter, a new method on how to search for these patterns by exploring application execution traces is proposed. During the exploration, necessary information is ex- tracted and used to find the patterns of the searched vulnerabilities. This method was implemented in our tool Vyper and experimented successfully on Juliet test base and real life application binaries. Résumé Défensecontre les attaques de logiciels Dans ce débutdu troisième millénium,nous sommes témoinsd'un nouvel âge. Ce nouvel âgeest caractérisépar la transition d'une économieindustrielle vers une économie baséesur la technologie de l'information. C'est l'âgede l'information. Aujourd'hui le logiciel est présent dans pratiquement tous les aspects de notre vie. Une seule vulnérabilitélogicielle peut conduire àdes conséquencesdévastatrices. La détectionde ces vulnérabilitésest une tâche qui devient de plus en plus dure surtout avec les logiciels devenant plus grands et plus complexes. Dans cette thèse,nous nous sommes intéressésaux vulnérabilitésde sécuritéim- pactant les applications développéesen langage C et particulièrement les vulnéra- bilitésprovenant de l'usage des fonctions de ce langage. Nous avons proposé une liste de vérificationspour la détectiondes portions de code causant des vulnérabilitésde sécurité.Ces vérificationssont sous la forme de conditions ren- dant l'appel d'une fonction vulnérable.Des implémentations dans l'outil Carto-C et des expérimentations sur la base de test Juliet et les sources d'applications réellesont étéréalisées.Nous nous sommes également intéressésàla détectionde vulnérabilitésexploitables au niveau du code binaire. Nous avons définien quoi consiste le motif comportemental d'une vulnérabilité. Nous avons proposéune méthode permettant de rechercher ces motifs dans les traces d'exécutionsd'une application. Le calcul de ces traces d'exécutionest effectuéen utilisant l'exécution concolique. Cette méthode est baséesur l'annotation de zones mémoiressensibles et la détection d'accèsdangereux àces zones. L'implémentation de cette méthode a étéréaliséedans l'outil Vyper et des expérimentations sur la base de test Juliet et les codes binaires d'applications réellesont étémenéesavec succès. iv Keywords: static analysis, vulnerability detection, software exploit Mots-clés: analyse statique, détection de vulnérabilité, attaque logicielle Contents Abstract ii Résumé iii Contentsv List of Figures viii List of Tables ix Abbreviationsx Introduction1 1 Overview of cyber security domains5 1.1 Cyber security: a growing large field.................5 1.2 Security vulnerabilities: public enemy of cyber security.......8 1.3 Causes of security vulnerabilities................... 10 1.4 High level security measures...................... 11 2 Source of vulnerabilities in C language 14 2.1 C language definition.......................... 15 2.2 Formatted output functions in C language.............. 16 2.2.1 Output format string structure................ 18 2.2.2 Output functions usage problems............... 20 2.2.3 Checking and reporting issues................. 22 2.3 Input formatted functions in C language............... 23 2.3.1 Input functions format string structure............ 24 2.3.2 Input formatted functions usage problems.......... 26 2.3.3 Checking and reporting issues................. 28 2.4 POSIX formatted functions...................... 28 2.4.1 Format string structure difference............... 28 2.4.2 List of additional functions................... 30 2.4.3 Safety and security issues................... 30 2.4.4 Checking and reporting issues................. 30 v Contents vi 2.5 GNU/Linux formatted functions.................... 31 2.5.1 Format string structure difference............... 31 2.5.2 List of additional functions................... 31 2.5.3 Safety and security issues................... 32 2.5.4 Checking and reporting issues................. 32 2.6 Windows formatted function...................... 32 2.6.1 Format string structure difference............... 32 2.6.2 List of additional functions................... 33 2.6.3 Safety and security issues................... 35 2.6.4 Checking and reporting issues................. 35 2.7 Command and program execution functions............. 35 2.7.1 Safety and security issues................... 37 2.8 Memory manipulation functions.................... 38 2.8.1 Safety and security issues................... 39 3 Static analysis tools 42 3.1 Front-end: parsing and translating.................. 43 3.2 Middle-end : computation and detection............... 49 3.3 Back-end: results collection and reporting.............. 51 3.4 List of tools............................... 51 3.5 Major problems of static analysis................... 52 4 Security vulnerabilities in C language applications 55 4.1 Static analysis for security....................... 57 4.2 Covered C language parts....................... 58 4.3 Covered security vulnerabilities.................... 60 4.4 Properties for security vulnerability detection and reporting.... 63 4.4.1 Format string vulnerabilities.................. 63 4.4.2 Command execution vulnerabilities.............. 69 4.4.3 Buffer and memory vulnerabilities............... 73 4.5 Test and evaluation of Carto-C.................... 76 4.5.1 Test with synthetic test cases................. 77 4.5.2 Test on real applications.................... 81 4.6 Discussion and conclusions....................... 83 5 Vulnerability detection by behavioral pattern recognition 84 5.1 Concolic execution and binary code analysis............. 86 5.2 Exploitable vulnerabilities....................... 87 5.3 Formalization of exploitable vulnerability detection......... 92 5.4 Formal model application on exploitable vulnerabilities....... 95 5.5 Test and evaluation of Vyper..................... 97 5.5.1 Testing on custom test cases.................. 98 5.5.2 Testing on Juliet test base................... 98 5.5.3 Testing

Defense Against Software Exploits El Habib Boudjema

Undefined Behaviour in the C Language

Codesonar the SAST Platform for Devsecops

Grammatech Codesonar Product Datasheet

Codesonar for C# Datasheet

Software Vulnerabilities Principles, Exploitability, Detection and Mitigation

Code Review Guide

Applications of SMT Solvers to Program Verification

Report on the Third Static Analysis Tool Exposition (SATE 2010)

Partner Directory Wind River Partner Program

Automatic Bug-Finding Techniques for Large Software Projects

Softwaretest- & Analysetools Für Produktivität Und Qualität

Combining Binary Analysis Techniques to Trigger Use-After-Free Josselin Feist