Improving Software Dependability Through Documentation Analysis

Total Page:16

File Type:pdf, Size:1020Kb

Improving Software Dependability Through Documentation Analysis Improving Software Dependability through Documentation Analysis by Edmund Wong A thesis presented to the University of Waterloo in fulfillment of the thesis requirement for the degree of Doctor of Philosophy in Computer Engineering Waterloo, Ontario, Canada, 2019 © Edmund Wong 2019 Examining Committee Membership The following served on the Examining Committee for this thesis. The decision of the Examining Committee is by majority vote. Internal-External Examiner: Meiyappan Nagappan Assistant Professor, Dept. of Computer Science, University of Waterloo Supervisor: Lin Tan Associate Professor, Dept. of Electrical and Com- puter Engineering, University of Waterloo Internal Member: Arie Gurfinkel Associate Professor, Dept. of Electrical and Com- puter Engineering, University of Waterloo Internal Member: Mahesh Tripunitara Associate Professor, Dept. of Electrical and Com- puter Engineering, University of Waterloo External Member: Nicholas A. Kraft Software Researcher, ABB Corporate Research ii I hereby declare that I am the sole author of this thesis. This is a true copy of the thesis, including any required final revisions, as accepted by my examiners. I understand that my thesis may be made electronically available to the public. iii Abstract Software documentation contains critical information that describes a system’s func- tionality and requirements. Documentation exists in several forms, including code com- ments, test plans, manual pages, and user manuals. The lack of documentation in existing software systems is an issue that impacts software maintainability and programmer pro- ductivity. Since some code bases contain a large amount of documentation, we want to leverage these existing documentation to improve software dependability. Specifically, we utilize documentation to help detect software bugs and repair corrupted files, which can reduce the number of software error and failure to improve a system’s reliability (e.g., continuity of correct service). We also generate documentation (e.g., code comment) auto- matically to help developers understand the source code, which helps improve a system’s maintainability (e.g., ability to undergo repairs and modifications). In this thesis, we analyze software documentation and propose two branches of work, which focuses on three types of documentation including manual pages, code comments, and user manuals. The first branch of work focuses on documentation analysis because doc- umentation contains valuable information that describes the behavior of the program. We automatically extract constraints from documentation and apply them on a dynamic anal- ysis symbolic execution tool to find bugs in the target software, and we extract constraints manually from documentation and apply them on a structured-file parsing application to repair corrupted PDF files. The second branch of work focuses on automatic code comment generation to improve software documentation. For documentation analysis, we propose and implement DASE and DocRepair. DASE leverages automatically extracted constraints from documentation to improve a dynamic analysis symbolic execution tool. DASE guides symbolic execution to focus the testing on execution paths that execute a program’s core functionalities using constraints learned from the documentation. We evaluated DASE on 88 programs from five mature real-world software suites to detect software bugs. DASE detects 12 previously unknown bugs that symbolic execution would fail to detect when given no input constraints, 6 of which have been confirmed by the developers. In DocRepair we perform an empirical study to study and repair corrupted PDF files. We create the first dataset of 319 corrupted PDF files and conduct an empirical study on 119 real-world corrupted PDF files to study the common types of file corruption. Based on the result of the empirical study we propose a technique called DocRepair. DocRepair’s re- pair algorithm includes seven repair operators that utilizes manually extracted constraints from documentation to repair corrupted files. We evaluate DocRepair against three com- mon PDF repair tools. Amongst the 1,827 collected corrupted files from over two corpora iv of PDF files, DocRepair can successfully repair 354 files compared to Mutool, PDFtk, and GhostScript which repair 508, 41 and 84 respectively. We also propose a technique to combine multiple repair tools called DocRepair+, which can successfully repair 751 files. In the case where there is a lack of documentation, DASE and DocRepair+ would not work. Therefore, we propose automated documentation generation to address the issue. We propose and implement CloCom+ to generate code comments by mining both existing software repositories in GitHub and a Question and Answer site, Stack Overflow. CloCom+ generated 442 unique comments for 16 Java projects. Although CloCom+ improves on previous work, SumSlice, on automatic comment generation, the quality (evaluated on completeness, conciseness, expressiveness, and usefulness) and yield (number of generated comments) are still rather low which makes the technique not ready for real-world usage. In the future, it may be possible to combine the two proposed branches of work (doc- umentation analysis and documentation generation) to further improve software depend- ability. For example, we can extract constraints from the automatically generated docu- mentation (e.g., code comments). v Acknowledgements I would like to thank all the little people who made this possible. vi Dedication This is dedicated to the one I love. vii Table of Contents List of Tables xiii List of Figures xvi 1 Introduction1 1.1 Automatic Documentation Analysis......................2 1.1.1 General Symbolic Execution-Based Software Testing.........3 1.1.2 Automatic File Repair.........................4 1.2 Automatic Documentation Generation....................5 1.2.1 Mining Question and Answer Sites..................6 1.2.2 Mining Code Repositories.......................7 1.3 Contributions..................................7 1.4 Overview of Thesis...............................8 2 Related Work9 2.1 Symbolic Execution...............................9 2.2 Automatic File Repair............................. 11 2.3 Automatic Comment Generation....................... 13 2.4 Source Code Summarization.......................... 15 2.5 Mining Descriptions for Code Artifact..................... 16 2.6 Code Clone Detection............................. 17 viii 2.7 Fuzz Testing................................... 17 2.8 Documentation Analysis............................ 18 3 Documentation Analysis: Symbolic Execution-Based Software Testing using Documentation Constraints 20 3.1 Motivation.................................... 20 3.2 Overview..................................... 22 3.3 Background................................... 27 3.4 Design and Implementation.......................... 27 3.4.1 Extracting File Format Constraints.................. 27 3.4.2 Adding File Layout Constraints.................... 30 3.4.3 Extracting Valid Options........................ 31 3.4.4 Using Options to Flatten Symbolic Execution............ 32 3.5 Evaluation Method............................... 32 3.5.1 Evaluated Programs.......................... 32 3.5.2 Experimental Setup........................... 33 3.6 Evaluation Results............................... 34 3.6.1 Detected Bugs.............................. 34 3.6.2 Code Coverage............................. 37 3.6.3 DASE Complements Developer Tests................. 39 3.6.4 Constraint Extraction Results..................... 40 3.7 Threats to Validity............................... 41 3.7.1 Internal Validity............................ 41 3.7.2 External Validity............................ 41 3.7.3 Construct Validity........................... 42 3.7.4 Conclusion Validity........................... 42 3.8 Summary.................................... 42 ix 4 Documentation Analysis: Automatic File Repair 43 4.1 Motivation.................................... 43 4.2 Background................................... 48 4.2.1 PDF File Format............................ 48 4.2.2 Existing Repair Approaches...................... 49 4.3 Definitions.................................... 50 4.4 A Study of Corrupted PDF Files....................... 50 4.4.1 Collecting Corrupted PDF Files.................... 51 4.4.2 Identifying Corrupted PDF Files................... 52 4.4.3 PDF Repair Tools and Viewers.................... 53 4.4.4 Empirical Study Findings....................... 53 4.5 Design and Implementation.......................... 62 4.5.1 Data Parsing and Collection...................... 62 4.5.2 Repair Operators............................ 62 4.6 Evaluation Method and Results........................ 69 4.7 Threats to Validity............................... 73 4.7.1 Internal Validity............................ 73 4.7.2 External Validity............................ 73 4.7.3 Construct Validity........................... 74 4.7.4 Conclusion Validity........................... 74 4.8 Summary.................................... 75 5 Automatic Documentation Generation: Crowd Sourced Comment Gen- eration 76 5.1 Motivation.................................... 76 5.2 Examples and Challenges............................ 79 5.2.1 Example One.............................. 79 5.2.2 Example Two.............................. 82 x 5.2.3 Example Three............................. 84 5.3 Design and Implementation.......................... 85 5.3.1 Code-Description Mapping Extraction from Stack Overflow..... 86 5.3.2
Recommended publications
  • Vulnerability Report Attacks Bypassing Confidentiality in Encrypted PDF
    Vulnerability Report Attacks bypassing confidentiality in encrypted PDF Jens M¨uller1, Fabian Ising2, Vladislav Mladenov1, Christian Mainka1, Sebastian Schinzel2, J¨orgSchwenk1 May 16, 2019 1Chair for Network and Data Security 2FH M¨unsterUniversity of Applied Sciences Abstract In this report, we analyze PDF encryption and show two novel techniques for breaking the confidentiality of encrypted documents. Firstly, we abuse the PDF feature of partially encrypted documents to wrap the encrypted part of the document within attacker-controlled content and therefore, exfiltrate the plaintext once the document is opened by a legitimate user. Secondly, we abuse a flaw in the PDF encryption specification allowing an attacker to arbitrarily manipulate encrypted content without knowing the cor- responding key/password. The only requirement is one single block of known plaintext, which we show is fulfilled by design. By using exfiltration channels our attacks allow the recovery of the entire plaintext or parts of it within an encrypted document. The attacks rely only on standard compliant PDF features. We evaluated our attacks on 27 widely used PDF viewers and found all of them vulnerable. 1 Contents 1 Background4 1.1 Portable Document Format (PDF) . .4 1.2 PDF Encryption . .6 1.3 PDF Interactive Features . .7 2 Attacker Model8 3 PDF Encryption: Security Analysis9 3.1 Partial Encryption . .9 3.2 CBC Malleability . 10 3.3 PDF Interactive Features . 12 4 How To Break PDF Encryption 14 4.1 Direct Exfiltration (Attack A) . 14 4.1.1 Requirements . 15 4.1.2 Direct Exfiltration through PDF Forms (A1) . 15 4.1.3 Direct Exfiltration via Hyperlinks (A2) .
    [Show full text]
  • Living Without Google on Android
    Alternative for Google Apps on Android - living without Google on Android Android without any Google App? What to use instead of Hangouts, Map, Gmail? Is that even possible? And why would anyone want to live without Google? I've been using a lot of different custom ROMs on my devices, so far the two best: plain Cyanogenmod 11 snapshot on the Nexus 4[^1], and MIUI 2.3.2 on the HTC Desire G7[^2]. All the others ( MUIU 5, MIUI 6 unofficial, AOKP, Kaos, Slim, etc ) were either ugly, unusable, too strange or exceptionally problematic on battery life. For a long time, the first step for me was to install the Google Apps, gapps packages for Plays Store, Maps, and so on, but lately they require so much rights on the phone that I started to have a bad taste about them. Then I started to look for alternatives. 1 of 5 So, what to replace with what? Play Store I've been using F-Droid[^3] as my primary app store for a while now, but since it's strictly Free Software[^4] store only, sometimes there's just no app present for your needs; aptoide[^5] comes very handy in that cases. Hangouts I never liked Hangouts since the move from Gtalk although for a little while it was exceptional for video - I guess it ended when the mass started to use it in replacement of Skype and its recent suckyness. For chat only, check out: ChatSecure[^6], Conversations[^7] or Xabber[^8]. All of them is good for Gtalk-like, oldschool client and though Facebook can be configured as XMPP as well, I'd recommend Xabber for that, the other two is a bit flaky with Facebook.
    [Show full text]
  • Ghostscript and Mupdf Status Openprinting Summit April 2016
    Ghostscript and MuPDF Status OpenPrinting Summit April 2016 Michael Vrhel, Ph.D. Artifex Software Inc. San Rafael CA Outline Ghostscript overview What is new with Ghostscript MuPDF overview What is new with MuPDF MuPDF vs Ghostscript MuJS, GSView The Basics Ghostscript is a document conversion and rendering engine. Written in C ANSI 1989 standard (ANS X3.159-1989) Essential component of the Linux printing pipeline. Dual AGPL/Proprietary licensed. Artifex owns the copyright. Source and documentation available at www.ghostscript.com Graphical Overview PostScript PCL5e/c with PDF 1.7 XPS Level 3 GL/2 and RTL PCLXL Ghostscript Graphics Library High level Printer drivers: Raster output API: Output drivers: Inkjet TIFF PSwrite PDFwrite Laser JPEG XPSwrite Custom etc. CUPS Devices Understanding devices is a major key to understanding Ghostscript. Devices can have high-level functionality. e.g. pdfwrite can handle text, images, patterns, shading, fills, strokes and transparency directly. Graphics library has “default” operations. e.g. text turns into bitmaps, images decomposed into rectangles. In embedded environments, calls into hardware can be made. Raster devices require the graphics library to do all the rendering. Relevant Changes to GS since last meeting…. A substantial revision of the build system and GhostPDL directory structure (9.18) GhostPCL and GhostXPS "products" are now built by the Ghostscript build system "proper" rather than having their own builds (9.18) New method of internally inserting devices into the device chain developed. Allows easier implementation of “filter” devices (9.18) Implementation of "-dFirstPage"/"-dLastPage" with all input languages (9.18) Relevant Changes to GS since last meeting….
    [Show full text]
  • Breaking PDF Encryption
    Practical Decryption exFiltration: Breaking PDF Encryption Jens Müller Fabian Ising Vladislav Mladenov [email protected] [email protected] [email protected] Ruhr University Bochum, Chair for Münster University of Applied Ruhr University Bochum, Chair for Network and Data Security Sciences Network and Data Security Christian Mainka Sebastian Schinzel Jörg Schwenk [email protected] [email protected] [email protected] Ruhr University Bochum, Chair for Münster University of Applied Ruhr University Bochum, Chair for Network and Data Security Sciences Network and Data Security ABSTRACT Home/Trusted Environment The Portable Document Format, better known as PDF, is one of the Decrypted Document 1. Victim opens 2. Exfiltrating Tax Declaration decrypted content most widely used document formats worldwide, and in order to en- an encrypted PDF file Scrooge McDuck with their password via the Internet sure information confidentiality, this file format supports document TOP SECRET Victim encryption. In this paper, we analyze PDF encryption and show Attacker two novel techniques for breaking the confidentiality of encrypted Victim’s PC documents. First, we abuse the PDF feature of partially encrypted documents to wrap the encrypted part of the document within Figure 1: An overview of the attack scenario: The victim attacker-controlled content and therefore, exfiltrate the plaintext opens an encrypted PDF document and unintentionally once the document is opened by a legitimate user. Second, we abuse leaks the decrypted content to an attacker-controlled server. a flaw in the PDF encryption specification to arbitrarily manipulate The encrypted PDF file was manipulated by the attacker be- encrypted content.
    [Show full text]
  • Complete Issue 40:3 As One
    TUGBOAT Volume 40, Number 3 / 2019 General Delivery 211 From the president / Boris Veytsman 212 Editorial comments / Barbara Beeton TEX Users Group 2019 sponsors; Kerning between lowercase+uppercase; Differential “d”; Bibliographic archives in BibTEX form 213 Ukraine at BachoTEX 2019: Thoughts and impressions / Yevhen Strakhov Publishing 215 An experience of trying to submit a paper in LATEX in an XML-first world / David Walden 217 Studying the histories of computerizing publishing and desktop publishing, 2017–19 / David Walden Resources 229 TEX services at texlive.info / Norbert Preining 231 Providing Docker images for TEX Live and ConTEXt / Island of TEX 232 TEX on the Raspberry Pi / Hans Hagen Software & Tools 234 MuPDF tools / Taco Hoekwater 236 LATEX on the road / Piet van Oostrum Graphics 247 A Brazilian Portuguese work on MetaPost, and how mathematics is embedded in it / Estev˜aoVin´ıcius Candia LATEX 251 LATEX news, issue 30, October 2019 / LATEX Project Team Methods 255 Understanding scientific documents with synthetic analysis on mathematical expressions and natural language / Takuto Asakura Fonts 257 Modern Type 3 fonts / Hans Hagen Multilingual 263 Typesetting the Bangla script in Unicode TEX engines—experiences and insights Document Processing / Md Qutub Uddin Sajib Typography 270 Typographers’ Inn / Peter Flynn Book Reviews 272 Book review: Hermann Zapf and the World He Designed: A Biography by Jerry Kelly / Barbara Beeton 274 Book review: Carol Twombly: Her brief but brilliant career in type design by Nancy Stock-Allen / Karl
    [Show full text]
  • Incorporer Les Polices Dans Un Document Libreoffice Une Police D’Écriture Ou Police De Caractères Est Une Manière D'écrire Les Caractères
    Incorporer les polices dans un document LibreOffice Une police d’écriture ou police de caractères est une manière d'écrire les caractères. Il en existe énormément et pour à peu près tous les usages. Si vous communiquez électroniquement avec d'autres, il est fort probable qu'ils n'aient pas toutes vos polices numériques. De plus, certaines sont couvertes par un droit d'auteur strict (au sens où l'auteur ne donne pas plus de droits aux utilisateurs que la loi lui oblige) et le droit des marques. C'est un véritable problème. En effet, les polices de caractères informatiques déterminent beaucoup de choses, dont la taille des caractères, l'apparence des caractères et l'espacement entre ceux-ci. Si votre correspondant n'a pas les mêmes polices que vous, votre texte peut donc être visuellement déformé. Les logiciels essayent généralement de trouver la police la plus proche, mais ce n'est pas toujours une grande réussite. Pour combler ce problème, il faut intégrer les polices aux documents que vous transmettez. Bien entendu, cela rendra plus lourd vos documents étant donné qu'ils auront les polices en plus. La bonne nouvelle est que LibreOffice (au moins à partir de la version 4.3) permet cela ! Dans un document OpenDocument OpenDocument est un format ouvert et standardisé qui est utilisé par défaut dans LibreOffice, OpenOffice et Calligra. Les fichiers OpenDocument ont généralement une extension .odx, avec "x" un caractère quelconque. Pour incorporer les polices d'un document au format OpenDocument, suivez les étapes suivantes. 1. Ouvrez votre document au format OpenDocument avec LibreOffice.
    [Show full text]
  • Pymupdf 1.12.2 Documentation » Next | Index Pymupdf Documentation
    PyMuPDF 1.12.2 documentation » next | index PyMuPDF Documentation Introduction Note on the Name fitz License Covered Version Installation Option 1: Install from Sources Step 1: Download PyMuPDF Step 2: Download and Generate MuPDF Step 3: Build / Setup PyMuPDF Option 2: Install from Binaries Step 1: Download Binary Step 2: Install PyMuPDF MD5 Checksums Targeting Parallel Python Installations Using UPX Tutorial Importing the Bindings Opening a Document Some Document Methods and Attributes Accessing Meta Data Working with Outlines Working with Pages Inspecting the Links of a Page Rendering a Page Saving the Page Image in a File Displaying the Image in Dialog Managers Extracting Text Searching Text PDF Maintenance Modifying, Creating, Re-arranging and Deleting Pages Joining and Splitting PDF Documents Saving Closing Example: Dynamically Cleaning up Corrupt PDF Documents Further Reading Classes Annot Example Colorspace Document Remarks on select() select() Examples setMetadata() Example setToC() Example insertPDF() Examples Other Examples Identity IRect Remark IRect Algebra Examples Link linkDest Matrix Remarks 1 Remarks 2 Matrix Algebra Examples Shifting Flipping Shearing Rotating Outline Page Description of getLinks() Entries Notes on Supporting Links Homologous Methods of Document and Page Pixmap Supported Input Image Types Details on Saving Images with writeImage() Pixmap Example Code Snippets Point Remark Point Algebra Examples Shape Usage Examples Common Parameters Rect Remark Rect Algebra Examples Operator Algebra for Geometry Objects
    [Show full text]
  • Osspolice - Identifying Open-Source License Violation and 1-Day Security Risk at Large Scale
    OSSPolice - Identifying Open-Source License Violation and 1-day Security Risk at Large Scale Ruian Duan, Ashish Bijlani, Meng Xu Taesoo Kim, Wenke Lee ACM CCS 2017 1 Background • Open SourCe Software (OSS) is gaining popularity, e.g. GitHub reported 20M users and 57M repos • Mobile app market grows fast with over 2M apps on Play Store • Developers reuse OSS as is for lots of benefits • Legal risks and seCurity risks arise 2 Risks in OSS use • OSS liCenses have Constraints (e.g. For now, GNU GPL is an GNU GPL requires derivative works enforceable contract, says US to open sourCe) federal judge! Artifex Slaps Palm with PDF Reader Copyright Suit Equifax blames open-source • 1-day vulnerabilities in stale OSS software for its record-breaking versions are exploited by haCkers security breach Community Health Systems Breach Possible due to Heartbleed Vulnerability 3 Goal • Design a tool, OSSPoliCe, to analyze Android apps for open-sourCe license violation and 1-day seCurity risk by deteCting reuse of OSS and their versions at large sCale • Requirements • AcCurate deteCtion for hundreds of thousands of OSS • AcCurate version pinpointing • EffiCient resourCe usage • Fast search to support vetting a large number of Android apps 4 Overview and challenges • Feature selection • Source vs binary: automatiCally building sourCe Code is hard, due to dependenCies, various build configs etC. • Compare App against OSS • Fused app binaries: multiple OSS Can be linked or compiled into a single file • Partial builds and internal code clones: not all OSS features
    [Show full text]
  • Moon+ Reader User Manual
    Moon+ Reader User Manual You can download the OverDrive app and Moon+ Reader from the Amazon appstore on Fire HDX tablets. Kindle Tips and Tricks Advanced User Guide. Moon+Reader Pro Apk Full Free Download Latest Version Moon+Reader Pro Apk better designed book reader through powerful controls & full purposes, ropes. A Google User. Awesome customization Moon+ Reader has awesome customization capabilities! You can pick your favorite fonts, font colors, wallpapers. It has a simple user interface which doesn't distract you from the book, Allows Offline dictionaries + Moon+ Reader or Colordict + Moon+ Reader, are good combinations. Books: Where do I get solution manual for "Chemcal Engineering. Translation, Share functions all in moon+ ebook reader. tables correctly but also removes all of the nice user interface functions that make this app useful. Since version 1.47, user defined emblems (icons) can be displayed next to covers an example is a recipe book made using the "Recipe Reader" plugin in calibre. powerful these searches can be see the user manual page on the subject. Moon+ Reader User Manual Read/Download One app may have a better user interface, the other one may be more stable. You may realize that there is Moon+ Reader offers a lot of customization options. Moon+ Reader Pro v2.6.7 Apk Free (Patched/Modded) Re. website to provide a perfect user experience which includes easy reading and navigating you site. Play Books, Kobo, Kindle, Moon+ Reader, FBReader, Feedly, Comixology, etc. your preference, there is no detailed user manual that explains everything. Discover a new way to read e-books with the Prestigio eReader application.
    [Show full text]
  • DVD-Ofimática 2014-07
    (continuación 2) Calizo 0.2.5 - CamStudio 2.7.316 - CamStudio Codec 1.5 - CDex 1.70 - CDisplayEx 1.9.09 - cdrTools FrontEnd 1.5.2 - Classic Shell 3.6.8 - Clavier+ 10.6.7 - Clementine 1.2.1 - Cobian Backup 8.4.0.202 - Comical 0.8 - ComiX 0.2.1.24 - CoolReader 3.0.56.42 - CubicExplorer 0.95.1 - Daphne 2.03 - Data Crow 3.12.5 - DejaVu Fonts 2.34 - DeltaCopy 1.4 - DVD-Ofimática Deluge 1.3.6 - DeSmuME 0.9.10 - Dia 0.97.2.2 - Diashapes 0.2.2 - digiKam 4.1.0 - Disk Imager 1.4 - DiskCryptor 1.1.836 - Ditto 3.19.24.0 - DjVuLibre 3.5.25.4 - DocFetcher 1.1.11 - DoISO 2.0.0.6 - DOSBox 0.74 - DosZip Commander 3.21 - Double Commander 0.5.10 beta - DrawPile 2014-07 0.9.1 - DVD Flick 1.3.0.7 - DVDStyler 2.7.2 - Eagle Mode 0.85.0 - EasyTAG 2.2.3 - Ekiga 4.0.1 2013.08.20 - Electric Sheep 2.7.b35 - eLibrary 2.5.13 - emesene 2.12.9 2012.09.13 - eMule 0.50.a - Eraser 6.0.10 - eSpeak 1.48.04 - Eudora OSE 1.0 - eViacam 1.7.2 - Exodus 0.10.0.0 - Explore2fs 1.08 beta9 - Ext2Fsd 0.52 - FBReader 0.12.10 - ffDiaporama 2.1 - FileBot 4.1 - FileVerifier++ 0.6.3 DVD-Ofimática es una recopilación de programas libres para Windows - FileZilla 3.8.1 - Firefox 30.0 - FLAC 1.2.1.b - FocusWriter 1.5.1 - Folder Size 2.6 - fre:ac 1.0.21.a dirigidos a la ofimática en general (ofimática, sonido, gráficos y vídeo, - Free Download Manager 3.9.4.1472 - Free Manga Downloader 0.8.2.325 - Free1x2 0.70.2 - Internet y utilidades).
    [Show full text]
  • Pymupdf Documentation Release 1.18.19
    PyMuPDF Documentation Release 1.18.19 Jorj X. McKie Sep 17, 2021 Contents 1 Introduction 1 1.1 Note on the Name fitz ..........................................2 1.2 License and Copyright..........................................2 1.3 Covered Version.............................................2 2 Installation 3 2.1 Step 1: Install MuPDF..........................................3 2.2 Step 2: Download and Generate PyMuPDF...............................3 3 Tutorial 5 3.1 Importing the Bindings..........................................5 3.2 Opening a Document...........................................5 3.3 Some Document Methods and Attributes................................6 3.4 Accessing Meta Data...........................................6 3.5 Working with Outlines..........................................6 3.6 Working with Pages...........................................7 3.6.1 Inspecting the Links, Annotations or Form Fields of a Page..................7 3.6.2 Rendering a Page........................................8 3.6.3 Saving the Page Image in a File................................8 3.6.4 Displaying the Image in GUIs.................................8 3.6.4.1 wxPython.......................................8 3.6.4.2 Tkinter.........................................9 3.6.4.3 PyQt4, PyQt5, PySide.................................9 3.6.5 Extracting Text and Images...................................9 3.6.6 Searching for Text....................................... 10 3.7 PDF Maintenance............................................ 10 3.7.1 Modifying,
    [Show full text]
  • Usability Evaluation Framework for Mobile Apps Using Source Code Analysis
    Usability Evaluation Framework for Mobile Apps Using Source Code Analysis Thesis submitted in partial fulfillment of the requirements for the degree of Master of Science in Computer Science & Engineering by Research by Neeraj Mathur 201350902 [email protected] International Institute of Information Technology Hyderabad - 500032, INDIA Feb, 2019 Copyright c Neeraj Mathur, 2019 All Rights Reserved International Institute of Information Technology Hyderabad, India CERTIFICATE It is certified that the work contained in this thesis, titled “Usability Evaluation Framework for Mobile Apps Using Source Code Analysis” by Neeraj Mathur, has been carried out under my supervision and is not submitted elsewhere for a degree. Date Adviser: Prof. Y. Raghu Reddy To my Parents Acknowledgments My journey in IIIT-Hyderabad has been a wonderful experience. It could not have been so without the support of many people. As I submit my MS thesis, I would like to extend my gratitude to all those people who helped me on successfully completing this journey. First of all, I want to thank my guide Prof.Y. Raghu Reddy, for accepting me as a student and constantly guiding me during MS Research program. His guidance has helped me improve not only as a researcher but also as a person. His help and support during difficult times strengthened and motivated me to move further. I thank my fellow research mates at Software Engineering Research Center: Lalit Mohan, Sai Anirudh, Shivam Khandelwal, Sai Gollapudi and Sridhar C for stimulating discussions. In particular, I am grateful to Lalit Mohan and Sai Anirudh for constantly guiding me and giving directions to my research work.
    [Show full text]