Machine Learning and Automated Theorem Proving

Total Page:16

File Type:pdf, Size:1020Kb

Machine Learning and Automated Theorem Proving UCAM-CL-TR-792 Technical Report ISSN 1476-2986 Number 792 Computer Laboratory Machine learning and automated theorem proving James P. Bridge November 2010 15 JJ Thomson Avenue Cambridge CB3 0FD United Kingdom phone +44 1223 763500 http://www.cl.cam.ac.uk/ c 2010 James P. Bridge This technical report is based on a dissertation submitted October 2010 by the author for the degree of Doctor of Philosophy to the University of Cambridge, Corpus Christi College. Technical reports published by the University of Cambridge Computer Laboratory are freely available via the Internet: http://www.cl.cam.ac.uk/techreports/ ISSN 1476-2986 Machine learning and automated theorem proving James P. Bridge Summary Computer programs to find formal proofs of theorems have a history going back nearly half a century. Originally designed as tools for mathematicians, modern applications of automated theorem provers and proof assistants are much more diverse. In particular they are used in formal methods to verify software and hardware designs to prevent costly, or life threatening, errors being introduced into systems from microchips to controllers for medical equipment or space rockets. Despite this, the high level of human expertise required in their use means that theo- rem proving tools are not widely used by non specialists, in contrast to computer algebra packages which also deal with the manipulation of symbolic mathematics. The work de- scribed in this dissertation addresses one aspect of this problem, that of heuristic selection in automated theorem provers. In theory such theorem provers should be automatic and therefore easy to use; in practice the heuristics used in the proof search are not universally optimal for all problems so human expertise is required to determine heuristic choice and to set parameter values. Modern machine learning has been applied to the automation of heuristic selection in a first order logic theorem prover. One objective was to find if there are any features of a proof problem that are both easy to measure and provide useful information for determining heuristic choice. Another was to determine and demonstrate a practical approach to making theorem provers truly automatic. In the experimental work, heuristic selection based on features of the conjecture to be proved and the associated axioms is shown to do better than any single heuristic. Additionally a comparison has been made between static features, measured prior to the proof search process, and dynamic features that measure changes arising in the early stages of proof search. Further work was done on determining which features are important, demonstrating that good results are obtained with only a few features required. Acknowledgments I would like to thank my two supervisors, Professor Lawrence Paulson and Dr Sean Holden, for their guidance, feedback and encouragement during my research. Thanks must also go to the author of the theorem prover \E", Dr Stephan Schulz, who has with great patience responded promptly to many e-mails. Contents 1 Motivation 11 1.1 The thesis . 11 1.2 Applications of automated theorem provers . 12 1.3 Choice of theorem prover . 13 1.3.1 Automation versus expressive power . 14 1.3.2 SAT solvers . 14 1.3.3 First order logic theorem provers . 15 1.3.4 Proof assistants . 15 1.3.5 Prover used . 15 1.4 Importance of heuristic selection . 16 1.5 Motivation for using machine learning . 16 1.6 Dissertation summary . 17 2 Background 19 2.1 Logic . 19 2.1.1 Logic levels or languages . 19 2.1.2 Proof methods . 25 2.1.3 Decidability and semi-decidability . 29 2.1.4 Expressive power . 30 2.2 ATPs versus proof assistants . 31 2.3 Resolution based theorem proving . 31 2.3.1 Resolution and related calculi . 31 2.3.2 Practical implementations . 36 2.4 Machine learning . 37 2.4.1 General concepts . 37 2.4.2 Machine learning approaches . 39 2.4.3 Decision trees . 40 CONTENTS CONTENTS 2.4.4 Linearly separable classes . 41 2.4.5 Perceptrons . 44 2.4.6 Margin . 45 2.4.7 Transforming the feature space . 45 2.4.8 Kernel functions arising from transformed space . 46 2.4.9 The support vector machine . 47 2.4.10 Nonseparable data and soft margin classifiers . 49 2.4.11 Alternatives to SVMs . 50 2.4.12 Feature selection . 51 2.5 Applying machine learning to theorem proving . 53 2.5.1 TEAMWORK and the E-theorem prover . 53 2.5.2 Neural networks and folding architecture networks . 55 2.5.3 Learning with symbols and large axiom libraries . 55 2.5.4 Proof planning (Omega project) . 56 2.6 Summary . 56 3 Methodology 59 3.1 Generic description of experimental method . 59 3.2 Data | conjectures to be proved . 60 3.3 Measuring features . 60 3.4 Dynamic and static features . 61 3.5 Theorem prover used . 62 3.6 Selecting the heuristics for the working set . 63 3.6.1 Clause selection within heuristics . 63 3.7 Fitting a support vector machine - SVMLight . 63 3.8 Kernel functions . 64 3.8.1 Linear basis function kernel . 64 3.8.2 Polynomial kernel . 65 3.8.3 Sigmoid tanh kernel . 65 3.8.4 Radial basis function kernel . 65 3.9 Custom software . 65 3.10 Overview of experimental work . 66 3.11 Computer hardware used . 66 3.12 Summary . 66 6 CONTENTS CONTENTS 4 Initial experiment 69 4.1 Classification problem . 70 4.2 Data used . 70 4.3 Heuristic used . 71 4.4 Running the theorem prover . 71 4.5 Training and test data sets . 71 4.6 Features measured . 72 4.7 Using SVMLight and kernel selection . 72 4.7.1 Linear kernel . 72 4.7.2 Radial basis function kernel . 73 4.7.3 Sigmoid tanh kernel . 74 4.7.4 Polynomial kernel . 74 4.7.5 Further investigation of the radial basis function . 74 4.8 Filtering features . 76 4.9 Results for reduced feature set . 77 4.10 Summary . 77 5 Heuristic selection experiment 79 5.1 Selecting a working set of heuristics . 79 5.2 Data used . 80 5.3 Feature sets . 81 5.4 Initial separate classifiers . 82 5.5 Automatic heuristic selection . 82 5.6 Performance measures for classifiers . 83 5.7 Unextended feature set experiments . 84 5.7.1 First classifications . 84 5.7.2 Identical learning and test sets . 88 5.7.3 First results of heuristic selection . 89 5.8 Experiments with extended feature sets . 91 5.8.1 Classifications with extended feature set . 91 5.8.2 Heuristic selection with extended feature set . 92 5.9 Further analysis . 94 5.10 Conclusions . 95 7 CONTENTS CONTENTS 6 Feature selection 97 6.1 Selectively removing features . 97 6.1.1 Ordering features for removal . 98 6.1.2 Including optimisation and other improvements . 99 6.2 Testing small and large feature subsets . 100 6.2.1 Improving the data sets . 101 6.2.2 Enumerating the subsets . 101 6.2.3 Coding for parallel execution . 101 6.2.4 Looking at large subsets . 102 6.2.5 Analysis of three feature subset results . 102 6.2.6 Partial extension to four feature subsets . 103 6.2.7 Results for fixed heuristics . 104 6.2.8 Varying gamma for the best subsets . 105 6.2.9 The best two features (7 and 52) . 106 6.3 Small subset results without H0 . 106 6.3.1 Results without H0 . 106 6.3.2 Feature scores without H0 . 107 6.3.3 The best three features without H0 (10, 14 and 15) . 107 6.4 Random heuristic selection . ..
Recommended publications
  • Safety and Conservativity of Definitions in HOL and Isabelle/HOL
    This is a repository copy of Safety and conservativity of definitions in HOL and Isabelle/HOL. White Rose Research Online URL for this paper: http://eprints.whiterose.ac.uk/161207/ Version: Published Version Article: Kunčar, O. and Popescu, A. orcid.org/0000-0001-8747-0619 (2018) Safety and conservativity of definitions in HOL and Isabelle/HOL. Proceedings of the ACM on Programming Languages, 2 (POPL). 24. ISSN 2475-1421 https://doi.org/10.1145/3158112 Reuse This article is distributed under the terms of the Creative Commons Attribution (CC BY) licence. This licence allows you to distribute, remix, tweak, and build upon the work, even commercially, as long as you credit the authors for the original work. More information and the full terms of the licence here: https://creativecommons.org/licenses/ Takedown If you consider content in White Rose Research Online to be in breach of UK law, please notify us by emailing [email protected] including the URL of the record and the reason for the withdrawal request. [email protected] https://eprints.whiterose.ac.uk/ Safety and Conservativity of Definitions in HOL and Isabelle/HOL ONDŘEJ KUNČAR, Technische Universität München, Germany ANDREI POPESCU, Middlesex University London, United Kingdom and Institute of Mathematics Simion Stoilow of the Romanian Academy, Romania Deinitions are traditionally considered to be a safe mechanism for introducing concepts on top of a logic known to be consistent. In contrast to arbitrary axioms, deinitions should in principle be treatable as a form of abbreviation, and thus compiled away from the theory without losing provability.
    [Show full text]
  • Why Machines Don't
    KI - Künstliche Intelligenz https://doi.org/10.1007/s13218-019-00599-w SURVEY Why Machines Don’t (yet) Reason Like People Sangeet Khemlani1 · P. N. Johnson‑Laird2,3 Received: 15 February 2019 / Accepted: 19 June 2019 © This is a U.S. government work and not under copyright protection in the U.S.; foreign copyright protection may apply 2019 Abstract AI has never come to grips with how human beings reason in daily life. Many automated theorem-proving technologies exist, but they cannot serve as a foundation for automated reasoning systems. In this paper, we trace their limitations back to two historical developments in AI: the motivation to establish automated theorem-provers for systems of mathematical logic, and the formulation of nonmonotonic systems of reasoning. We then describe why human reasoning cannot be simulated by current machine reasoning or deep learning methodologies. People can generate inferences on their own instead of just evaluating them. They use strategies and fallible shortcuts when they reason. The discovery of an inconsistency does not result in an explosion of inferences—instead, it often prompts reasoners to abandon a premise. And the connectives they use in natural language have diferent meanings than those in classical logic. Only recently have cognitive scientists begun to implement automated reasoning systems that refect these human patterns of reasoning. A key constraint of these recent implementations is that they compute, not proofs or truth values, but possibilities. Keywords Reasoning · Mental models · Cognitive models 1 Introduction problems for mathematicians). Their other uses include the verifcation of computer programs and the design of com- The great commercial success of deep learning has pushed puter chips.
    [Show full text]
  • An Overview of Automated Reasoning
    An overview of automated reasoning John Harrison Intel Corporation 3rd February 2014 (10:30{11:30) Talk overview I What is automated reasoning? I Early history and taxonomy I Automation, its scope and limits I Interactive theorem proving I Hobbes (1651): \Reason . is nothing but reckoning (that is, adding and subtracting) of the consequences of general names agreed upon, for the marking and signifying of our thoughts." I Leibniz (1685) \When there are disputes among persons, we can simply say: Let us calculate [calculemus], without further ado, to see who is right." Nowadays, by `automatic and algorithmic' we mean `using a computer program'. What is automated reasoning? Attempting to perform logical reasoning in an automatic and algorithmic way. An old dream! I Leibniz (1685) \When there are disputes among persons, we can simply say: Let us calculate [calculemus], without further ado, to see who is right." Nowadays, by `automatic and algorithmic' we mean `using a computer program'. What is automated reasoning? Attempting to perform logical reasoning in an automatic and algorithmic way. An old dream! I Hobbes (1651): \Reason . is nothing but reckoning (that is, adding and subtracting) of the consequences of general names agreed upon, for the marking and signifying of our thoughts." Nowadays, by `automatic and algorithmic' we mean `using a computer program'. What is automated reasoning? Attempting to perform logical reasoning in an automatic and algorithmic way. An old dream! I Hobbes (1651): \Reason . is nothing but reckoning (that is, adding and subtracting) of the consequences of general names agreed upon, for the marking and signifying of our thoughts." I Leibniz (1685) \When there are disputes among persons, we can simply say: Let us calculate [calculemus], without further ado, to see who is right." What is automated reasoning? Attempting to perform logical reasoning in an automatic and algorithmic way.
    [Show full text]
  • Submission Data for 2020-2021 CORE Conference Ranking Process International Joint Conference on Automated Reasoning
    Submission Data for 2020-2021 CORE conference Ranking process International Joint Conference on Automated Reasoning Franz Baader, Viorica Sofronie-Stokkermans Conference Details Conference Title: International Joint Conference on Automated Reasoning Acronym : IJCAR Rank: A* Requested Rank Rank: A* Recent Years Proceedings Publishing Style Proceedings Publishing: series Link to most recent proceedings: https://link.springer.com/conference/ijcar Further details: The proceedings are published in Springer Lecture Notes in Computer Science (LNCS) subseries Lecture Notes in Artificial Intelligence (LNAI). The proceedings usually contain regular papers and system descriptions accepted at the main conference. (IJCAR 2020 also contained a small number of short papers, which were clearly marked as short papers.) Most Recent Years Most Recent Year Year: 2018 URL: http://ijcar2018.org Location: Oxford, UK Papers submitted: 108 Papers published: 46 Acceptance rate: 43 Source for numbers: http://ijcar.org//conferences General Chairs Name: Ian Horrocks Affiliation: University of Oxford, UK Gender: M H Index: 96 GScholar url: https://scholar.google.com/citations?hl=en&user=0ypdmcYAAAAJ DBLP url: https://dblp.org/pid/h/IanHorrocks.html Program Chairs Name: Didier Galmiche Affiliation: UniversitÃľ de Lorraine, France Gender: M H Index: 19 (according to Semantic Scholar) GScholar url: https://scholar.google.com/scholar?hl=de&as_sdt=0%2C5&q=Didier+Galmiche&oq= DBLP url: https://dblp.org/pid/82/76.html 1 Second Most Recent Year Year: 2016 URL: https://www.uc.pt/en/congressos/ijcar2016
    [Show full text]
  • Towards a Practical, Verified Kernel
    Towards a Practical, Verified Kernel Kevin Elphinstone∗ Gerwin Klein∗ PhilipDerrin TimothyRoscoe† Gernot Heiser∗‡ National ICT Australia§ Abstract properties such as guaranteed termination of system calls, and the kernel never throwing an internal exception. In the paper we examine one of the issues in designing, Successful OS kernels have generally been the result of specifying, implementing and formally verifying a small careful attention to performance issues, and repeatedly it- operating system kernel — how to provide a productive erating bottom-up implementations of low-level function- and iterative developmentmethodology for both operating ality, in some cases changing high-level interfaces and system developers and formal methods practitioners. functionality to accommodate implementation constraints We espouse the use of functional programming lan- and performance goals. This is, unfortunately, in conflict guages as a medium for prototyping that is readily with formal methods, which typically work by top-down amenable to formalisation with a low barrier to entry for refining models of system properties, and rarely deal with kernel developers, and report early experience in the pro- low-level implementation features. cess of designing and building seL4: a new, practical, and This paper describes our approach to resolving this ten- formally verified microkernel. sion, and reports on our experience so far in applying it to seL4. We use a high-level language (Literate Haskell) to simultaneously develop a specification of the kernel and a 1 Introduction reference implementation for evaluation and testing. The implementation can be used in conjunction with a sim- We describe our approachto constructing seL4 — a useful ulator such as QEMU for running real application bina- yet formally verified operating system kernel, by means of ries, while the specification generates input to an interac- a novel development process which aims to reconcile the tive theorem prover (Isabelle) for formal proof of proper- conflicting methodologies of kernel developers and for- ties.
    [Show full text]
  • Verification of Sequential Imperative Programs in Isabelle/HOL
    f f f f f f f f f f f f f f f f f f f f Verification of Sequential Imperative Programs in Isabelle/HOL Norbert Schirmer Lehrstuhl fur¨ Software & Systems Engineering Institut fur¨ Informatik Technische Universitat¨ Munchen¨ Lehrstuhl fur¨ Software & Systems Engineering Institut fur¨ Informatik Technische Universitat¨ Munchen¨ Verification of Sequential Imperative Programs in Isabelle/HOL Norbert Schirmer Vollstandiger¨ Abdruck der von der Fakultat¨ fur¨ Informatik der Technischen Universitat¨ Munchen¨ zur Erlangung des akademischen Grades eines Doktors der Naturwissenschaften (Dr. rer. nat.) genehmigten Dissertation. Vorsitzender: Univ.-Prof. Dr. Helmut Seidl Prufer¨ der Dissertation: 1. Univ.-Prof. Tobias Nipkow, Ph.D. 2. Univ.-Prof. Dr. Wolfgang J. Paul Universitat¨ des Saarlandes Die Dissertation wurde am 31.10.2005 bei der Technischen Universitat¨ Munchen¨ eingereicht und durch die Fakultat¨ fur¨ Informatik am 21.04.2006 angenommen. i Kurzfassung Ziel der Dissertation ist es, eine Verifikationsumgebung fur¨ sequentielle imperative Programme zu schaffen. Zunachst¨ wird unabhangig¨ von einer konkreten Program- miersprache ein allgemeines Sprachmodell entwickelt, das ausdrucksstark genug ist um alle gangigen¨ Programmiersprachkonzepte abzudecken: Gegenseitig rekur- sive Prozeduren, abrupte Terminierung und Ausnahmebehandlung, Laufzeitfeh- ler, lokale und globale Variablen, Zeiger und Halde, Ausdrucke¨ mit Seiteneffekten, Zeiger auf Prozeduren, partielle Applikation, dynamischer Methoden Aufruf und unbeschrankter¨ Indeterminismus. Fur¨ dieses Sprachmodell wird eine Hoare Logik sowohl fur¨ partielle alsauch fur¨ totale Korrektheit entwickelt. Darauf aufbauend wird ein Verifikations-Bedingungs- Generator implementiert. Die Hoare Logik erlaubt die Integration von statischer Programmanalyse und Software Model Checkern in die Verifikation. Desweiteren wird eine Teilsprache von C in die Verifikationsumgebung eingebet- tet, um die Durchgangigkeit¨ zu einer realen Programmiersprache zu demonstrieren.
    [Show full text]
  • Automated Reasoning for Explainable Artificial Intelligence∗
    EPiC Series in Computing Volume 51, 2017, Pages 24–28 ARCADE 2017. 1st International Workshop on Automated Reasoning: Challenges, Appli- cations, Directions, Exemplary Achievements Automated Reasoning for Explainable Artificial Intelligence∗ Maria Paola Bonacina1 Dipartimento di Informatica Universit`adegli Studi di Verona Strada Le Grazie 15 I-37134 Verona, Italy, EU [email protected] Abstract Reasoning and learning have been considered fundamental features of intelligence ever since the dawn of the field of artificial intelligence, leading to the development of the research areas of automated reasoning and machine learning. This paper discusses the relationship between automated reasoning and machine learning, and more generally be- tween automated reasoning and artificial intelligence. We suggest that the emergence of the new paradigm of XAI, that stands for eXplainable Artificial Intelligence, is an oppor- tunity for rethinking these relationships, and that XAI may offer a grand challenge for future research on automated reasoning. 1 Artificial Intelligence, Automated Reasoning, and Ma- chine Learning Ever since the beginning of artificial intelligence, scientists concurred that the capability to learn and the capability to reason about problems and solve them are fundamental pillars of intelligent behavior. Similar to many subfields of artificial intelligence, automated reasoning and machine learning have grown into highly specialized research areas. Both have experienced the dichotomy between the ideal of imitating human intelligence, whereby the threshold of success would be the indistinguishability of human and machine, as in the Turing’s test [12], and the ideal that machines can and should learn and reason in their own way. According to the latter ideal, the measure of success is independent of similarity to human behavior.
    [Show full text]
  • The Ratiolog Project: Rational Extensions of Logical Reasoning
    The RatioLog Project: Rational Extensions of Logical Reasoning Ulrich Furbach · Claudia Schon · Frieder Stolzenburg · Karl-Heinz Weis · Claus-Peter Wirth Abstract Higher-level cognition includes logical reasoning 1 Rational Reasoning and Question Answering and the ability of question answering with common sense. The RatioLog project addresses the problem of rational rea- The development of formal logic played a big role in the soning in deep question answering by methods from auto- field of automated reasoning, which led to the development mated deduction and cognitive computing. In a first phase, of the field of artificial intelligence (AI). Applications of au- we combine techniques from information retrieval and ma- tomated deduction in mathematics have been investigated chine learning to find appropriate answer candidates from from the early years on. Nowadays automated deduction the huge amount of text in the German version of the free techniques are successfully applied in hard- and software encyclopedia “Wikipedia”. In a second phase, an automated verification and many other areas (for an overview see [2]). theorem prover tries to verify the answer candidates on In contrast to formal logical reasoning, however, human the basis of their logical representations. In a third phase reasoning does not strictly follow the rules of classical logic. — because the knowledge may be incomplete and inconsis- Reasons may be incomplete knowledge, incorrect beliefs, tent —, we consider extensions of logical reasoning to im- and inconsistent norms. From the very beginning of AI re- prove the results. In this context, we work toward the appli- search, there has been a strong emphasis on incorporating cation of techniques from human reasoning: We employ de- mechanisms for rationality, such as abductive or defeasible feasible reasoning to compare the answers w.r.t.
    [Show full text]
  • Automated Reasoning on the Web
    Automated Reasoning on the Web Slim Abdennadher1, Jos´eJ´ulioAlves Alferes2, Grigoris Antoniou3, Uwe Aßmann4, Rolf Backofen5, Cristina Baroglio6, Piero A. Bonatti7, Fran¸coisBry8, W lodzimierz Drabent9, Norbert Eisinger8, Norbert E. Fuchs10, Tim Geisler11, Nicola Henze12, Jan Ma luszy´nski4, Massimo Marchiori13, Alberto Martelli14, Sara Carro Mart´ınez15, Wolfgang May16, Hans J¨urgenOhlbach8, Sebastian Schaffert8, Michael Schr¨oder17, Klaus U. Schulz8, Uta Schwertel8, and Gerd Wagner18 1 German University in Cairo (Egypt), 2 New University of Lisbon (Portugal), 3 Institute of Computer Science, Foundation for Research and Technology – Hel- las (Greece), 4 University of Link¨oping (Sweden), 5 University of Jena (Germany), 6 University of Turin (Italy), 7 University of Naples (Italy), 8 University of Mu- nich (Germany), 9 Polish Academy of Sciences (Poland), 10 University of Zurich (Switzerland), 11 webXcerpt (Germany), 12 University of Hannover (Germany), 13 University of Venice (Italy), 14 University of Turin (Italy), 15 Telef´onica Investi- gaci´ony Desarollo (Spain), 16 University of G¨ottingen(Germany), 17 University of Dresden (Germany), 18 University of Eindhoven (The Netherlands) Abstract Automated reasoning is becoming an essential issue in many Web systems and applications, especially in emerging Semantic Web applications. This article first discusses reasons for this evolution. Then, it presents research issues currently investigated towards automated reasoning on the Web and it introduces into selected applications demonstrating the practical impact of the approach. Finally, it introduces a research endeavor called REWERSE (cf. http://rewerse.net) recently launched by the authors of this article which is concerned with developing automated reasoning methods and tools for the Web as well as demonstrator applications.
    [Show full text]
  • Curriculum Vitae of Danny Dorling
    January 2021 1993 to 1996: British Academy Fellow, Department of Geography, Newcastle University 1991 to 1993: Joseph Rowntree Foundation Curriculum Vitae Fellow, Many Departments, Newcastle University 1987 to 1991: Part-Time Researcher/Teacher, Danny Dorling Geography Department, Newcastle University Telephone: +44(0)1865 275986 Other Posts [email protected] skype: danny.dorling 2020-2023 Advisory Board Member: ‘The political economies of school exclusion and their consequences’ (ESRC project ES/S015744/1). Current appointment: Halford Mackinder 2020-Assited with the ‘Time to Care’ Oxfam report. Professor of Geography, School of 2020- Judge for data visualisation competition Geography and the Environment, The Nuffield Trust, the British Medical Journal, the University of Oxford, South Parks Road, British Medical Association and NHS Digital. Oxford, OX1 3QY 2019- Judge for the annual Royal Geographical th school 6 form essay competition. 2019 – UNDP (United Nations Development Other Appointments Programme) Human Development Report reviewer. 2019 – Advisory Broad member: Sheffield Visiting Professor, Department of Sociology, University Nuffield project on an Atlas of Inequality. Goldsmiths, University of London, 2013-2016. 2019 – Advisory board member - Glasgow Centre for Population Health project on US mortality. Visiting Professor, School of Social and 2019- Editorial Board Member – Bristol University Community Medicine, University of Bristol, UK Press, Studies in Social Harm Book Series. 2018 – Member of the Bolton Station Community Adjunct Professor in the Department of Development Partnership. Geography, University of Canterbury, NZ 2018-2022 Director of the Graduate School, School of Geography and the Environment, Oxford. 2018 – Member of the USS review working group of the Council of the University of Oxford.
    [Show full text]
  • Frege's Influence on the Modern Practice of Doing Mathematics
    논리연구 20-1(2017) pp. 97-112 Frege’s influence on the modern practice of doing mathematics 1) Gyesik Lee 【Abstract】We discuss Frege’s influence on the modern practice of doing mathematical proofs. We start with explaining Frege’s notion of variables. We also talk of the variable binding issue and show how successfully his idea on this point has been applied in the field of doing mathematics based on a computer software. 【Key Words】Frege, Begriffsschrift, variable binding, mechanization, formal proofs Received: Nov. 11, 2016. Revised: Feb. 10, 2017. Accepted: Feb. 11, 2017. 98 Gyesik Lee 1. Introduction Around the turn of the 20th century, mathematicians and logicians were interested in a more exact investigation into the foundation of mathematics and soon realized that ordinary mathematical arguments can be represented in formal axiomatic systems. One prominent figure in this research was Gottlob Frege. His main concern was twofold: (1) whether arithmetical judgments could be proved in a purely logical manner, and (2) how far one could progress in arithmetic merely by using the laws of logic. In Begriffsschrift (Frege, 1879), he invented a special kind of language system, where statements can be proved as true based only upon some general logical laws and definitions. This system is then used later in the two volumes of Grundgesetze der Arithmetik (Frege, 1893, 1903), where he provided and analyzed a formal system for second order arithmetic. Although the system in (Frege, 1893, 1903) is known to be inconsistent, it contains all of the essential materials to provide a fundamental basis for dealing with propositions of arithmetic based on an axiomatic system.
    [Show full text]
  • Reasoning-Driven Question-Answering for Natural Language Understanding Daniel Khashabi University of Pennsylvania, [email protected]
    University of Pennsylvania ScholarlyCommons Publicly Accessible Penn Dissertations 2019 Reasoning-Driven Question-Answering For Natural Language Understanding Daniel Khashabi University of Pennsylvania, [email protected] Follow this and additional works at: https://repository.upenn.edu/edissertations Part of the Artificial Intelligence and Robotics Commons Recommended Citation Khashabi, Daniel, "Reasoning-Driven Question-Answering For Natural Language Understanding" (2019). Publicly Accessible Penn Dissertations. 3271. https://repository.upenn.edu/edissertations/3271 This paper is posted at ScholarlyCommons. https://repository.upenn.edu/edissertations/3271 For more information, please contact [email protected]. Reasoning-Driven Question-Answering For Natural Language Understanding Abstract Natural language understanding (NLU) of text is a fundamental challenge in AI, and it has received significant attention throughout the history of NLP research. This primary goal has been studied under different tasks, such as Question Answering (QA) and Textual Entailment (TE). In this thesis, we investigate the NLU problem through the QA task and focus on the aspects that make it a challenge for the current state-of-the-art technology. This thesis is organized into three main parts: In the first part, we explore multiple formalisms to improve existing machine comprehension systems. We propose a formulation for abductive reasoning in natural language and show its effectiveness, especially in domains with limited training data. Additionally, to help reasoning systems cope with irrelevant or redundant information, we create a supervised approach to learn and detect the essential terms in questions. In the second part, we propose two new challenge datasets. In particular, we create two datasets of natural language questions where (i) the first one requires reasoning over multiple sentences; (ii) the second one requires temporal common sense reasoning.
    [Show full text]