Robust Textual Inference Via Learning and Abductive Reasoning

Total Page:16

File Type:pdf, Size:1020Kb

Robust Textual Inference Via Learning and Abductive Reasoning RobustTextualInferenceViaLearningandAbductiveReasoning Rajat Raina, Andrew Y. Ng and Christopher D. Manning Computer Science Department Stanford University Stanford, CA 94305 Abstract potentially be useful for many of the above text processing We present a system for textual inference (the task of infer- applications (PASCAL RTE Challenge, PASCAL Recogniz- ring whether a sentence follows from another text) that uses ing Textual Entailment Challenge 2005). Along these lines, learning and a logical-formula semantic representation of the the basic task we consider is to decide whether the mean- text. More precisely, our system begins by parsing and then ing of one natural language fragment can be approximately transforming sentences into a logical formula-like represen- inferred from another. The most general case of natural lan- tation similar to the one used by (Harabagiu et al., 2000). guage reasoning might require rich commonsense reason- An abductive theorem prover then tries to find the minimum ing, which is known to be hard. Our focus is on inferences “cost” set of assumptions necessary to show that one state- that can be made using language features and semantics, ment follows from the other. These costs reflect how likely possibly with some world knowledge. different assumptions are, and are learned automatically us- In this paper, we present an abductive inference algorithm ing information from syntactic/semantic features and from linguistic resources such as WordNet. If one sentence follows to perform semantic reasoning. More precisely, it learns how from the other given only highly plausible, low cost assump- to combine information from several knowledge sources to tions, then we conclude that it can be inferred. Our approach decide what are plausible “assumptions” during its reason- can be viewed as combining statistical machine learning and ing process. Thus, our system is capable of performing flex- classical logical reasoning, in the hope of marrying the ro- ible semantic reasoning, given just a training set of inferable bustness and scalability of learning with the preciseness and sentences (see Table 1 for examples). elegance of logical theorem proving. We give experimental Even beyond natural language inference, our methods can results from the recent PASCAL RTE 2005 challenge compe- be used for robust inference with other logical representa- tition on recognizing textual inferences, where a system us- tions. Logical inference is well known to frequently be brit- ing this inference algorithm achieved the highest confidence tle and have poor coverage, especially when it uses axioms weighted score. that must be manually tweaked; this limits its applicability to real-world tasks. However, by using abduction and learning, Introduction we combine the elegance and preciseness of the logical for- The performance of many text processing applications malism with the robustness and scalability of a statistically would improve substantially if they were able to rea- trained system. son with natural language representations. These applica- tions typically use information specified in natural language Problem definition (e.g., broad-coverage question answering), and process in- We describe our algorithms on the task of “recognizing tex- put/output in natural language (e.g., document summariza- tual entailment” (PASCAL RTE Challenge, PASCAL Rec- tion). ognizing Textual Entailment Challenge 2005), but the ideas Several semantic reasoning tasks crop up within these ap- extend easily to the general case of textual inference. plications. For example, in an automatic question answering Each example in this domain consists of two parts: one or system, a question is given and we might require the sys- more text sentences and a hypothesis sentence. The task is tem to find a piece of text from which the answer can be to identify whether the hypothesis sentence can be plausibly inferred. In a document summarization system, we might inferred (“entailed”) from the text sentences. Throughout stipulate that the summary should not contain any sentences this section, we use the following as our running example: that can be inferred from the rest of the summary. These se- TEXT: Bob purchased an old convertible. mantic reasoning tasks are largely addressed one application HYPOTHESIS: Bob bought an old car. at a time, despite the many intersections. Actual examples in the task are significantly more complex; There have been recent suggestions to isolate the core see the examples in Table 1. reasoning aspects and formulate a generic task that could Many successful text applications rely only on “keyword” Copyright c 2005, American Association for Artificial Intelli- (or phrasal) counting. But information retrieval techniques gence (www.aaai.org). All rights reserved. that use only word counts (such as bag-of-words representa- AAAI-05 / 1099 Class Text Hypothesis Entailed Comparable A Filipino hostage in Iraq was released. A Filipino hostage was freed in Iraq. Yes Documents Information Global oil prices clung near their highest levels in Oil prices rise. Yes Extraction at least 21 years yesterday after Russian oil giant Yukos was ordered to stop sales, . Information Spain pulled its 1,300 troops from Iraq last month. Spain sends troops to Iraq. No Retrieval Machine Trans- The economy created 228,000 new jobs after a dis- The economy created 228,000 jobs af- No lation appointing 112,000 in June. ter disappointing the 112,000 in June. Paraphrase Clinton is a very charismatic person. Clinton is articulate. Yes Acquisition Question VCU School of the Arts In Qatar is located in Qatar is located in Doha. No Answering Doha, the capital city of Qatar. Reading Eating lots of foods that are a good source of fiber Fiber improves blood sugar control. Yes Comprehension may keep your blood glucose from rising fast after you eat. Table 1: Some illustrative examples from different classes of the PASCAL RTE dataset (PASCAL RTE Challenge, PASCAL Recognizing Textual Entailment Challenge 2005). tions) are not well suited to such semantic reasoning tasks. S (purchased) For example, a successful system must differentiate between the hypothesis above and a similar-looking one which can- (Bob) (purchased) not be inferred: NP VP HYPOTHESIS: Old Bob bought a car. (convertible) Thus, rather than relying on keyword counting, we use a NP significantly richer representation for the syntax of the sen- tence, and augment it with semantic annotations. We choose a logical formula-like representation similar to (Harabagiu, NNP VBD DT JJ NN Pasca, & Maiorano, 2000; Moldovan et al., 2003), which allows us to pose the textual inference problem as one of “approximate” (or, more formally, abductive) logical Bob purchased an old convertible inference. For example, the sentences above become: (9 A,B,C) Bob(A) ^ convertible(B) ^ old(B) ^ purchased(C,A,B) (9 X,Y,Z) Bob(X) ^ car(Y) ^ old(Y) ^ Figure 1: Procedure to discover syntactic dependencies, il- bought(Z,X,Y) lustrated on the parse tree for an example sentence. Heads In our notation, C and Z are event variables that denote the are marked on nonterminal nodes in parentheses. The dotted events of purchasing and buying, respectively. lines show the dependencies discovered. With this representation, the hypothesis is entailed by the text if and only if it can be logically proved from the latter. However, the main obstacle to using such logical infer- scalable than one which requires axioms that are always ences is that most non-trivial entailments require making true (Moldovan et al., 2003). Given this cost model, we can certain “leap-of-faith” assumptions. Thus, in order to cor- then use a search procedure to find the minimum-cost proof, rectly infer the entailment above, one must either know or and judge entailment based on this cost. assume that “a convertible is a car”. Building on (Hobbs et One of the challenges is to obtain a cost model for a large al., 1993), we propose an algorithm in which each “assump- set of assumptions. Such costs have typically been manually tion” is associated with a cost, and a hypothesis is plausible tuned in previous work, or have been left out completely. We if it has a simple—meaning low-cost—proof. describe a learning algorithm that automatically discovers Instead of representing all possible leaps-of-faith as ex- “good” costs given a training set of labeled examples (as in plicit logical axioms (“rules”), we take the view that these Table 1). leaps-of-faith correspond to assumptions about the real world and have some degree of plausibility; we assign a Representation cost to each assumption to quantify its plausibility. The cost In this section, we sketch how our logical representation is of an assumption is computed using a cost model that in- derived from raw English text. tegrates features of the assumption based on many knowl- The first step in processing a sentence is to construct a edge sources. We believe this approach is more robust and syntactic dependency graph. The sentence is parsed using AAAI-05 / 1100 the parser in (Klein & Manning, 2003). Hand-written rules the negation of the goal logical formula to a knowledge are used to find the heads of all nodes in this parse tree (these base consisting of the given axioms, and then deriving reles are an adaptation of those in Collins, 1999). a null clause through successive resolution steps. This This procedure implicitly discovers syntactic relation- corresponds to justifying (i.e., “proving”) the goal by ships between the words in the sentence, because when a deriving a contradiction for its negation. Thus, to use the word is chosen as the head, its siblings must be syntactic method of resolution refutation, we construct a knowledge modifiers to that word. These relationships can be laid out base with the text logical formula and the negation of the as a dependency graph where each node is a word/phrase hypothesis logical formula; finding a proof then corresponds and the links represent particular dependencies.
Recommended publications
  • Complexity Classifications for Propositional Abduction in Post's
    Complexity Classifications for Propositional Abduction in Post’s Framework∗ Nadia Creignou1, Johannes Schmidt1, Michael Thomas2 1 LIF, UMR CNRS 6166, Aix-Marseille Universit´e 163, Avenue de Luminy, 13288 Marseille Cedex 9, France [email protected] [email protected] 2 Institut f¨ur Theoretische Informatik, Gottfried Wilhelm Leibniz Universit¨at Appelstr. 4, 30167 Hannover, Germany [email protected] Abstract. In this paper we investigate the complexity of abduction, a fundamental and important form of non-monotonic reasoning. Given a knowledge base explaining the world’s behavior it aims at finding an explanation for some observed manifestation. In this paper we consider propositional abduction, where the knowledge base and the manifesta- tion are represented by propositional formulae. The problem of deciding p whether there exists an explanation has been shown to be Σ2 -complete in general. We focus on formulae in which the allowed connectives are taken from certain sets of Boolean functions. We consider different vari- ants of the abduction problem in restricting both the manifestations and the hypotheses. For all these variants we obtain a complexity classifica- tion for all possible sets of Boolean functions. In this way, we identify easier cases, namely NP-complete, coNP-complete and polynomial cases. Thus, we get a detailed picture of the complexity of the propositional abduction problem, hence highlighting sources of intractability. Further, we address the problem of counting the explanations and draw a com- plete picture for the counting complexity. Keywords: abduction, computational complexity, Post’s lattice, propo- sitional logic, boolean connective arXiv:1006.4923v1 [cs.CC] 25 Jun 2010 ∗ Supported by ANR Algorithms and complexity 07-BLAN-0327-04 and DFG grant VO 630/6-1.
    [Show full text]
  • Abduction, Reason, and Science Abduction, Reason, and Science Processes of Discovery and Explanation
    Abduction, Reason, and Science Abduction, Reason, and Science Processes of Discovery and Explanation Lorenzo Magnani University of Pavia Pavia, Italy, and Georgia Institute of Technology Atlanta, Georgia Springer Science+Business Media, LLC Library of Congress Cataloging-in-Publication Data Magnani, Lorenzo Abduction, reason, and ,cience: processes of discovcry and explanation/Lorenzo Magnani. p. cm. IncIudes bibliographical references and index. ISBN 978-1-4613-4637-1 ISBN 978-1-4419-8562-0 (eBook) DOI 10.1007/978-1-4419-8562-0 1. Science-Philosophy. 2. Abduction (Logic). 3. Discoveries in science. I. Tirle. Q175.32.A24 M34 2001 501-dc21 00-052061 Front cover: Descartes's explanation of the rainbow (from his Meteora, 1656). ISBN 978-1-4613-4637-1 © 2001 Springer Science+Business Media New York Originally published by Kluwer Academic / Plenum Publishers, New York in 2001 Softcover reprint of the hardcover 1st edition 1990 http://www.wkap.nl/ 1098765432 A c.I.P. record for this book is available from the Library of Congress. AII rights reserved No par! of this book may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, electronic, mechanical, photocopying, microfilming, recording, or otherwise, without wrilten permis sion from the Publisher To my daughter Giovanna Science does not rest upon solid bedrock. The bold structure of its theories rises, as it were, above a swamp. It is like a building erected on piles. The piles are driven down from above into the swamp, but not down to any natural or "given" base; and if we stop driving the piles deeper, it is not because we have reached firm ground.
    [Show full text]
  • European Journal of Pragmatism and American Philosophy, X-1 | 2018 Eco and Peirce on Abduction 2
    European Journal of Pragmatism and American Philosophy X-1 | 2018 Eco and Pragmatism Eco and Peirce on Abduction Francesco Bellucci Electronic version URL: http://journals.openedition.org/ejpap/1122 DOI: 10.4000/ejpap.1122 ISSN: 2036-4091 Publisher Associazione Pragma Electronic reference Francesco Bellucci, « Eco and Peirce on Abduction », European Journal of Pragmatism and American Philosophy [Online], X-1 | 2018, Online since 20 July 2018, connection on 19 April 2019. URL : http:// journals.openedition.org/ejpap/1122 ; DOI : 10.4000/ejpap.1122 This text was automatically generated on 19 April 2019. Licence Creative Commons Author retains copyright and grants the European Journal of Pragmatism and American Philosophy right of first publication with the work simultaneously licensed under a Creative Commons Attribution- NonCommercial-NoDerivatives 4.0 International License. Eco and Peirce on Abduction 1 Eco and Peirce on Abduction Francesco Bellucci 1 One of the key concepts that Eco took from Peirce is that of “abduction,” or reasoning to an explanatory hypothesis. This concept is central to his foundation of a semiotic or interpretative semantics, one of whose component is a theory of inference, and especially of abductive inference. Though there is probably only one “official” place where Eco expounds his theory of abductive reasoning – the article “Horns, Hooves, and Insteps” in The Sign of Three – yet Eco’s theory is spanned over tens of papers and books, including his fictional books. 2 Eco’s fictional books have often been considered as, to say the least, “profoundly connected” with his scholarly work. But Paolucci has persuasively argued that not only Eco’s philosophical work influenced his fictional work, but also, and more crucially, that his fictional works are an integral part of his philosophy: “[t]he union of the theoretical and nontheoretical works constitute Eco’s philosophical legacy, his ‘philosophy’.” (Paolucci 2017a: 254).
    [Show full text]
  • Abductive Reasoning: Challenges Ahead
    Abductive Reasoning: Challenges Ahead Atocha ALISEDA BIBLID [0495-4548 (2007) 22: 60; pp. 261-270] ABSTRACT: The motivation behind the collection of papers presented in this THEORIA forum on Abductive reasoning is my book Abductive Reasoning: Logical Investigations into the Processes of Discovery and Explanation. These contributions raise fundamental questions. One of them concerns the conjectural character of abduction. The choice of a logical framework for abduction is also discussed in detail, both its inferential aspect and search strategies. Abduction is also analyzed as inference to the best explanation, as well as a process of epistemic change, both of which chal- lenge the argument-like format of abduction. Finally, the psychological question of whether humans reason abduc- tively according to the models proposed is also addressed. I offer a brief summary of my book and then comment on and respond to several challenges that were posed to my work by the contributors to this issue. Keywords: abduction, abductive inference, discovery, heuristics. A Summary In Abductive Reasoning: Logical Investigations into Discovery and Explanation, I offer a logical analysis of a particular type of scientific reasoning, namely abduction, that is, reasoning from an observation to its possible explanations. This approach naturally leads to connec- tions with theories of explanation and empirical progress in the philosophy of science, to computationally oriented theories of belief change in artificial intelligence, and to the phi- losophical position known as Pragmatism, proposed by Charles Peirce, to whom the term abduction owes its name. More precisely, the book is divided into three parts: (I) Conceptual Framework, (II) Logical Foundations, and (III) Applications, each of which is briefly described in what follows.
    [Show full text]
  • Abductive Reasoning in Multiple Fault Diagnosis
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by ScholarlyCommons@Penn University of Pennsylvania ScholarlyCommons Technical Reports (CIS) Department of Computer & Information Science June 1988 Abductive Reasoning in Multiple Fault Diagnosis Timothy Finin Unisys, Inc. Gary Morris Internal Revenue Service Follow this and additional works at: https://repository.upenn.edu/cis_reports Recommended Citation Timothy Finin and Gary Morris, "Abductive Reasoning in Multiple Fault Diagnosis", . June 1988. University of Pennsylvania Department of Computer and Information Science Technical Report No. MS-CIS-88-61. This paper is posted at ScholarlyCommons. https://repository.upenn.edu/cis_reports/693 For more information, please contact [email protected]. Abductive Reasoning in Multiple Fault Diagnosis Abstract Abductive reasoning involves generating an explanation for a given set of observations about the world. Abduction provides a good reasoning framework for many AI problems, including diagnosis, plan recognition and learning. This paper focuses on the use of abductive reasoning in diagnostic systems in which there may be more than one underlying cause for the observed symptoms. In exploring this topic, we will review and compare several different approaches, including Binary Choice Bayesian, Sequential Bayesian, Causal Model Based Abduction, Parsimonious Set Covering, and the use of First Order Logic. Throughout the paper we will use as an example a simple diagnostic problem involving automotive
    [Show full text]
  • Charles Sanders Peirce and Karl Popper
    A Brief History of Scientific Thoughts Lecture 4 Palash Sarkar Applied Statistics Unit Indian Statistical Institute, Kolkata India [email protected] Palash Sarkar (ISI, Kolkata) Thoughts on Science 1 / 20 David Hume (1711–1776) on Induction Deduction cannot justify rules for induction. Circularity: The principles for induction are justified by induction. So, induction cannot be justified. No certainty that the future will resemble the past. Inductive reasoning cannot conclusively establish that the Sun will continue to rise in the East. Practical skepticism: we have to rely on induction even if it cannot be justified. Palash Sarkar (ISI, Kolkata) Thoughts on Science 2 / 20 John Stuart Mill on Induction Mathematical truths are highly confirmed generalisations from experience. Knowledge of any kind is not from direct experience, but, an inductive experience from direct experience. An extreme position on the “competitive” roles of inductive and deductive inferences. Syllogistic logic adds nothing to our knowledge. Its rules merely reflect our determination to reason consistently with the ways in which we have reasoned in the past. All ampliative (i.e., which increase knowledge) inferences are inductive. Palash Sarkar (ISI, Kolkata) Thoughts on Science 3 / 20 Charles Sanders Peirce (1839–1914) http://en.wikipedia.org/wiki/Charles_Sanders_Peirce Contributions. Abductive reasoning (as a logic of pragmatism). Placed inductive reasoning and deductive reasoning in a complementary rather than competitive mode. Doctrine of fallibilism. Approach to the scientific method. The objects of knowledge are real things. The properties of real things do not depend on our perceptions of them. Everyone who has sufficient experience of real things will agree on the truth about them.
    [Show full text]
  • Modeling As Scientific Reasoning—The Role of Abductive
    education sciences Article Modeling as Scientific Reasoning—The Role of Abductive Reasoning for Modeling Competence Annette Upmeier zu Belzen 1,* , Paul Engelschalt 1 and Dirk Krüger 2 1 Biology Education, Humboldt-Universität zu Berlin, 10099 Berlin, Germany; [email protected] 2 Biology Education, Freie Universität Berlin, 14195 Berlin, Germany; [email protected] * Correspondence: [email protected] Abstract: While the hypothetico-deductive approach, which includes inductive and deductive rea- soning, is largely recognized in scientific reasoning, there is not much focus on abductive reasoning. Abductive reasoning describes the theory-based attempt of explaining a phenomenon by a cause. By integrating abductive reasoning into a framework for modeling competence, we strengthen the idea of modeling being a key practice of science. The framework for modeling competence theoret- ically describes competence levels structuring the modeling process into model construction and model application. The aim of this theoretical paper is to extend the framework for modeling com- petence by including abductive reasoning, with impact on the whole modeling process. Abductive reasoning can be understood as knowledge expanding in the process of model construction. In com- bination with deductive reasoning in model application, such inferences might enrich modeling processes. Abductive reasoning to explain a phenomenon from the best fitting guess is important for model construction and may foster the deduction of hypotheses from the model and further testing them empirically. Recent studies and examples of learners’ performance in modeling processes support abductive reasoning being a part of modeling competence within scientific reasoning. The ex- Citation: Upmeier zu Belzen, A.; tended framework can be used for teaching and learning to foster scientific reasoning competences Engelschalt, P.; Krüger, D.
    [Show full text]
  • Abductive Reasoning As a Way of Worldmaking
    Foundations of Science, special issue on "The Impact of Radical Constructivism on Science", edited by A. Riegler, 2001, vol. 6, no.4: 361–383. Abductive Reasoning as a Way of Worldmaking Hans Rudi Fischer Heidelberger Institut für systemische Forschung und Therapie Kussmaulstr. 10 D-69120 Heidelberg, Germany The Gods have certainty, whereas to us as men conjecture [only is possible]..” Alcmaion von Kroton (Diels /Kranz, Vol. I,p. 214) "...the true method of philosophical construction is to frame a scheme of ideas , the best that one can, and unflinchingly to explore the interpretation of experience in terms of that scheme." (Alfred North Whitehead 1978, p.xiv) "...man's truth is never absolute because the basis of fact is hypothesis." (Ch. S. Peirce, Writings vol. I, p. 7.) Abstract The author deals with the operational core of logic, i.e. its diverse procedures of inference, in order to show that logically false inferences may in fact be right because - in contrast to logical rationality - they actually enlarge our knowledge of the world. This does not only mean that logically true inferences say nothing about the world, but also that all our inferences are invented hypotheses the adequacy of which cannot be proved within logic but only pragmatically. In conclusion the author demonstrates, through the relationship between rule-following and rationality, that it is most irrational to want to exclude the irrational: it may, at times, be most rational to think and infer irrationally. Focussing on the operational aspects of knowing as inferring does away with the hiatus between logic and life, cognition and the world (reality) - or whatever other dualism one wants to invoke -: knowing means inferring, inferring means rule-governed interpreting, interpreting is a constructive, synthetic act, and a construction that proves adequate (viable) in the “world of experience”, in life, in the praxis of living, is, to the constructivist mind, knowledge.
    [Show full text]
  • Abductive Reasoning in Mathematics Education: Approaches to and Theorisations of a Complex Idea
    EURASIA Journal of Mathematics, Science and Technology Education, 2018, 14(9), em1584 ISSN:1305-8223 (online) OPEN ACCESS Research Paper https://doi.org/10.29333/ejmste/92552 Abductive Reasoning in Mathematics Education: Approaches to and Theorisations of a Complex Idea David A Reid 1* 1 University of Bremen, GERMANY Received 28 March 2018 ▪ Revised 18 May 2018 ▪ Accepted 25 May 2018 ABSTRACT There is a growing literature on the importance of abductive reasoning in mathematics education. However, there are some important variations in what exactly is referred to as ‘abductive reasoning’. This article identifies key theorisations of abductive reasoning in the work of Peirce, Habermas and Eco, as well as describing different approaches to abductive reasoning found the mathematics education literature. A framework in which the different approaches taken in the research literature can be placed and compared is proposed. Keywords: abductive reasoning, conjecturing, function of reasoning, logical form, proof and proving INTRODUCTION There is a growing literature on the importance of abductive reasoning in mathematics education (e.g., Cifarelli, 1999; Krummheuer, 2007; Pedemonte, 2007; Pedemonte & Reid 2011; Rivera & Becker, 2007; Weber, Maher, Powell, & Lee, 2008). However, there are some important variations in what exactly is referred to as ‘abductive reasoning’. This chapter will explore the origins of the concept of abductive reasoning, identify the most significant approaches in mathematics education literature that refers to abductive reasoning, and propose a framework in which the different approaches taken in the research literature can be placed and compared. THEORISATIONS OF ABDUCTIVE REASONING The current usages of the words “abduction” and “abductive” (in logic) are due to Charles Saunders Peirce.
    [Show full text]
  • Abduction and Complexity
    Which is the least complex explanation? Abduction and complexity∗ Fernando Soler-Toscano Grupo de L´ogica, Lenguaje e Informaci´on University of Seville [email protected] 1 How an abductive problem arises? What is abductive reasoning and when is it used? First, it is a kind of inference. In a broad sense, a logical inference is any operation that, starting with some information, allows us to obtain some other information. People are continually doing inference. For example, I cannot remember which of the two keys in my pocket opens the door of my house. I try with the first one but it does not open. So I conclude that it should be the other. In this case, the inference starts with some data (premises): one of the two keys opens the door, but the first I tried did not open. I reach some new information (conclusion): it is the second key. Inference (or reasoning) does not always follow the same way. In the ex- ample, if I know that one of the keys opens my house and I cannot open with the first key, it is necessary that the second opens. When this is the case (that is, the conclusion follows necessarily from the premises), then we are facing a deductive inference. Given that the conclusion is a necessary consequence of the premises, there is no doubt that the conclusion is true, whenever premises are all true: the door must be opened with the second key. arXiv:1902.05479v1 [math.LO] 14 Feb 2019 But there are many contexts in which we cannot apply deductive reasoning.
    [Show full text]
  • Abductive Commonsense Reasoning
    Published as a conference paper at ICLR 2020 ABDUCTIVE COMMONSENSE REASONING Chandra Bhagavatula}, Ronan Le Bras}, Chaitanya Malaviya}, Keisuke Sakaguchi}, Ari Holtzman}, Hannah Rashkin}, Doug Downey}, Scott Wen-tau Yih|, Yejin Choi}~ }Allen Institute for AI, Seattle, WA, USA, |Facebook AI, Seattle, WA, USA ~Paul G. Allen School of Computer Science & Engineering, WA, USA fchandrab,ronanlb,chaitanyam,[email protected] farih,hannahr,[email protected] [email protected] [email protected]∗ ABSTRACT Abductive reasoning is inference to the most plausible explanation. For example, if Jenny finds her house in a mess when she returns from work, and remembers that she left a window open, she can hypothesize that a thief broke into her house and caused the mess, as the most plausible explanation. While abduction has long been considered to be at the core of how people interpret and read between the lines in natural language (Hobbs et al., 1988), there has been relatively little research in support of abductive natural language inference and generation. We present the first study that investigates the viability of language-based abduc- tive reasoning. We introduce a challenge dataset, ART, that consists of over 20k commonsense narrative contexts and 200k explanations. Based on this dataset, we conceptualize two new tasks – (i) Abductive NLI: a multiple-choice question an- swering task for choosing the more likely explanation, and (ii) Abductive NLG: a conditional generation task for explaining given observations in natural language. On Abductive NLI, the best model achieves 68.9% accuracy, well below human performance of 91.4%. On Abductive NLG, the current best language generators struggle even more, as they lack reasoning capabilities that are trivial for humans.
    [Show full text]
  • Abductive Reasoning and Similarity: Some Computational Tools
    To appear as: Schvaneveldt, R. W, & Cohen, T. A. (in press). Abductive reasoning and similarity: Some computational tools. In D. Ifenthaler, P. Pirnay-Dummer, & N. M. Seel (Eds.), Computer based diagnostics and systematic analysis of knowledge. New York: Springer. Chapter 11 ABDUCTIVE REASONING AND SIMILARITY Some Computational Tools Roger W. Schvaneveldt & Trevor A. Cohen Arizona State University University of Texas at Houston Abstract Abductive reasoning includes discovering new hypotheses or explanations. This chapter identifies several factors involved in abductive reasoning in an effort to move toward a theory of such reasoning. The chapter has an ultimate focus on the nature and influence of similarity. A major goal of our work is to develop computational tools that provide intelligent abductive suggestions to people engaged in discovering new knowledge. Novel abductive inferences often exhibit interesting similarities to the phenomena under investigation. Such similarities are not strong or direct but rather are often only obvious once the inference has been drawn. Some of our research is directed at discovering indirect similarities from text by using measures that are sensitive to indirect relations between terms in the text. By focusing on terms that are related but do not co-occur, potentially interesting indirect relations can be explored. Our work employs Random Indexing methods and Pathfinder networks to identify and display relations among terms in a text corpus. These displays are provided to individuals to suggest possible abductive inferences. We explore a variety of methods for identifying indirect similarities. The degree to which surprising and interesting inferences are suggested is the primary measure of success.
    [Show full text]