A Multi-Level Neural Network for Implicit Causality Detection in Web Texts

Total Page:16

File Type:pdf, Size:1020Kb

A Multi-Level Neural Network for Implicit Causality Detection in Web Texts A Multi-level Neural Network for Implicit Causality Detection in Web Texts Shining Lianga,b, Wanli Zuoa,b, Zhenkun Shib,c, Sen Wangd, Junhu Wange, Xianglin Zuoa,b aKey Laboratory of Symbol Computation and Knowledge Engineering of Ministry of Education, Changchun, China bCollege of Computer Science and Technology, Jilin University, Jilin, China cTianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences dSchool of Information Technology and Electrical Engineering, The University of Queensland, Brisbane, Australia eSchool of Information and Communication Technology, Griffith University, Queensland, Australia Abstract Mining causality from text is a complex and crucial natural language un- derstanding task corresponding to the human cognition. Existing studies at its solution can be grouped into two primary categories: feature engineer- ing based and neural model based methods. In this paper, we find that the former has incomplete coverage and inherent errors but provide prior knowl- edge; while the latter leverages context information but causal inference of which is insufficiency. To handle the limitations, we propose a novel causality detection model named MCDN to explicitly model causal reasoning process, and furthermore, to exploit the advantages of both methods. Specifically, we adopt multi-head self-attention to acquire semantic feature at word level and develop the SCRN to infer causality at segment level. To the best of our knowledge, with regards to the causality tasks, this is the first time that arXiv:1908.07822v3 [cs.CL] 27 May 2021 the Relation Network is applied. The experimental results show that: i) the proposed approach performs prominent performance on causality detection; ii) further analysis manifests the effectiveness and robustness of MCDN. Keywords: Causality Detection, Multi-level Neural Network, Relation Network, Transformer Preprint submitted to Neurocomputing May 28, 2021 1. Introduction Automatic text causality mining is a critical but difficult task because causality is thought to play an essential role in human cognition when making decisions [1]. Thus, automatic text causality has been studied extensively in a wide range of areas, such as medical [2], question answering [3] and event prediction [4], etc. A tool automatically extracts meaningful causal relations could help us construct causality graphs [5] to unveil previously unknown relationships between events and accelerate the discovery of the intrinsic logic of the events [6]. Many research efforts have been made to mine causality from text corpus with complex sentence structures in books or newspapers [7, 8, 9]. However, the scale of textual data in the world, e.g., on the web, is much larger than that in books and newspapers. Despite the success of existing studies on extracting explicit causality, there are two reasons why most cannot be di- rectly applied to causality mining on the web text where a large number of implicit causality cases exist. First, most public available causality mining datasets are collected from books and newspapers. The language expressions of them are usually formal but lack diversity than the web text. Second, the existing works mainly focus on explicit causal relations expressed by intra- sentence or inter-sentence connectives, without considering ambiguous and implicit cases. As is well known that implicit causality always has a simple sentence structure without any connectives as below. In Example 1, “got wet” is the cause of “fever” and there are no connectives available for infer- ence. Contrastively in Example 2, there are explicit connectives (i.e. “since” and “result”) benefiting the causality detection. • Example 1: I got wet during the day and came home with a fever at night. • Example 2: Since computers merely execute the instructions they are given, bugs are nearly always the result of programmer error or an oversight made in the program’s design. Consequently, it would make the perception of causality incomplete if we ignore those implicit ones. There is a huge demand to investigate an approach for mining both explicit and implicit causality from the web text. In this paper, we formulate causality mining as two sub-sequential tasks: causality detection [4, 10, 11] and cause-effect pair extraction [12, 13], which is 2 Table 1: Examples of ambiguous AltLexes in the parallel data. Label English Wikipedia Simple Wikipedia Causal A moving observer thus sees the A moving observer thus sees the light coming from a slightly dif- light coming from a slightly dif- ferent direction and consequently ferent direction and consequently sees the source at a position sees the source at a position shifted from its original position. shifted from its original position. Non-causal His studies were interrupted by However, he had to stop study- World War I, and consequently ing because of the World War I, taught at schools in Voronezh and instead taught at schools in and Saratov. Voronezh and Saratov. also investigated in SemEval-2020 Task5 [14] and FinCausal 2020 [15]. When dealing with large-scale web text, detecting causalities is a foundation system before extracting cause-effect pairs. It could help build a high quality corpus with diverse linguistic characteristics for causal pairs extraction, which leads to lower annotation cost and less model complexity in downstream tasks. In recent years, Hidey and McKeown [10] utilize the “AltLex” (Alternative lex- icalization) to build a large open-domain causality detection dataset based on parallel Wikipedia articles especially for ambiguous and implicit cases as shown in Table 1. Most existing works on the causality detection task fall into two categories: i) feature engineering based methods, widely apply lin- guistic features (part-of-speech (POS) tagging, dependency parsing) [4, 10] and statistic features (template, co-occurrence) [11]. There exists thousands of AltLexes among which may appear in both causal and non-causal cases as “consequently” in Table 1 in fact. However, the complicated features hardly capture the subtle discrepancies of various causality expressions and the in- herent errors of natural language processing (NLP) tools will be accumulated and propagated; ii) neural model based methods, which have achieved promi- nent results with the end-to-end paradigm, is prevalent for design and usage. We conduct empirical study to asses the application of neural text classi- fiers on causality detection. The performance of most neural model based methods fall behind feature engineering based methods (in Table 3). The reason is encapsulated as they mainly focus on the interactions of the words 3 and treat the sentence as a whole but lack explicit inference for causality within sentences. Recently, pre-trained language models (PLMs) [16, 17] de- velop dramatically and even exceed human performance on many NLP tasks. Nevertheless, when it comes to large-scale web text data, the memory and time consumption is very considerable. BL L AL he has subsequently written a further nine plays BL-L L-AL B-A A-B cause/effect-connective cause-effect interaction interaction Figure 1: An example for different segments within an sentence where “subsequently” is the AltLex word. Faced with the above problems, we propose the Multi-level Causality De- tection Network (MCDN) for causality detection in web texts based on the following observations: i) neural network-based methods can reduce the labor cost and inherent errors of feature engineering based methods, whereas com- bining the prior knowledge of the latter benefits the former [18]; ii) causality reasoning is a high-level ability of human [1], which calls for multi-level anal- ysis of the model. MCDN modifies a Transformer Encoder module to obtain semantic representation at word level and integrates a novel Self Causal Re- lation Network (SCRN) module at segment level inferring causality via the segments on both sides of the connectives. Moreover, we argue that integrat- ing multi-level knowledge could facilitate the token level feature overfitting proposed in [14]. Specifically, MCDN splits the sentence into three segments on the ground of “segment before AltLex” (BL), “AltLex” (L) and “segment after AltLex” (AL) as shown in Figure 1. Intuitively, the cause and effect part usually exist on both sides on the AltLex. This simple prior feature minimizes the impact of feature engineering complexity and errors. Motivated by explic- itly modeling the causal reasoning process, the SCRN module encodes the segments and aggregates them into pair-wise groups that are concatenated 4 with a sentence representation in the meantime. The interaction between the cause/effect part with the connective, BL-L and L-AL, describes if each segment conveys causation depending on the current AltLex. And by the interaction among causal segments, B-A and A-B, SCRN directly infers the potential causality when they’re coupled in the context. Above information comprises the segment-level representation. Next, we utilize Transformer architecture [19] at word level. To keep the framework fast and light un- der the large-scale web text scenario, the heads and blocks of Transformer Encoder module are clipped consequently don’t use the pre-trained weights. Furthermore, we extend the segment embedding to adopt multi-segments in the input. With this end-to-end module, MCDN combines local context and long-distance dependency to obtain a word-level representation. Finally, we perform detection with word-level and segment-level representations. In general, the contributions
Recommended publications
  • Philosophy of Science and Philosophy of Chemistry
    Philosophy of Science and Philosophy of Chemistry Jaap van Brakel Abstract: In this paper I assess the relation between philosophy of chemistry and (general) philosophy of science, focusing on those themes in the philoso- phy of chemistry that may bring about major revisions or extensions of cur- rent philosophy of science. Three themes can claim to make a unique contri- bution to philosophy of science: first, the variety of materials in the (natural and artificial) world; second, extending the world by making new stuff; and, third, specific features of the relations between chemistry and physics. Keywords : philosophy of science, philosophy of chemistry, interdiscourse relations, making stuff, variety of substances . 1. Introduction Chemistry is unique and distinguishes itself from all other sciences, with respect to three broad issues: • A (variety of) stuff perspective, requiring conceptual analysis of the notion of stuff or material (Sections 4 and 5). • A making stuff perspective: the transformation of stuff by chemical reaction or phase transition (Section 6). • The pivotal role of the relations between chemistry and physics in connection with the question how everything fits together (Section 7). All themes in the philosophy of chemistry can be classified in one of these three clusters or make contributions to general philosophy of science that, as yet , are not particularly different from similar contributions from other sci- ences (Section 3). I do not exclude the possibility of there being more than three clusters of philosophical issues unique to philosophy of chemistry, but I am not aware of any as yet. Moreover, highlighting the issues discussed in Sections 5-7 does not mean that issues reviewed in Section 3 are less im- portant in revising the philosophy of science.
    [Show full text]
  • Aristotle's Anticommunism Author(S): Darrell Dobbs Source: American Journal of Political Science, Vol
    Aristotle's Anticommunism Author(s): Darrell Dobbs Source: American Journal of Political Science, Vol. 29, No. 1 (Feb., 1985), pp. 29-46 Published by: Midwest Political Science Association Stable URL: http://www.jstor.org/stable/2111210 Accessed: 10/12/2010 23:50 Your use of the JSTOR archive indicates your acceptance of JSTOR's Terms and Conditions of Use, available at http://www.jstor.org/page/info/about/policies/terms.jsp. JSTOR's Terms and Conditions of Use provides, in part, that unless you have obtained prior permission, you may not download an entire issue of a journal or multiple copies of articles, and you may use content in the JSTOR archive only for your personal, non-commercial use. Please contact the publisher regarding any further use of this work. Publisher contact information may be obtained at http://www.jstor.org/action/showPublisher?publisherCode=mpsa. Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission. JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms of scholarship. For more information about JSTOR, please contact [email protected]. Midwest Political Science Association is collaborating with JSTOR to digitize, preserve and extend access to American Journal of Political Science. http://www.jstor.org Aristotle'sAnticommunism DarrellDobbs, Universityof Houston This essayexamines Aristotle's critical review of Plato's Republic,the focus of whichreview is restricted,surprisingly, to Socrates'communistic political institutions; Aristotle hardly men- tionsany of theother important themes developed in thedialogue.
    [Show full text]
  • Introduction: the New Metaphysics of Time Over the Last Several Years
    History and Theory, Virtual Issue 1 (August 2012) © Wesleyan University 2012 ISSN: 1468-2303 INTRODUCTION: THE NEW METAPHYSICS OF TIME ETHAN KLEINBERG Over the last several years, the editors of History and Theory have tracked a growing trend evident not only in the articles published in our pages but also in manuscripts submitted to the journal as well as articles published in other venues. In the broadest strokes, this trend can be described as a response to the perceived limitations of a discourse predicated on language, rhetoric, and the question of representation. Whether one subscribes to the idea that historians and theorists of history took a “linguistic turn” in the 1980s,1 an increasing number of articles and monographs have taken issue with the general influence of French “post- structuralism” or “postmodernism,” with constructivism, and specifically with the work of Hayden White. These works seek to address the perceived faults of an overemphasis on the issues of “language” and representation that has obscured or misdirected the goal of actually addressing the past by perpetually worrying over how we might go about that task. Combined with the growing interest in material culture, the ways of science, and the nature of the body, it is hard to doubt the return to the real. But what strikes us as most interesting about this trend is the way that some of these theorists have sought to move beyond the emphasis on language and representation not by returning to a crude variant of objectivism or empiricism but by re-examining our relationship to the past and the past’s very nature and by attempting to construct a new metaphysics of time.
    [Show full text]
  • Kant's Doctrine of Schemata
    Kant’s Doctrine of Schemata By Joseph L. Hunter Thesis submitted to the faculty of Virginia Polytechnic Institute and State University in partial fulfillment of the requirements for the degree of MASTERS OF ARTS IN PHILOSOPHY APPROVED: _______________________________ Eric Watkins, Chair _______________________________ _______________________________ Roger Ariew Joseph C. Pitt August 25, 1999 Blacksburg, Virginia Keywords: Kant, Schemata, Experience, Knowledge, Categories, Construction, Mathematics. i Kant’s Doctrine of Shemata Joseph L. Hunter (ABSTRACT) The following is a study of what may be the most puzzling and yet, at the same time, most significant aspect of Kant’s system: his theory of schemata. I will argue that Kant’s commentators have failed to make sense of this aspect of Kant’s philosophy. A host of questions have been left unanswered, and the doctrine remains a puzzle. While this study is not an attempt to construct a complete, satisfying account of the doctrine, it should be seen as a step somewhere on the road of doing so, leaving much work to be done. I will contend that one way that we may shed light on Kant’s doctrine of schemata is to reconsider the manner in which Kant employs schemata in his mathematics. His use of the schemata there may provide some inkling into the nature of transcendental schemata and, in doing so, provide some hints at how the transcendental schemata allow our representations of objects to be subsumed under the pure concepts of the understanding. In many ways, then, the aims of the study are modest: instead of a grand- scale interpretation of Kant's philosophy, a detailed textual analysis and interpretation are presented of his doctrine of schemata.
    [Show full text]
  • Peirce, Pragmatism, and the Right Way of Thinking
    SANDIA REPORT SAND2011-5583 Unlimited Release Printed August 2011 Peirce, Pragmatism, and The Right Way of Thinking Philip L. Campbell Prepared by Sandia National Laboratories Albuquerque, New Mexico 87185 and Livermore, California 94550 Sandia National Laboratories is a multi-program laboratory managed and operated by Sandia Corporation, a wholly owned subsidiary of Lockheed Martin Corporation, for the U.S. Department of Energy’s National Nuclear Security Administration under Contract DE-AC04-94AL85000.. Approved for public release; further dissemination unlimited. Issued by Sandia National Laboratories, operated for the United States Department of Energy by Sandia Corporation. NOTICE: This report was prepared as an account of work sponsored by an agency of the United States Government. Neither the United States Government, nor any agency thereof, nor any of their employees, nor any of their contractors, subcontractors, or their employees, make any warranty, express or implied, or assume any legal liability or responsibility for the accuracy, completeness, or usefulness of any information, apparatus, product, or process disclosed, or represent that its use would not infringe privately owned rights. Reference herein to any specific commercial product, process, or service by trade name, trademark, manufacturer, or otherwise, does not necessarily con- stitute or imply its endorsement, recommendation, or favoring by the United States Government, any agency thereof, or any of their contractors or subcontractors. The views and opinions expressed herein do not necessarily state or reflect those of the United States Government, any agency thereof, or any of their contractors. Printed in the United States of America. This report has been reproduced directly from the best available copy.
    [Show full text]
  • 5. Immanuel Kant and Critical Idealism Robert L
    Contemporary Civilization (Ideas and Institutions Section XII: The osP t-Enlightenment Period of Western Man) 1958 5. Immanuel Kant and Critical Idealism Robert L. Bloom Gettysburg College Basil L. Crapster Gettysburg College Harold A. Dunkelberger Gettysburg College See next page for additional authors Follow this and additional works at: https://cupola.gettysburg.edu/contemporary_sec12 Part of the European Languages and Societies Commons, History Commons, and the Philosophy Commons Share feedback about the accessibility of this item. Bloom, Robert L. et al. "5. Immanuel Kant and Critical Idealism. Pt XII: The osP t-Enlightenment Period." Ideas and Institutions of Western Man (Gettysburg College, 1958), 53-69. This is the publisher's version of the work. This publication appears in Gettysburg College's institutional repository by permission of the copyright owner for personal use, not for redistribution. Cupola permanent link: https://cupola.gettysburg.edu/ contemporary_sec12/5 This open access book chapter is brought to you by The uC pola: Scholarship at Gettysburg College. It has been accepted for inclusion by an authorized administrator of The uC pola. For more information, please contact [email protected]. 5. Immanuel Kant and Critical Idealism Abstract The ideas of Immanuel Kant (1724-1804) are significant enough to be compared to a watershed in Western thought. In his mind were gathered up the major interests of the Enlightenment: science, epistemology, and ethics; and all of these were given a new direction which he himself described as another Copernican revolution. As Copernicus had shown that the earth revolved around the sun, rather than the sun around the earth, so Kant showed that the knowing subject played an active and creative role in the production of his world picture, rather than the static and passive role which the early Enlightenment had assigned him.
    [Show full text]
  • Causality in the Mind: Estimating Contextual and Conjunctive Power
    This excerpt from Explanation and Cognition. Frank C. Keil and Robert A. Wilson, editors. © 2000 The MIT Press. is provided in screen-viewable form for personal use only by members of MIT CogNet. Unauthorized use or dissemination of this information is expressly forbidden. If you have any questions about this material, please contact [email protected]. 9 Causality in the Mind: Estimating Contextual and Conjunctive Causal Power Patricia W. Cheng I would like to argue that humans, and perhaps all species capable of flex- ibly adaptive goal-directed actions, are born with a conviction that they use for inferring cause-and-effect relations in the world.This conviction— that entities and events may have causal powers with respect to other enti- ties or events—provides a simple framework that enables reasoners to incrementally construct a picture of causal relations in a complex world. In the reasoner’s mind, causal powers are invariant properties of relations that allow the prediction of the consequences of actions regardless of the context in which the action occurs, with “context” being the background causes of an effect (those other than the candidate causes) that happen to occur in a situation. The reasoner’s goal is to infer these powers. The causal power scheme, however, is coherent only if the inferred powers can be tested in contexts other than the one in which they are inferred. When predictions based on simple causal powers fail, reasoners may be motivated to evaluate conjunctive causal power. All conventional statistical measures of independence, none of which is based on a causal power analysis, contradict measures of conjunctive causal power (Novick and Cheng 1999).
    [Show full text]
  • PHIL 3302: Philosophy of Aristotle (3 Credits)
    Date prepared: 8/17/14 The University of New Orleans Dept. of Philosophy PHIL 3302: Philosophy of Aristotle (3 credits) SECTION 001: LA 194, Thursday, 4:30 p.m – 7:15 p.m. Instructor: Dr. Surprenant Office: UNO: LA 387 Office Hours: (office) M: 11-12pm, Th: 1-4pm (and by appointment) (Skype, csurprenant) Tues: 12-2pm (and by appointment) Office Phone: (504) 280-6818 Contact Contact Information Email: [email protected] Course Webpages: Accessed via Moodle. Aristotle: Nicomachean Ethics, ed. Crisp (Cambridge, 2000). ISBN: 0521635462 Aristotle: The Politics and Constitution of Athens, ed. Everson (Oxford, 1996). ISBN: 0521484006 Text Required Required CATALOG DESCRIPTION: Aristotle's ideas are examined through careful analysis of his main works with emphasis on his criticisms of the basic theories of his teacher, Plato, and Aristotle's influence on subsequent Western Philosophy, literature, and science. COURSE OVERVIEW: This course is an advanced study of Aristotle’s moral and political philosophy. We will examine Aristotle’s answers to such questions as: May the state legitimately use the law to impose a certain conception of morality on its citizens? Or must the state aim, rather, to remain neutral when its citizens disagree strongly about the best way of life, protecting its citizens’ freedom to choose their own visions of the good life? A primary focus throughout the entire semester will be on learning how to read, understand, Course Description Course evaluate, and construct philosophical arguments, and then engaging with contemporary scholars about these arguments and ideas. In this way, this course will also serve as an introduction to being a philosopher.
    [Show full text]
  • Criminal Responsibility and Causal Determinism J
    Washington University Jurisprudence Review Volume 9 | Issue 1 2016 Criminal Responsibility and Causal Determinism J. G. Moore Follow this and additional works at: https://openscholarship.wustl.edu/law_jurisprudence Part of the Criminal Law Commons, Jurisprudence Commons, Legal History Commons, Legal Theory Commons, Philosophy Commons, and the Rule of Law Commons This document is a corrected version of the article originally published in print. To access the original version, follow the link in the Recommended Citation then download the file listed under “Previous Versions." Recommended Citation J. G. Moore, Criminal Responsibility and Causal Determinism, 9 Wash. U. Jur. Rev. 043 (2016, corrected 2016). Available at: http://openscholarship.wustl.edu/law_jurisprudence/vol9/iss1/6 This Article is brought to you for free and open access by the Law School at Washington University Open Scholarship. It has been accepted for inclusion in Washington University Jurisprudence Review by an authorized administrator of Washington University Open Scholarship. For more information, please contact [email protected]. Criminal Responsibility and Causal Determinism Cover Page Footnote This document is a corrected version of the article originally published in print. To access the original version, follow the link in the Recommended Citation then download the file listed under “Previous Versions." This article is available in Washington University Jurisprudence Review: https://openscholarship.wustl.edu/law_jurisprudence/vol9/ iss1/6 CRIMINAL RESPONSIBILITY AND CAUSAL DETERMINISM: CORRECTED VERSION J G MOORE* INTRODUCTION In analytical jurisprudence, determinism has long been seen as a threat to free will, and free will has been considered necessary for criminal responsibility.1 Accordingly, Oliver Wendell Holmes held that if an offender were hereditarily or environmentally determined to offend, then her free will would be reduced, and her responsibility for criminal acts would be correspondingly diminished.2 In this respect, Holmes followed his father, Dr.
    [Show full text]
  • Additionality: a Useful Way to Construct the Counterfactual Qualitatively?
    REFEREED ARTICLE Evaluation Journal of Australasia, Vol. 10, No. 1, 2010, pp. 28–35 Additionality: a useful way to construct the counterfactual qualitatively? Abstract Julie Hind One concern about impact evaluation is whether a program has made a difference (Owen 2006). Essentially, in trying to understand this, an evaluation seeks to establish causality—the causal link between the social program and the outcomes (Mohr 1999; Rossi & Freeman 1993). However, attributing causality to social programs is difficult because of the inherent complexity and the many and varied factors at play (House 2001; Mayne 2001; Pawson & Tilley 1998; White 2010). Evaluators can choose from a number of theories and methods to help address causality (Davidson 2000). Measuring the counterfactual—the difference between actual outcomes and what would have occurred without Julie Hind is Director of Evolving the intervention—has been at the heart of traditional Ways, Albury–Wodonga. Email: impact evaluations. While this has traditionally been <[email protected]> measured using experimental and quasi-experimental methods, a counterfactual does not always need a This article was written towards comparison group (White 2010) and can be constructed completion of a Master of Assessment qualitatively (Cummings 2006). and Evaluation degree at the With these in mind, this article explores the usefulness of University of Melbourne. the concept of additionality, a mixed-methods framework developed by Buisseret et al. (cited in Georghiou 2002) as a means of evaluative comparison of the counterfactual. Introduction Over recent decades, there has been a move away from an emphasis on procedural matters within government agencies towards a focus on achievements (Mayne 2001).
    [Show full text]
  • Learning a Theory of Causality
    Learning a Theory of Causality Noah D. Goodman, Tomer D. Ullman, Joshua B. Tenenbaum ndg, tomeru, jbt @mit.edu MIT, Dept.f of Brain and Cognitiveg Sciences The very early appearance of abstract knowledge is often taken as evidence for innateness. We explore the relative learning speeds of abstract and specific knowledge within a Bayesian framework, and the role for innate structure. We focus on knowledge about causality, seen as a domain-general intuitive theory, and ask whether this knowledge can be learned from co- occurrence of events. We begin by phrasing the causal Bayes nets theory of causality, and a range of alternatives, in a logical language for relational theories. This allows us to explore simultaneous inductive learning of an abstract theory of causality and a causal model for each of several causal systems. We find that the correct theory of causality can be learned relatively quickly, often becoming available before specific causal theories have been learned—an effect we term the blessing of abstraction. We then explore the effect of providing a variety of auxiliary evidence, and find that a collection of simple “perceptual input analyzers” can help to bootstrap abstract knowledge. Together these results suggest that the most efficient route to causal knowledge may be to build in not an abstract notion of causality, but a powerful inductive learning mechanism and a variety of perceptual supports. While these results are purely computational, they have implications for cognitive development, which we explore in the conclusion. Pre-print June 2010—to appear in Psych. Review. essary building block in models of cognitive development or because it was not clear how such abstract knowledge could Introduction be learned.
    [Show full text]
  • Elements of Causal Inference
    Elements of Causal Inference Foundations and Learning Algorithms Adaptive Computation and Machine Learning Francis Bach, Editor Christopher Bishop, David Heckerman, Michael Jordan, and Michael Kearns, As- sociate Editors A complete list of books published in The Adaptive Computation and Machine Learning series appears at the back of this book. Elements of Causal Inference Foundations and Learning Algorithms Jonas Peters, Dominik Janzing, and Bernhard Scholkopf¨ The MIT Press Cambridge, Massachusetts London, England c 2017 Massachusetts Institute of Technology This work is licensed to the public under a Creative Commons Attribution- Non- Commercial-NoDerivatives 4.0 license (international): http://creativecommons.org/licenses/by-nc-nd/4.0/ All rights reserved except as licensed pursuant to the Creative Commons license identified above. Any reproduction or other use not licensed as above, by any electronic or mechanical means (including but not limited to photocopying, public distribution, online display, and digital information storage and retrieval) requires permission in writing from the publisher. This book was set in LaTeX by the authors. Printed and bound in the United States of America. Library of Congress Cataloging-in-Publication Data Names: Peters, Jonas. j Janzing, Dominik. j Scholkopf,¨ Bernhard. Title: Elements of causal inference : foundations and learning algorithms / Jonas Peters, Dominik Janzing, and Bernhard Scholkopf.¨ Description: Cambridge, MA : MIT Press, 2017. j Series: Adaptive computation and machine learning series
    [Show full text]