Causal Inference in Randomized and Non-Randomized Studies: the Definition, Identification, and Estimation of Causal Parameters

Total Page:16

File Type:pdf, Size:1020Kb

Causal Inference in Randomized and Non-Randomized Studies: the Definition, Identification, and Estimation of Causal Parameters 1 Causal Inference in Randomized and Non-Randomized Studies: The Definition, Identification, and Estimation of Causal Parameters Michael E. Sobel INTRODUCTION between the illusory and non-illusory correlations, Yule invented partial correlation The distinction between causation and associ- to ‘control’ for the influence of a common ation has figured prominently in science and factor, arguing in context that because the philosophy for several hundred years at least, relationship between pauperism and out and, more recently, in statistical science as relief did not vanish when ‘controlling’ for well, indeed, since Galton, Pearson and, Yule poverty, this relationship could be deemed developed the theory of correlation. causal. A half century later, philosophers, Statisticians have pioneered two psychologists and social scientists (e.g., approaches to causal inference that have Reichenbach, 1956; Simon 1954; Suppes proven influential in the natural and 1970) rediscovered Yule’s approach to behavioral sciences. The oldest dates back distinguishing between causal and non-causal to Yule (1896), who wrote extensively about relationships, and econometricians (e.g., ‘illusory’ correlations, by which he meant Granger 1969) extended this idea to the correlations that should not be endowed time-series setting. Graphical models, path with a causal interpretation. To distinguish analysis and, more generally, structural 4 DESIGN AND INFERENCE equation models, when these methods are section, ‘Estimation of causal parameters in used to make causal inferences, also rely randomized studies’, discusses estimation. on this type of reasoning. The theory of The section ‘Mediation analyses’ takes up the experimental design, which emerges in topic of mediation, which is of special interest the 1920s and thereafter, and is associated to psychologists and prevention scientists. especially with Neyman (1923) and Fisher I show that the usual approach to mediation, (1925), forms the basis for a second approach which uses structural equation modeling, does to inferring causal relationships. Here, the not yield estimates of causal parameters, use of good design, especially randomization, even in randomized studies. Several other is emphasized, apparently obviating the need approaches to mediation, including principal to worry about spurious relationships. stratification and instrumental variables, are Despite these important contributions, dur- also considered. ing the majority of the twentieth century, most statisticians espoused the view that statistics had little to do with causation. CAUSATION AND PROBABILISTIC But the situation has reversed dramatically CAUSATION in the last 30 years, since Rubin (1974, 1977, 1978, 1980) rediscovered Neyman’s Regularity theories of causation are concerned potential outcomes notation and extended with the full (or philosophical) cause of an the theory of experimental design to obser- effect, by which is meant a set of conditions vational studies. Currently, there is a large that is sufficient (or necessary or necessary and growing inference in statistics on the and sufficient) for the effect to occur. This topic of causal inference and this second type of theory descends from Hume, who approach to inferring causal relationships claimed that causation (as it exists in the is coming to dominate the first approach, real world) consists only of the following: even in disciplines such as economics, (1) temporal priority, i.e., the cause must which rely on observational studies and precede the effect in time; (2) spatiotemporal where the first approach has traditionally contiguity, i.e., the cause and effect are dominated. ‘near’ in time and space; and (3) constant This chapter provides an introduction, conjunction, i.e., if the same circumstances tailored to the concerns of behavioral sci- are repeated, the same outcome will occur. entists, to this second approach to causal Many subsequent writers argued that Hume’s inference. Because causal inference is the analysis cannot distinguish between regular- act of making inferences about the causal ities that are not causal, such as a relation relation and notions of the causal relation between two events brought about by a differ, it is important to understand what common factor, and genuine causation. At the notion of causation is under consideration minimum, this suggests that Hume’s account when such an inference is made. Thus, in is incomplete. A number of philosophers the next section, I briefly review several [e.g., Bunge (1979) and Harré and Madden notions of causation and also briefly examine (1975)] argue that the causal relation is the approach to causal inference that derives generative. While this idea is appealing, from Yule. In the section ‘Unit and average especially to modern scientists who speak of causal effects’ the second approach, which mechanisms, attempts to elaborate this idea is built on the idea that a causal relation have not been entirely successful. Another should sustain a counterfactual conditional approach (examined later) is to require statement, is introduced, and a number of causal relationships to sustain counterfactual estimands of interest are defined. The section conditional statements. ‘Identification of causal parameters under Hume’s analysis is also deterministic, ignorable treatment assignment’discusses the and the literature on probabilistic causation identification of causal effects and the next that descends from Yule can be viewed as CAUSAL INFERENCE IN RANDOMIZED AND NON-RANDOMIZED STUDIES 5 an attempt to both relax this feature and whether causation is viewed as probabilistic distinguish between causal and non-causal in some inherent sense or if probability arises regularities. The basic idea is as follows. First, in some other way. The deficiencies of this there is a putative cause Z prior in some approach are evident in the psychological sense to an outcome Y. Further, Z and Y literature on casual modeling, where a variety are associated (correlated). However, if the of extra-mathematical considerations (such as Z − Y association vanishes when a (set of) model specification) are used to suggest that variable(s) X prior to Z is conditioned on model coefficients can be endowed with a (or in some accounts, if such a set exists), causal interpretation (see, for example, Sobel this is taken to mean that Z ‘does not cause’ 1995 on this point). Y, that is, the relationship is ‘spurious’. To By way of contrast to regularity theo- complete the picture, various examples where ries, manipulability theories view causes this criterion would seem to work well have as variables that can be manipulated, with been constructed. the outcome depending on the state of the Granger causation and structural equa- manipulated variable. Here, as opposed to tion models use this type of reasoning to specifying all the variables and the functional distinguish between empirical relationships relationship between these and the outcome that are regarded as causal and not causal. (which would constitute a successful causal For example, consider a structural equation account of a phenomenon under a regularity model, with X a vector of variables, Z an theory), the goal is more modest, to examine outcome occurring after X, and Y an outcome the ‘effect’ of a particular variable. See Sobel after Z. If the ‘direct effect’ of Y on Z is 0 (1995) for a reconciliation of these two (not 0), Z is not viewed (viewed) as a cause approaches. of Y. A sufficient condition for this direct Manipulability theories require the causal effect to be 0 is that Y and Z are conditionally relation to sustain a counterfactual conditional independent, given X. Of course, this same statement (e.g., eating the poison caused John kind of reasoning can be extended to the to die means that John ate the poison and consideration of other types of direct and died, but had John not eaten the poison, indirect effects. For example, consider the he would not have died). This is closer to case of a variable X associated with Z, with the way an experimentalist thinks of cau- X and Y conditionally independent, given Z, sation. However, many philosophers regard implying the ‘direct effect of X on Y is 0, and manipulability theories as anthropomorphic. Z and Y not conditionally independent given Further, many questions that scientists ask X, implying the ‘direct effect’ of Z on Y is are not amenable to experimentation, e.g, non-zero. Here there is an effect of X on Y, the effect of education on longevity or but the effect is indirect, through Z. the effect of marriage on happiness. This There are a number of problems with this would appear to seriously limit the value of approach. First and foremost, it confounds this approach for addressing real scientific causation with the act of inferring causation, questions. as evidenced by the fact that the criteria However, even without manipulating a per- above for inferring causation are typically son’s level of education, one might imagine put forth independently of any explicit notion that had this person’s level of education of the causal relation. As notions of the taken on a different value than that actually causal relation vary, this method of inferring realized, this person might also have a causation may be appropriate for some different outcome. This suggests adopting the notions of causation, for example, the case broader view that it is not the manipulation where causation is regarded as a predictive per se , but the idea that the causal variable relationship among variables, but not for could take on a different value than it others. Nor (because the nature of the casual actually takes on, which is key. This is the relation is not explicitly considered), is it clear idea underlying counterfactual theories of 6 DESIGN AND INFERENCE causation (e.g., Lewis 1973), where the closest show that these methods rest on a number of possible world to the one we live in serves as implausible assumptions. the basis for the counterfactual. Counterfactual theories also have their difficulties, both theoretical and practical. UNIT AND AVERAGE CAUSAL EFFECTS One criticism goes under the rubric of ‘preemption’.
Recommended publications
  • Causal Inference with Information Fields
    Causal Inference with Information Fields Benjamin Heymann Michel De Lara Criteo AI Lab, Paris, France CERMICS, École des Ponts, Marne-la-Vallée, France [email protected] [email protected] Jean-Philippe Chancelier CERMICS, École des Ponts, Marne-la-Vallée, France [email protected] Abstract Inferring the potential consequences of an unobserved event is a fundamental scientific question. To this end, Pearl’s celebrated do-calculus provides a set of inference rules to derive an interventional probability from an observational one. In this framework, the primitive causal relations are encoded as functional dependencies in a Structural Causal Model (SCM), which maps into a Directed Acyclic Graph (DAG) in the absence of cycles. In this paper, by contrast, we capture causality without reference to graphs or functional dependencies, but with information fields. The three rules of do-calculus reduce to a unique sufficient condition for conditional independence: the topological separation, which presents some theoretical and practical advantages over the d-separation. With this unique rule, we can deal with systems that cannot be represented with DAGs, for instance systems with cycles and/or ‘spurious’ edges. We provide an example that cannot be handled – to the extent of our knowledge – with the tools of the current literature. 1 Introduction As the world shifts toward more and more data-driven decision-making, causal inference is taking more space in applied sciences, statistics and machine learning. This is because it allows for better, more robust decision-making, and provides a way to interpret the data that goes beyond correlation Pearl and Mackenzie [2018].
    [Show full text]
  • Data Collection: Randomized Experiments
    9/2/15 STAT 250 Dr. Kari Lock Morgan Knee Surgery for Arthritis Researchers conducted a study on the effectiveness of a knee surgery to cure arthritis. Collecting Data: It was randomly determined whether people got Randomized Experiments the knee surgery. Everyone who underwent the surgery reported feeling less pain. SECTION 1.3 Is this evidence that the surgery causes a • Control/comparison group decrease in pain? • Clinical trials • Placebo Effect (a) Yes • Blinding • Crossover studies / Matched pairs (b) No Statistics: Unlocking the Power of Data Lock5 Statistics: Unlocking the Power of Data Lock5 Control Group Clinical Trials Clinical trials are randomized experiments When determining whether a treatment is dealing with medicine or medical interventions, effective, it is important to have a comparison conducted on human subjects group, known as the control group Clinical trials require additional aspects, beyond just randomization to treatment groups: All randomized experiments need a control or ¡ Placebo comparison group (could be two different ¡ Double-blind treatments) Statistics: Unlocking the Power of Data Lock5 Statistics: Unlocking the Power of Data Lock5 Placebo Effect Study on Placebos Often, people will experience the effect they think they should be experiencing, even if they aren’t actually Blue pills are better than yellow pills receiving the treatment. This is known as the placebo effect. Red pills are better than blue pills Example: Eurotrip 2 pills are better than 1 pill One study estimated that 75% of the
    [Show full text]
  • The Practice of Causal Inference in Cancer Epidemiology
    Vol. 5. 303-31 1, April 1996 Cancer Epidemiology, Biomarkers & Prevention 303 Review The Practice of Causal Inference in Cancer Epidemiology Douglas L. Weedt and Lester S. Gorelic causes lung cancer (3) and to guide causal inference for occu- Preventive Oncology Branch ID. L. W.l and Comprehensive Minority pational and environmental diseases (4). From 1965 to 1995, Biomedical Program IL. S. 0.1. National Cancer Institute, Bethesda, Maryland many associations have been examined in terms of the central 20892 questions of causal inference. Causal inference is often practiced in review articles and editorials. There, epidemiologists (and others) summarize evi- Abstract dence and consider the issues of causality and public health Causal inference is an important link between the recommendations for specific exposure-cancer associations. practice of cancer epidemiology and effective cancer The purpose of this paper is to take a first step toward system- prevention. Although many papers and epidemiology atically reviewing the practice of causal inference in cancer textbooks have vigorously debated theoretical issues in epidemiology. Techniques used to assess causation and to make causal inference, almost no attention has been paid to the public health recommendations are summarized for two asso- issue of how causal inference is practiced. In this paper, ciations: alcohol and breast cancer, and vasectomy and prostate we review two series of review papers published between cancer. The alcohol and breast cancer association is timely, 1985 and 1994 to find answers to the following questions: controversial, and in the public eye (5). It involves a common which studies and prior review papers were cited, which exposure and a common cancer and has a large body of em- causal criteria were used, and what causal conclusions pirical evidence; over 50 studies and over a dozen reviews have and public health recommendations ensued.
    [Show full text]
  • Statistics and Causal Inference (With Discussion)
    Applied Statistics Lecture Notes Kosuke Imai Department of Politics Princeton University February 2, 2008 Making statistical inferences means to learn about what you do not observe, which is called parameters, from what you do observe, which is called data. We learn the basic principles of statistical inference from a perspective of causal inference, which is a popular goal of political science research. Namely, we study statistics by learning how to make causal inferences with statistical methods. 1 Statistical Framework of Causal Inference What do we exactly mean when we say “An event A causes another event B”? Whether explicitly or implicitly, this question is asked and answered all the time in political science research. The most commonly used statistical framework of causality is based on the notion of counterfactuals (see Holland, 1986). That is, we ask the question “What would have happened if an event A were absent (or existent)?” The following example illustrates the fact that some causal questions are more difficult to answer than others. Example 1 (Counterfactual and Causality) Interpret each of the following statements as a causal statement. 1. A politician voted for the education bill because she is a democrat. 2. A politician voted for the education bill because she is liberal. 3. A politician voted for the education bill because she is a woman. In this framework, therefore, the fundamental problem of causal inference is that the coun- terfactual outcomes cannot be observed, and yet any causal inference requires both factual and counterfactual outcomes. This idea is formalized below using the potential outcomes notation.
    [Show full text]
  • Chapter 4: Fisher's Exact Test in Completely Randomized Experiments
    1 Chapter 4: Fisher’s Exact Test in Completely Randomized Experiments Fisher (1925, 1926) was concerned with testing hypotheses regarding the effect of treat- ments. Specifically, he focused on testing sharp null hypotheses, that is, null hypotheses under which all potential outcomes are known exactly. Under such null hypotheses all un- known quantities in Table 4 in Chapter 1 are known–there are no missing data anymore. As we shall see, this implies that we can figure out the distribution of any statistic generated by the randomization. Fisher’s great insight concerns the value of the physical randomization of the treatments for inference. Fisher’s classic example is that of the tea-drinking lady: “A lady declares that by tasting a cup of tea made with milk she can discriminate whether the milk or the tea infusion was first added to the cup. ... Our experi- ment consists in mixing eight cups of tea, four in one way and four in the other, and presenting them to the subject in random order. ... Her task is to divide the cups into two sets of 4, agreeing, if possible, with the treatments received. ... The element in the experimental procedure which contains the essential safeguard is that the two modifications of the test beverage are to be prepared “in random order.” This is in fact the only point in the experimental procedure in which the laws of chance, which are to be in exclusive control of our frequency distribution, have been explicitly introduced. ... it may be said that the simple precaution of randomisation will suffice to guarantee the validity of the test of significance, by which the result of the experiment is to be judged.” The approach is clear: an experiment is designed to evaluate the lady’s claim to be able to discriminate wether the milk or tea was first poured into the cup.
    [Show full text]
  • Bayesian Causal Inference
    Bayesian Causal Inference Maximilian Kurthen Master’s Thesis Max Planck Institute for Astrophysics Within the Elite Master Program Theoretical and Mathematical Physics Ludwig Maximilian University of Munich Technical University of Munich Supervisor: PD Dr. Torsten Enßlin Munich, September 12, 2018 Abstract In this thesis we address the problem of two-variable causal inference. This task refers to inferring an existing causal relation between two random variables (i.e. X → Y or Y → X ) from purely observational data. We begin by outlining a few basic definitions in the context of causal discovery, following the widely used do-Calculus [Pea00]. We continue by briefly reviewing a number of state-of-the-art methods, including very recent ones such as CGNN [Gou+17] and KCDC [MST18]. The main contribution is the introduction of a novel inference model where we assume a Bayesian hierarchical model, pursuing the strategy of Bayesian model selection. In our model the distribution of the cause variable is given by a Poisson lognormal distribution, which allows to explicitly regard discretization effects. We assume Fourier diagonal covariance operators, where the values on the diagonal are given by power spectra. In the most shallow model these power spectra and the noise variance are fixed hyperparameters. In a deeper inference model we replace the noise variance as a given prior by expanding the inference over the noise variance itself, assuming only a smooth spatial structure of the noise variance. Finally, we make a similar expansion for the power spectra, replacing fixed power spectra as hyperparameters by an inference over those, where again smoothness enforcing priors are assumed.
    [Show full text]
  • Survey Experiments
    IU Workshop in Methods – 2019 Survey Experiments Testing Causality in Diverse Samples Trenton D. Mize Department of Sociology & Advanced Methodologies (AMAP) Purdue University Survey Experiments Page 1 Survey Experiments Page 2 Contents INTRODUCTION ............................................................................................................................................................................ 8 Overview .............................................................................................................................................................................. 8 What is a survey experiment? .................................................................................................................................... 9 What is an experiment?.............................................................................................................................................. 10 Independent and dependent variables ................................................................................................................. 11 Experimental Conditions ............................................................................................................................................. 12 WHY CONDUCT A SURVEY EXPERIMENT? ........................................................................................................................... 13 Internal, external, and construct validity ..........................................................................................................
    [Show full text]
  • Articles Causal Inference in Civil Rights Litigation
    VOLUME 122 DECEMBER 2008 NUMBER 2 © 2008 by The Harvard Law Review Association ARTICLES CAUSAL INFERENCE IN CIVIL RIGHTS LITIGATION D. James Greiner TABLE OF CONTENTS I. REGRESSION’S DIFFICULTIES AND THE ABSENCE OF A CAUSAL INFERENCE FRAMEWORK ...................................................540 A. What Is Regression? .........................................................................................................540 B. Problems with Regression: Bias, Ill-Posed Questions, and Specification Issues .....543 1. Bias of the Analyst......................................................................................................544 2. Ill-Posed Questions......................................................................................................544 3. Nature of the Model ...................................................................................................545 4. One Regression, or Two?............................................................................................555 C. The Fundamental Problem: Absence of a Causal Framework ....................................556 II. POTENTIAL OUTCOMES: DEFINING CAUSAL EFFECTS AND BEYOND .................557 A. Primitive Concepts: Treatment, Units, and the Fundamental Problem.....................558 B. Additional Units and the Non-Interference Assumption.............................................560 C. Donating Values: Filling in the Missing Counterfactuals ...........................................562 D. Randomization: Balance in Background Variables ......................................................563
    [Show full text]
  • Analysis of Variance and Analysis of Variance and Design of Experiments of Experiments-I
    Analysis of Variance and Design of Experimentseriments--II MODULE ––IVIV LECTURE - 19 EXPERIMENTAL DESIGNS AND THEIR ANALYSIS Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur 2 Design of experiment means how to design an experiment in the sense that how the observations or measurements should be obtained to answer a qqyuery inavalid, efficient and economical way. The desigggning of experiment and the analysis of obtained data are inseparable. If the experiment is designed properly keeping in mind the question, then the data generated is valid and proper analysis of data provides the valid statistical inferences. If the experiment is not well designed, the validity of the statistical inferences is questionable and may be invalid. It is important to understand first the basic terminologies used in the experimental design. Experimental unit For conducting an experiment, the experimental material is divided into smaller parts and each part is referred to as experimental unit. The experimental unit is randomly assigned to a treatment. The phrase “randomly assigned” is very important in this definition. Experiment A way of getting an answer to a question which the experimenter wants to know. Treatment Different objects or procedures which are to be compared in an experiment are called treatments. Sampling unit The object that is measured in an experiment is called the sampling unit. This may be different from the experimental unit. 3 Factor A factor is a variable defining a categorization. A factor can be fixed or random in nature. • A factor is termed as fixed factor if all the levels of interest are included in the experiment.
    [Show full text]
  • The Politics of Random Assignment: Implementing Studies and Impacting Policy
    The Politics of Random Assignment: Implementing Studies and Impacting Policy Judith M. Gueron Manpower Demonstration Research Corporation (MDRC) As the only nonacademic presenting a paper at this conference, I see it as my charge to focus on the challenge of implementing random assignment in the field. I will not spend time arguing for the methodological strengths of social experiments or advocating for more such field trials. Others have done so eloquently.1 But I will make my biases clear. For 25 years, I and many of my MDRC colleagues have fought to implement random assignment in diverse arenas and to show that this approach is feasible, ethical, uniquely convincing, and superior for answering certain questions. Our organization is widely credited with being one of the pioneers of this approach, and through its use producing results that are trusted across the political spectrum and that have made a powerful difference in social policy and research practice. So, I am a believer, but not, I hope, a blind one. I do not think that random assignment is a panacea or that it can address all the critical policy questions, or substitute for other types of analysis, or is always appropriate. But I do believe that it offers unique power in answering the “Does it make a difference?” question. With random assignment, you can know something with much greater certainty and, as a result, can more confidently separate fact from advocacy. This paper focuses on implementing experiments. In laying out the ingredients of success, I argue that creative and flexible research design skills are essential, but that just as important are operational and political skills, applied both to marketing the experiment in the first place and to helping interpret and promote its findings down the line.
    [Show full text]
  • The Theory of the Design of Experiments
    The Theory of the Design of Experiments D.R. COX Honorary Fellow Nuffield College Oxford, UK AND N. REID Professor of Statistics University of Toronto, Canada CHAPMAN & HALL/CRC Boca Raton London New York Washington, D.C. C195X/disclaimer Page 1 Friday, April 28, 2000 10:59 AM Library of Congress Cataloging-in-Publication Data Cox, D. R. (David Roxbee) The theory of the design of experiments / D. R. Cox, N. Reid. p. cm. — (Monographs on statistics and applied probability ; 86) Includes bibliographical references and index. ISBN 1-58488-195-X (alk. paper) 1. Experimental design. I. Reid, N. II.Title. III. Series. QA279 .C73 2000 001.4 '34 —dc21 00-029529 CIP This book contains information obtained from authentic and highly regarded sources. Reprinted material is quoted with permission, and sources are indicated. A wide variety of references are listed. Reasonable efforts have been made to publish reliable data and information, but the author and the publisher cannot assume responsibility for the validity of all materials or for the consequences of their use. Neither this book nor any part may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopying, microfilming, and recording, or by any information storage or retrieval system, without prior permission in writing from the publisher. The consent of CRC Press LLC does not extend to copying for general distribution, for promotion, for creating new works, or for resale. Specific permission must be obtained in writing from CRC Press LLC for such copying. Direct all inquiries to CRC Press LLC, 2000 N.W.
    [Show full text]
  • Key Items to Get Right When Conducting Randomized Controlled Trials of Social Programs
    Key Items to Get Right When Conducting Randomized Controlled Trials of Social Programs February 2016 This publication was produced by the Evidence-Based Policy team of the Laura and John Arnold Foundation (now Arnold Ventures). This publication is in the public domain. Authorization to reproduce it in whole or in part for educational purposes is granted. We welcome comments and suggestions on this document ([email protected]). 1 Purpose This is a checklist of key items to get right when conducting a randomized controlled trial (RCT) to evaluate a social program or practice. The checklist is designed to be a practical resource for researchers and sponsors of research. It describes items that are critical to the success of an RCT in producing valid findings about a social program’s effectiveness. This document is limited to key items, and does not address all contingencies that may affect a study’s success.1 Items in this checklist are categorized according to the following phases of an RCT: 1. Planning the study; 2. Carrying out random assignment; 3. Measuring outcomes for the study sample; and 4. Analyzing the study results. 1. Key items to get right in planning the study Choose (i) the program to be evaluated, (ii) target population for the study, and (iii) key outcomes to be measured. These should include, wherever possible, ultimate outcomes of policy importance. As illustrative examples: . An RCT of a pregnancy prevention program preferably should measure outcomes such as pregnancies or births, and not just intermediate outcomes such as condom use. An RCT of a remedial reading program preferably should measure outcomes such as reading comprehension, and not just participants’ ability to sound out words.
    [Show full text]