Causal Inference Methods for Bias Correction in Data Analyses

Total Page:16

File Type:pdf, Size:1020Kb

Causal Inference Methods for Bias Correction in Data Analyses CAUSAL INFERENCE METHODS FOR BIAS CORRECTION IN DATA ANALYSES by Razieh Nabi A dissertation submitted to Johns Hopkins University in conformity with the requirements for the degree of Doctor of Philosophy Baltimore, Maryland March 2021 © 2021 Razieh Nabi All rights reserved Abstract Many problems in the empirical sciences and rational decision making require causal, rather than associative, reasoning. The field of causal inference is concerned with establishing and quantifying cause-effect relationships to inform interventions, even in the absence of direct experimentation or randomization. With the proliferation of massive datasets, it is crucial that we develop principled approaches to drawing actionable conclusions from imperfect information. Inferring valid causal conclusions is impeded by the fact that data are unstructured and filled with different sources of bias. The types of bias that we consider in this thesis include: confounding bias induced by common causes of observed exposures and outcomes, bias in estimation induced by high dimensional data and curse of dimensionality, discriminatory bias encoded in data that reflect historical patterns of discrimination and inequality, and missing data bias where instantiations of variables are systematically missing. The focus of this thesis is on the development of novel causal and statistical methodologies to better understand and resolve these pressing challenges. We draw on methodological insights from both machine learning/artificial intelligence and statistical theory. Specifically, we use ideas from graphical modeling to encode our assumptions about the underlying data generating mechanisms in a clear and succinct manner. Further, we use ideas from nonparametric and semiparametric theories to enable the use of flexible machine learning modes in the estimation of causal effects that are identified as functions of observed data. There are four main contributions to this thesis. First, we bridge the gap between ii identification and semiparametric estimation of causal effects that are identified in causal graphical models with unmeasured confounders. Second, we use semiparametric inference theory for marginal structural models to give the first general approach to causal sufficient dimension reduction of a high dimensional treatment. Third,we address conceptual, methodological, and practical gaps in assessing and overcoming disparities in automated decision making using causal inference and constrained optimization. Fourth, we use graphical representations of missing data mechanisms and provide a complete characterization of identification of the underlying joint distribution where some variables are systematically missing and others are unmeasured. iii Committee Members Dr. Ilya Shpitser (Primary Advisor) John C. Malone Assistant Professor Department of Computer Science Whiting School of Engineering Johns Hopkins University Dr. Daniel Scharfstein Professor of Biostatistics Department of Population Health Sciences School of Medicine University of Utah Dr. Eric Tchetgen Tchetgen Luddy Family President’s Distinguished Professor Statistics Department Wharton School of Business University of Pennsylvania Dr. Elizabeth Ogburn Associate Professor Department of Biostatistics Bloomberg School of Public Health Johns Hopkins University iv Preface Prior to working with Ilya, I had almost no exposure to the field of causal inference. My first encounter with “causal reasoning” was a cosmological argument in my pre-college theology courses, which I was not impressed by. Years later when I was doing my masters in statistics, the mantra of “correlation is not causation” got stuck in my head. The summer before applying to PhD programs, I visited my sister, Marzieh, in California and found the Causality book by Judea Pearl in her bookshelf. I started reading parts of it, and came across this quote: “I would rather discover one causal law than be King of Persia” – Democritus. Semi-seriously I thought to myself: maybe if I pursue a degree in causal inference, one day if I am presented with the throne to be the Queen of Persia, I can decline because at that point I might have learned many causal laws! Marzieh’s book now sits in my bookshelf. When I joined the CS program at Hopkins, I started working with Ilya on two separate projects. The first one was on a causal view of algorithmic fairness. There are many stories where AI algorithms demonstrate discriminatory, and potentially harmful, behaviors towards minorities. Initially, I relied on this work as a positive vehicle for addressing the discrimination I felt due to the restrictive immigration policies and the travel ban in the US. Over time, I found more purpose in my research as it seeks to raise awareness and improve the lives of underrepresented minorities. My research on algorithmic fairness (described in Chapter3) led to the development of a causal framework to interrogate and modify AI algorithms to not rely on sensitive attributes, like race or gender, in inappropriate ways. v The basis for my research on use of semiparametric theory in estimation of causal quantities originally stemmed from my passion for developing a method that establishes the cause-effect relations between the high dimensional treatment of radiation therapy and salivary dysfunctions. This was the second project I was working on in parallel with algorithmic fairness (described in Chapter2). This launched me into reading the book on Semiparametric Theory and Missing Data by Anastasios Tsiatis. In a few months, we (Ilya’s group) joined forces with folks at the Biostats department, and our discussions turned into a regular story time narrated by Dan Scharfstein. For over a year, every Friday we would gather in the library on the 3rd floor of the School of Public Health, and enjoy story time accompanied with coffee and donuts from Dunkin’. Receiving validation from senior researchers in the semiparametrics field like Dan boosted my confidence and I grew to enjoy it even more. On the other hand, a colleague of mine, Rohit, was not quite as impressed as I was about the theory. His main issue was lack of an automated procedure to derive influence functions and perform projections. Focusing on average causal effects, Rohit and I started thinking about an automated procedure to find influence functions for effects that are identified in causal graphical models with unmeasured confounders (described in part in Chapter2). Prior to this, Rohit and I worked on two papers on missing data identification (Chapter4) which stemmed from working on structure learning with missing data and getting stuck at the “wasteland” of non-identifiable laws. We paused the structure learning project and started thinking more carefully about the identifiability aspects of missing data models. If I am ever given a chance to go back in time and re-do my PhD, I would try harder to stick to the principles beautifully presented in this quote: “The important thing in life is not to conquer but to fight well and not to win but to take part.” – Pierre de Coubertin. vi To my lovely mom for being a role model of a brave and independent woman & To my late hardworking dad for teaching me to be ambitious and responsible vii Acknowledgments Acknowledgments are by far the hardest part of the dissertation to write. Feelings, emotions, and sensations cannot easily be summarized in a few pages. There are many people to thank. There are even people whose names or faces I do not know that have played a crucial role to place me where I am today; like the immigration officers who decided for me that I should not pursue a PhD degree in Aerospace Engineering at UW! Some may think that was unfortunate. I cannot agree or disagree with this sentiment as exploring all the counterfactual worlds that I could have ended up in is unattainable. What I can say with certainty however, is that I am happy and thankful to all the causes that brought me to Hopkins. At Hopkins, I started working with Ilya on multiple causal inference projects. I immediately fell in love with the field and here I am today concluding my PhD work on causal inference. Ilya’s passion for research and his support throughout these years has kept me swirling in this world of counterfactuals and I cannot thank him enough for this. Ilya has gathered a wonderful group of scholars in “House-of-Ayli.” It has been a pleasure growing up with them as a person and a researcher. I have learned a lot from each and every one of the “fellow-kids.” I thank Amir, Dan, Eli, Jaron, Noam, Numair, Ranjani, Rohit, and Zach for all the nice memories, collaborations, and friendships. I would like to thank Ilya, Dan S, Betsy, Eric, and Emre for writing strong letters of recommendations for me when I was on the job market, and it is a true honor to have Dan S, Betsy, and Eric in my thesis committee. I especially would like to viii thank Dan S for his gracious mentorship when I needed it the most during my job search and interviews, for planting the seed of passion for semiparametrics in me, and for narrating the semiparametric story time for over a year to the “merry band.” This brought me closer to Bonnie, Lamar, Youjin, and Ryan (who made coffee and donuts a story time tradition.) I learned a lot from them during our discussions and brainstorming sessions over the “clearly” stated claims in the “yellow book.” I would also like to thank Dr. Su for his extensive support when I was writing my first paper in grad school. I had the pleasure of having wonderful mentors like the late Dr. Joan Staniswalis, Dr. Ahmet Bulut, and Dr. Tarik Arici. This whole journey would have been a total wreck if it was not for friends to share with them the ups and downs, the laughter and tears along the way, to celebrate the successes and to get inspired in failures. My heart is filled with memories of my friends in undergrad in Tehran, the Taksim Square and Acibadem in Istanbul, the third and second floor of Malone (especially the time spent procrastinating and making coffee in the kitchen of Malone).
Recommended publications
  • Set Identification and Sensitivity Analysis with Tobin Regressors
    A Service of Leibniz-Informationszentrum econstor Wirtschaft Leibniz Information Centre Make Your Publications Visible. zbw for Economics Chernozhukov, Victor; Rigobon, Roberto; Stoker, Thomas M. Article Set identification and sensitivity analysis with Tobin regressors Quantitative Economics Provided in Cooperation with: The Econometric Society Suggested Citation: Chernozhukov, Victor; Rigobon, Roberto; Stoker, Thomas M. (2010) : Set identification and sensitivity analysis with Tobin regressors, Quantitative Economics, ISSN 1759-7331, Wiley, Hoboken, NJ, Vol. 1, Iss. 2, pp. 255-277, http://dx.doi.org/10.3982/QE1 This Version is available at: http://hdl.handle.net/10419/150311 Standard-Nutzungsbedingungen: Terms of use: Die Dokumente auf EconStor dürfen zu eigenen wissenschaftlichen Documents in EconStor may be saved and copied for your Zwecken und zum Privatgebrauch gespeichert und kopiert werden. personal and scholarly purposes. Sie dürfen die Dokumente nicht für öffentliche oder kommerzielle You are not to copy documents for public or commercial Zwecke vervielfältigen, öffentlich ausstellen, öffentlich zugänglich purposes, to exhibit the documents publicly, to make them machen, vertreiben oder anderweitig nutzen. publicly available on the internet, or to distribute or otherwise use the documents in public. Sofern die Verfasser die Dokumente unter Open-Content-Lizenzen (insbesondere CC-Lizenzen) zur Verfügung gestellt haben sollten, If the documents have been made available under an Open gelten abweichend von diesen Nutzungsbedingungen die in der dort Content Licence (especially Creative Commons Licences), you genannten Lizenz gewährten Nutzungsrechte. may exercise further usage rights as specified in the indicated licence. https://creativecommons.org/licenses/by-nc/3.0/ www.econstor.eu Quantitative Economics 1 (2010), 255–277 1759-7331/20100255 Set identification and sensitivity analysis with Tobin regressors Victor Chernozhukov Department of Economics, MIT Roberto Rigobon Sloan School of Management, MIT Thomas M.
    [Show full text]
  • Estimation and Inference for Set&Identified Parameters Using Posterior Lower Probability
    Estimation and Inference for Set-identi…ed Parameters Using Posterior Lower Probability Toru Kitagawa CeMMAP and Department of Economics, UCL First Draft: September 2010 This Draft: March, 2012 Abstract In inference about set-identi…ed parameters, it is known that the Bayesian probability state- ments about the unknown parameters do not coincide, even asymptotically, with the frequen- tist’scon…dence statements. This paper aims to smooth out this disagreement from a multiple prior robust Bayes perspective. We show that there exist a class of prior distributions (am- biguous belief), with which the posterior probability statements drawn via the lower envelope (lower probability) of the posterior class asymptotically agree with the frequentist con…dence statements for the identi…ed set. With such class of priors, we analyze statistical decision problems including point estimation of the set-identi…ed parameter by applying the gamma- minimax criterion. From a subjective robust Bayes point of view, the class of priors can serve as benchmark ambiguous belief, by introspectively assessing which one can judge whether the frequentist inferential output for the set-identi…ed model is an appealing approximation of his posterior ambiguous belief. Keywords: Partial Identi…cation, Bayesian Robustness, Belief Function, Imprecise Probability, Gamma-minimax, Random Set. JEL Classi…cation: C12, C15, C21. Email: [email protected]. I thank Andrew Chesher, Siddhartha Chib, Jean-Pierre Florens, Charles Manski, Ulrich Müller, Andriy Norets, Adam Rosen, Kevin Song, and Elie Tamer for valuable discussions and comments. I also thank the seminar participants at Academia Sinica Taiwan, Cowles Summer Conference 2011, EC2 Conference 2010, MEG 2010, RES Annual Conference 2011, Simon Fraser University, and the University of British Columbia for helpful comments.
    [Show full text]
  • Microeconometrics with Partial Identification
    Microeconometrics with Partial Identification Francesca Molinari Cornell University Department of Economics [email protected]∗ March 12, 2020 Abstract This chapter reviews the microeconometrics literature on partial identification, focus- ing on the developments of the last thirty years. The topics presented illustrate that the available data combined with credible maintained assumptions may yield much informa- tion about a parameter of interest, even if they do not reveal it exactly. Special attention is devoted to discussing the challenges associated with, and some of the solutions put forward to, (1) obtain a tractable characterization of the values for the parameters of interest which are observationally equivalent, given the available data and maintained assumptions; (2) estimate this set of values; (3) conduct test of hypotheses and make confidence statements. The chapter reviews advances in partial identification analysis both as applied to learning (functionals of) probability distributions that are well-defined in the absence of models, as well as to learning parameters that are well-defined only in the context of particular models. A simple organizing principle is highlighted: the source of the identification problem can often be traced to a collection of random variables that are consistent with the available data and maintained assumptions. This collection may be part of the observed data or be a model implication. In either case, it can be formal- ized as a random set. Random set theory is then used as a mathematical framework to unify a number of special results and produce a general methodology to carry out partial identification analysis. arXiv:2004.11751v1 [econ.EM] 24 Apr 2020 ∗This manuscript was prepared for the Handbook of Econometrics, Volume 7A c North Holland, 2019.
    [Show full text]
  • Counterfactual Framework and Assumptions
    CHAPTER 2 Counterfactual Framework and Assumptions T his chapter examines conceptual frameworks that guide the estimation of treatment effects as well as important assumptions that are embedded in observational studies. Section 2.1 defines causality and describes threats to internal validity. In addition, it reviews concepts that are generally discussed in the evaluation literature, emphasizing their links to statistical analysis. Section 2.2 summarizes the key features of the Neyman- Rubin counterfactual framework. Section 2.3 discusses the ignorable treatment assignment assumption. Section 2.4 describes the stable unit treatment value assumption (SUTVA). Section 2.5 provides an overview of statistical approaches developed to handle selection bias. With the aim of showing the larger context in which new evaluation methods are developed, this focuses on a variety of models, including the seven models covered in this book, and two popular approaches widely employed in economics—the instrumental variables estimator and regression discontinuity designs. Section 2.6 reviews the underlying logic of statistical inference for both randomized experiments and observational studies. Section 2.7 summarizes a range of treatment effects and extends the discussion of the SUTVA. We examine treatment effects by underscoring the maxim that different research questions imply different treatment effects and different analytic models must be matched to the kinds of effects expected. Section 2.8 discusses treatment effect heterogeneity and two recently developed tests of effect heterogeneity. With illustrations, this section shows how to use the tests to evaluate effect heterogeneity and the plausibility of the strongly ignorable treatment assignment assumption. Section 2.9 reviews Heckman’s scientific model of causality, which is a comprehensive, causal inference framework developed by econometricians.
    [Show full text]
  • Partial Identification in Applied Research
    Partial Identification in Applied Research: Benefits and Challenges∗ Kate Hoy Adam M. Rosenz Columbia University and NBER UCL and CEMMAP August 5, 2016 Abstract Advances in the study of partial identification allow applied researchers to learn about parameters of interest without making assumptions needed to guarantee point identification. We discuss the roles that assumptions and data play in partial identification analysis, with the goal of providing information to applied researchers that can help them employ these methods in practice. To this end, we present a sample of econometric models that have been used in a variety of recent applications where parameters of interest are partially identified, highlighting common features and themes across these papers. In addition, in order to help illustrate the combined roles of data and assumptions, we present numerical illustrations for a particular application, the joint determination of wages and labor supply. Finally we discuss the benefits and challenges of using partially identifying models in empirical work and point to possible avenues of future research. 1 Introduction The goal of identification analysis is to determine what can be learned using deductive reasoning through the combination of models (sets of assumptions) and data. Standard approaches to econometric modeling ∗This paper was prepared for an invited session at the 2015 Econometric Society World Congress in Montreal. We thank Richard Blundell, Andrew Chesher, Alon Eizenberg, Charles Manski, Francesca Molinari, and Ariel Pakes for helpful comments and suggestions. Adam Rosen gratefully acknowledges financial support from from the UK Economic and Social Research Council through a grant (RES-589-28-0001) to the ESRC Centre for Microdata Methods and Practice (CeMMAP), from the European Research Council (ERC) grant ERC-2009-StG-240910-ROMETA, and from a British Academy Mid-Career Fellowship.
    [Show full text]
  • Sharp Identification Regions in Models with Convex Moment Predictions
    http://www.econometricsociety.org/ Econometrica, Vol. 79, No. 6 (November, 2011), 1785–1821 SHARP IDENTIFICATION REGIONS IN MODELS WITH CONVEX MOMENT PREDICTIONS ARIE BERESTEANU University of Pittsburgh, Pittsburgh, PA 15260, U.S.A. ILYA MOLCHANOV Institute of Mathematical Statistics and Actuarial Science, University of Bern, CH-3012 Bern, Switzerland FRANCESCA MOLINARI Cornell University, Ithaca, NY 14853-7601, U.S.A. The copyright to this Article is held by the Econometric Society. It may be downloaded, printed and reproduced only for educational or research purposes, including use in course packs. No downloading or copying may be done for any commercial purpose without the explicit permission of the Econometric Society. For such commercial purposes contact the Office of the Econometric Society (contact information may be found at the website http://www.econometricsociety.org or in the back cover of Econometrica). This statement must be included on all copies of this Article that are made available electronically or in any other format. Econometrica, Vol. 79, No. 6 (November, 2011), 1785–1821 SHARP IDENTIFICATION REGIONS IN MODELS WITH CONVEX MOMENT PREDICTIONS BY ARIE BERESTEANU,ILYA MOLCHANOV, AND FRANCESCA MOLINARI1 We provide a tractable characterization of the sharp identification region of the pa- rameter vector θ in a broad class of incomplete econometric models. Models in this class have set-valued predictions that yield a convex set of conditional or unconditional moments for the observable model variables. In short, we call these models with con- vex moment predictions. Examples include static, simultaneous-move finite games of complete and incomplete information in the presence of multiple equilibria; best lin- ear predictors with interval outcome and covariate data; and random utility models of multinomial choice in the presence of interval regressors data.
    [Show full text]
  • Econometric Methods for the Analysis of Assignment Problems in the Presence of Complementarity and Social Spillovers1
    Econometric methods for the analysis of assignment problems in the presence of complementarity and social spillovers1 Bryan S. Graham† forthcoming in the Handbook of Social Economics (J. Benhabib, A. Bisin, & M. Jackson, Eds.). Initial Draft: January 2009 This Draft: June 28, 2010 Abstract: Concern over the distributional effects of policies which induce changes in peer group structure, or ‘associational redistributions’ (Durlauf, 1996c), motivates a sub- stantial body of theoretical and empirical research in economics, sociology, psychology, and education. A growing collection of econometric methods for characterizing the effects of such policies are now available. This chapter surveys these methods. I discuss the identification and estimation of the distributional effects of partner reassignment in one-on-one matching models, the average outcome and inequality effects of segregation, and treatment response in the presence of spillovers. Key Words: matching models, assignment problems, peer group effects, social spillovers, segregation, associational redistribution, complementarity, treatment effects, semiparametric M-estimation JEL Classification: C14, C25, C60, C61, J4, J12, I21 1 I would like to thank Stephane Bonhomme, Charles Manski, Konrad Menzel, and Geert Ridder for helpful discussions. More generally my thinking on the material presented below has benefited from discussions with my co-authors, Guido Imbens and Geert Ridder, and with members of the former MacArthur Research Network on Economic Inequality and Social Interactions (especially Steven Durlauf). Some of the new material in this chapter was presented to the applied micro- economics group at the Federal Reserve Bank of New York to whom I am thankful for feedback. I have especially benefitedfromcommentsonanearlierdraftbyPierre-AndréChiappori,Jeremy Fox, Guido Imbens, Jim Powell, and Bernard Salanie Financial support from the National Science Foundation (SES #0820361) is gratefully acknowledged.
    [Show full text]
  • Econometrics with Partial Identification
    Econometrics with Partial Identification Francesca Molinari The Institute for Fiscal Studies Department of Economics, UCL cemmap working paper CWP25/19 Econometrics with Partial Identification Francesca Molinari Cornell University Department of Economics [email protected]∗ May 30, 2019 Abstract Econometrics has traditionally revolved around point identification. Much effort has been devoted to finding the weakest set of assumptions that, together with the available data, deliver point identification of population parameters, finite or infinite dimensional that these might be. And point identification has been viewed as a necessary prerequi- site for meaningful statistical inference. The research program on partial identification has begun to slowly shift this focus in the early 1990s, gaining momentum over time and developing into a widely researched area of econometrics. Partial identification has forcefully established that much can be learned from the available data and assumptions imposed because of their credibility rather than their ability to yield point identification. Within this paradigm, one obtains a set of values for the parameters of interest which are observationally equivalent given the available data and maintained assumptions. I refer to this set as the parameters' sharp identification region. Econometrics with partial identification is concerned with: (1) obtaining a tractable characterization of the parameters' sharp identification region; (2) providing methods to estimate it; (3) conducting test of hypotheses and making confidence statements about the partially identified parameters. Each of these goals poses challenges that differ from those faced in econometrics with point identification. This chapter discusses these challenges and some of their solution. It reviews advances in partial identification analysis both as applied to learning (functionals of) probability distributions that are well-defined in the absence of models, as well as to learning parameters that are well-defined only in the context of particular models.
    [Show full text]
  • Treatment Effects with Unobserved Heterogeneity: a Set Identification Approach
    Syracuse University SURFACE Maxwell School of Citizenship and Public Center for Policy Research Affairs Summer 7-2014 Treatment Effects with Unobserved Heterogeneity: A Set Identification Approach Sung Jae Jun The Pennsylvania State University, [email protected] Yoonseok Lee Syracuse University, [email protected] Youngki Shin University of Western Ontario, [email protected] Follow this and additional works at: https://surface.syr.edu/cpr Part of the Health Policy Commons, Public Economics Commons, and the Social Welfare Commons Recommended Citation Jun, Sung Jae; Lee, Yoonseok; and Shin, Youngki, "Treatment Effects with Unobserved Heterogeneity: A Set Identification Approach" (2014). Center for Policy Research. 201. https://surface.syr.edu/cpr/201 This Working Paper is brought to you for free and open access by the Maxwell School of Citizenship and Public Affairs at SURFACE. It has been accepted for inclusion in Center for Policy Research by an authorized administrator of SURFACE. For more information, please contact [email protected]. Treatment Effects with Unobserved Heterogeneity: A Set Identification Approach Sung Jae Jun, Yoonseok Lee and Youngki Shin Paper No. 169 July 2014 CENTER FOR POLICY RESEARCH –Summer 2014 Leonard M. Lopoo, Director Associate Professor of Public Administration and International Affairs (PAIA) _________ Associate Directors Margaret Austin Associate Director Budget and Administration John Yinger Trustee Professor of Economics and PAIA Associate Director, Metropolitan Studies Program SENIOR RESEARCH ASSOCIATES Badi
    [Show full text]
  • Econometric Causality
    IZA DP No. 3425 Econometric Causality James J. Heckman DISCUSSION PAPER SERIES DISCUSSION PAPER March 2008 Forschungsinstitut zur Zukunft der Arbeit Institute for the Study of Labor Econometric Causality James J. Heckman University of Chicago, American Bar Foundation, University College Dublin and IZA Discussion Paper No. 3525 March 2008 IZA P.O. Box 7240 53072 Bonn Germany Phone: +49-228-3894-0 Fax: +49-228-3894-180 E-mail: [email protected] Any opinions expressed here are those of the author(s) and not those of IZA. Research published in this series may include views on policy, but the institute itself takes no institutional policy positions. The Institute for the Study of Labor (IZA) in Bonn is a local and virtual international research center and a place of communication between science, politics and business. IZA is an independent nonprofit organization supported by Deutsche Post World Net. The center is associated with the University of Bonn and offers a stimulating research environment through its international network, workshops and conferences, data service, project support, research visits and doctoral program. IZA engages in (i) original and internationally competitive research in all fields of labor economics, (ii) development of policy concepts, and (iii) dissemination of research results and concepts to the interested public. IZA Discussion Papers often represent preliminary work and are circulated to encourage discussion. Citation of such a paper should account for its provisional character. A revised version may be available directly from the author. IZA Discussion Paper No. 3525 March 2008 ABSTRACT Econometric Causality This paper presents the econometric approach to causal modeling.
    [Show full text]
  • Identification Region of the Potential Outcome Distributions Under
    Identification region of the potential outcome distributions under instrument independence Toru Kitagawa The Institute for Fiscal Studies Department of Economics, UCL cemmap working paper CWP30/09 Identi…cation Region of the Potential Outcome Distributions under Instrument Independence Toru Kitagaway University College London and CeMMAP October 2009. Abstract This paper examines identi…cation power of the instrument exogeneity assumption in the treatment e¤ect model. We derive the identi…cation region: The set of potential outcome distributions that are compatible with data and the model restriction. The model restrictions whose identifying power is investigated are (i) instrument independence of each of the potential outcome (marginal independence), (ii) instrument joint independence of the potential outcomes and the selection heterogeneity, and (iii) instrument monotonicity in addition to (ii) (the LATE restriction of Imbens and Angrist (1994)), where these restrictions become stronger in the order of listing. By comparing the size of the identi…cation region under each restriction, we show that the joint independence restriction can provide further identifying information for the potential outcome distributions than marginal independence, but the LATE restriction never does since it solely constrains the distribution of data. We also derive the tightest possible bounds for the average treatment e¤ects under each restriction. Our analysis covers both the discrete and continuous outcome case, and extends the treatment e¤ect bounds of Balke and Pearl (1997) that are available only for the binary outcome case to a wider range of settings including the continuous outcome case. Keywords: Partial Identi…cation, Program Evaluation, Treatment E¤ects, Instrumental Vari- ables JEL Classi…cation: C14, C21.
    [Show full text]
  • “Set Identification, Moment Restrictions and Inference”
    16‐752 January 2017 “Set Identification, Moment Restrictions and Inference” Christian Bontemps and Thierry Magnac Set Identification, Moment Restrictions and Inference Christian Bontemps∗ Thierry Magnacy First Version: September 2016 This Version: January 12, 2017 Abstract For the last ten years, the topic of set identification has been much studied in the econo- metric literature. Classical inference methods have been generalized to the case in which moment inequalities and equalities define a set instead of a point. We review several instances of partial identification by focusing on examples in which the underlying economic restrictions are expressed as linear moments. This setting illustrates the fact that convex analysis helps not only in characterizing the identified set but also for inference. In this perspective, we review inference methods using convex analysis or inversion of tests and detail how geometric characterizations can be useful. Keywords: set identification, moment inequality, convex set, support function. ∗Toulouse School of Economics, ENAC, Universit´ede Toulouse, 21 all´eede Brienne, 31000 Toulouse, France, email: [email protected] yToulouse School of Economics, Universit´ede Toulouse Capitole, 21 all´eede Brienne, 31000 Toulouse, France, email: [email protected] 1 1 Introduction1 The importance of the standard notion of point identification, that appears in standard econometric textbooks (for instance, Wooldridge, 2010), has been questioned for the last thirty years especially by Manski and his coauthors (1989 and seq) who reintroduced and developed the notion of set or partial identification in the literature. Many other scholars have followed up, contributing to a blossoming literature on selection models, structural models and models of treatment effects.
    [Show full text]