Generalized Fiducial Inference for Graded Response Models

Total Page:16

File Type:pdf, Size:1020Kb

Generalized Fiducial Inference for Graded Response Models GENERALIZED FIDUCIAL INFERENCE FOR GRADED RESPONSE MODELS Yang Liu A dissertation submitted to the faculty of the University of North Carolina at Chapel Hill in partial fulfillment of the requirements for the degree of Doctor of Philosophy in the Department of Psychology. Chapel Hill 2015 Approved by: David Thissen Daniel J. Bauer Patrick J. Curran Jan Hannig Andrea Hussong c 2015 Yang Liu ALL RIGHTS RESERVED ii ABSTRACT YANG LIU: GENERALIZED FIDUCIAL INFERENCE FOR GRADED RESPONSE MODELS. (Under the direction of David Thissen) Generalized fiducial inference (GFI) has been proposed as an alternative inferential frame- work in the statistical literature. Inferences of various sorts, such as confidence regions for (possibly transformed) model parameters, making prediction about future observations, and goodness of fit evaluation, can be constructed from a fiducial distribution defined on the parameter space in a fashion similar to those used with a Bayesian posterior. However, no prior distribution needs to be specified. In this work, the general recipe of GFI is applied to the graded response models, which are widely used in psychological and educational stud- ies for analyzing ordered categorical survey questionnaire data. Asymptotic optimality of GFI is established (Chapter 2), and a Markov chain Monte Carlo algorithm is developed for sampling from the resulting fiducial distribution (Chapter 3). The comparative performance of GFI, maximum likelihood and Bayesian approaches is evaluated via Monte Carlo simula- tions (Chapter 4). The use of GFI as a convenient and powerful tool to quantify sampling variability in various inferential procedures is illustrated by an empirical data analysis using the patient-reported emotional distress data (Chapter 5). iii To my dearest wife Flora. iv ACKNOWLEDGMENTS I would like to show my gratitude to all the committee members, especially my academic advisors Drs. David Thissen and Jan Hannig. I could not have finished the work without their valuable feedbacks and advices. The project is sponsored by the Harold Gulliksen Psychometric Research Fellowship from the Educational Testing Service (ETS). I am grateful to my mentors Drs. Shelby Haberman and Yi-Hsuan Lee at ETS. It has been a great experience working under their supervision, and I have been greatly benefited from their expertise in statistics. In addition, I would like to offer my sincere appreciation to the help and support from the current and former members of the Thurstone Lab|especially Brooke, Jim, and Jolynn, and those from Hannig's fiducial lab|Jessi, Qing, Dimitris, Abhishek, Rosie, and Jenny. I also cannot express enough thanks to Drs. Li Cai, Ji Seung Yang, and Scott Monroe for their continued support and encouragement for my work. Finally, to my caring, loving, and supportive wife, Flora: my heartfelt thanks. I deeply appreciate you taking care of most household activities in the past year; it was a great comfort and relief for me to focus on completing my dissertation. Your encouragement during the difficult and painful times are duly noted; I could not have gone through the frustration and depression without your company. v TABLE OF CONTENTS LIST OF TABLES ............................................................................ ix LIST OF FIGURES ........................................................................... x 1 INTRODUCTION ......................................................................... 1 1.1 Overview............................................................................... 1 1.2 The graded response model .......................................................... 2 1.2.1 Point estimation ................................................................ 3 1.2.2 Confidence interval/set ......................................................... 4 1.2.3 Goodness of fit testing ......................................................... 5 1.3 Generalized fiducial inference ........................................................ 7 2 THEORY ................................................................................... 16 2.1 A generalized fiducial distribution for item parameters ............................ 16 2.2 A fiducial Bernstein-von Mises theorem ............................................. 24 2.3 Fiducial predictive inference.......................................................... 27 2.3.1 Consistency ..................................................................... 28 2.3.2 Example: Response pattern scoring ........................................... 30 2.4 Goodness of fit testing with a fiducial predictive check (FPC) .................... 32 2.4.1 The centering approach ........................................................ 33 2.4.2 The partial predictive approach ............................................... 33 2.4.3 Choice of test statistics ........................................................ 35 3 COMPUTATION .......................................................................... 37 3.1 General structure ..................................................................... 37 3.2 Conditional sampling steps ........................................................... 38 ? 3.2.1 Conditional sampling of Aij ................................................... 38 vi ? 3.2.2 Conditional sampling of Zid ................................................... 40 3.3 Updating interior polytopes .......................................................... 42 3.4 Starting values ........................................................................ 45 3.5 Heavy-tailedness and a workaround ................................................. 46 3.6 Computational time................................................................... 48 4 MONTE CARLO SIMULATIONS........................................................ 50 4.1 Unidimensional models: Simulation design.......................................... 50 4.2 Unidimensional models: Parameter recovery........................................ 54 4.3 Unidimensional models: Response pattern scoring ................................. 62 4.4 Unidimensional models: Goodness of fit testing .................................... 72 4.5 Bifactor models: Simulation design .................................................. 85 4.6 Bifactor models: Parameter recovery ................................................ 88 4.7 Conclusion ............................................................................. 93 5 EMPIRICAL EXAMPLE.................................................................. 95 5.1 A unidimensional model .............................................................. 96 5.2 A three-dimensional exploratory model ............................................. 99 5.3 A bifactor model ...................................................................... 101 5.4 Summary .............................................................................. 119 6 DISCUSSION AND CONCLUSION ...................................................... 121 Appendix A BASIC PROPERTIES ......................................................... 123 A.1 Calculating the fiducial density ...................................................... 123 A.2 The invariance property .............................................................. 124 Appendix B A BERNSTEIN-VON MISES THEOREM ................................... 126 Appendix C NON-UNIQUENESS DUE TO SELECTION RULES....................... 137 Appendix D PREDICTIVE INFERENCE .................................................. 149 Appendix E FIDUCIAL PREDICTIVE CHECK .......................................... 150 E.1 Asymptotic covariance with the sample score function............................. 150 vii E.2 Normal approximation of the likelihood ............................................. 152 REFERENCES ................................................................................ 153 viii LIST OF TABLES 3.1 The average CPU time (in seconds) consumed by a single MCMC iteration under different combinations of sample size n, test length m, latent dimensionality r (exploratory model, minimally constrained), and number of categories K (Kj = K for all j) ................................................................................... 49 4.1 Data-generating parameter values for the unidimensional GRM (m = 9) ............ 52 4.2 Data-generating parameter values for the bifactor GRM (m = 9)..................... 87 5.1 PROMIS emotional distress short-form items .......................................... 97 5.2 Fiducial point and interval estimates for item slopes and intercepts under the bifactor GRM ............................................................................. 106 ix LIST OF FIGURES 1.1 The binomial proportion example ...................................................... 12 2.1 Set inverse functions..................................................................... 18 3.1 Trace plot for a slope parameter before and after implementing the workaround... 47 4.1 Empirical coverage and median length of CIs for unidimensional GRM parameters (n = 100, m = 9)......................................................................... 56 4.2 Empirical coverage and median length of CIs for unidimensional GRM parameters (n = 200, m = 9)......................................................................... 57 4.3 Empirical coverage and median length of CIs for unidimensional GRM parameters (n = 500, m = 9)......................................................................... 58 4.4 Empirical coverage and median length
Recommended publications
  • Methods to Calculate Uncertainty in the Estimated Overall Effect Size from a Random-Effects Meta-Analysis
    Methods to calculate uncertainty in the estimated overall effect size from a random-effects meta-analysis Areti Angeliki Veroniki1,2*; Email: [email protected] Dan Jackson3; Email: [email protected] Ralf Bender4; Email: [email protected] Oliver Kuss5,6; Email: [email protected] Dean Langan7; Email: [email protected] Julian PT Higgins8; Email: [email protected] Guido Knapp9; Email: [email protected] Georgia Salanti10; Email: [email protected] 1 Li Ka Shing Knowledge Institute, St. Michael’s Hospital, 209 Victoria Street, East Building. Toronto, Ontario, M5B 1T8, Canada 2 Department of Primary Education, School of Education, University of Ioannina, Ioannina, Greece 3 MRC Biostatistics Unit, Institute of Public Health, Robinson Way, Cambridge CB2 0SR, U.K | downloaded: 28.9.2021 4 Department of Medical Biometry, Institute for Quality and Efficiency in Health Care (IQWiG), Im Mediapark 8, 50670 Cologne, Germany 5 Institute for Biometrics and Epidemiology, German Diabetes Center, Leibniz Institute for Diabetes Research at Heinrich Heine University, 40225 Düsseldorf, Germany 6 Institute of Medical Statistics, Heinrich-Heine-University, Medical Faculty, Düsseldorf, Germany This article has been accepted for publication and undergone full peer review but has not https://doi.org/10.7892/boris.119524 been through the copyediting, typesetting, pagination and proofreading process which may lead to differences between this version and the Version of Record. Please cite this article as doi: 10.1002/jrsm.1319 source: This article is protected by copyright. All rights reserved. 7 Institute of Child Health, UCL, London, WC1E 6BT, UK 8 Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, U.K.
    [Show full text]
  • Fiducial Inference: an Approach Based on Bootstrap Techniques
    U.P.B. Sci. Bull., Series A, Vol. 69, No. 1, 2007 ISSN 1223-7027 FIDUCIAL INFERENCE: AN APPROACH BASED ON BOOTSTRAP TECHNIQUES H.-D. HEIKE1, C-tin TÂRCOLEA2, Adina I. TARCOLEA3, M. DEMETRESCU4 În prima parte a acestei lucrări sunt prezentate conceptele de bază ale inferenţei fiduciale. La aplicarea acestui principiu inferenţial apar dificultăţi când distribuţia variabilei pivotale folosite nu este cunoscută. În partea a doua este propusă o soluţie pentru această problemă constând în folosirea de metode bootstrap. The basic concepts of fiducial inference are presented in the first part of this paper. Problems arise if the form of the distribution of the pivot used by the fiducial argument is not known. Our main result consists in showing how bootstrap techniques can be used to handle this kind of situation. Keywords: Fiducial distribution, estimation of parameters, pivotal methods, approximation of distributions, approximate pivot. AMS 2000 Classification: 62F99 1. Introduction Commenting on Fisher’s work, [3] stated at the beginning of his paper: ’The history of fiducial probability dates back over thirty years and so is long by statistical standards; however, thirty years have not proved long enough for agreement to be reached among statisticians as to the derivation, manipulation and interpretation of fiducial probability’. Forty years later, the situation is very much the same. What is actually the fiducial theory? The fiducial model designed by Fisher leads us to objective probability statements about values of unknown parameters solely on the basis of given data, without resorting to any prior distribution. The idea that the initially assumed randomness may be preserved was one of Fisher’s great contributions to the theory of statistical estimation; the lack of acceptance arose from the fact that 1 Prof., Statistics and Econometrics, Technical University Darmstadt, Residenzschloss, D-64289 Darmstadt, GERMANY.
    [Show full text]
  • The Significance Test Controversy Revisited: the Fiducial Bayesian
    Bruno LECOUTRE Jacques POITEVINEAU The Significance Test Controversy Revisited: The fiducial Bayesian Alternative July 26, 2014 Springer Contents 1 Introduction ................................................... 3 1.1 The fiducial Bayesian Inference. .4 1.2 The Stranglehold of Significance Tests . .5 1.3 Beyond the Significance Test Controversy . .6 1.4 The Feasibility of Fiducial Bayesian Methods . .6 1.5 Plan of the Book . .7 2 Preamble - Frequentist and Bayesian Inference .................... 9 2.1 Two Different Approaches to Statistical Inference . .9 2.2 The Frequentist Approach: From Unknown to Known . 10 2.2.1 Sampling Probabilities . 10 2.2.2 Null Hypothesis Significance Testing in Practice . 11 2.2.3 Confidence Interval . 12 2.3 The Bayesian Approach: From Known to Unknown . 12 2.3.1 The Likelihood Function and the Bayesian Probabilities . 12 2.3.2 An Opinion-Based Analysis . 14 2.3.3 A “No Information Initially” Analysis . 16 3 The Fisher, Neyman-Pearson and Jeffreys views of Statistical Tests . 21 3.1 The Fisher Test of Significance . 21 3.2 The Neyman-Pearson Hypothesis Test . 23 3.3 The Jeffreys Bayesian Approach to Testing . 25 3.4 Different Views of Statistical Inference . 28 3.4.1 Different Scopes of Applications: The Aim of Statistical Inference . 28 3.4.2 The Role of Bayesian Probabilities . 30 3.4.3 Statistical Tests: Judgment, Action or Decision? . 32 3.5 Is It possible to Unify the Fisher and Neyman-Pearson Approaches? 34 3.6 Concluding Remarks . 35 v vi Contents 4 GHOST: An officially Recommended Practice ..................... 37 4.1 Null Hypothesis Significance Testing . 37 4.1.1 An Amalgam .
    [Show full text]
  • Principles of Statistical Inference
    Principles of Statistical Inference In this important book, D. R. Cox develops the key concepts of the theory of statistical inference, in particular describing and comparing the main ideas and controversies over foundational issues that have rumbled on for more than 200 years. Continuing a 60-year career of contribution to statistical thought, Professor Cox is ideally placed to give the comprehensive, balanced account of the field that is now needed. The careful comparison of frequentist and Bayesian approaches to inference allows readers to form their own opinion of the advantages and disadvantages. Two appendices give a brief historical overview and the author’s more personal assessment of the merits of different ideas. The content ranges from the traditional to the contemporary. While specific applications are not treated, the book is strongly motivated by applications across the sciences and associated technologies. The underlying mathematics is kept as elementary as feasible, though some previous knowledge of statistics is assumed. This book is for every serious user or student of statistics – in particular, for anyone wanting to understand the uncertainty inherent in conclusions from statistical analyses. Principles of Statistical Inference D.R. COX Nuffield College, Oxford CAMBRIDGE UNIVERSITY PRESS Cambridge, New York, Melbourne, Madrid, Cape Town, Singapore, São Paulo Cambridge University Press The Edinburgh Building, Cambridge CB2 8RU, UK Published in the United States of America by Cambridge University Press, New York www.cambridge.org Information on this title: www.cambridge.org/9780521866736 © D. R. Cox 2006 This publication is in copyright. Subject to statutory exception and to the provision of relevant collective licensing agreements, no reproduction of any part may take place without the written permission of Cambridge University Press.
    [Show full text]
  • Confidence Distribution
    Confidence Distribution Xie and Singh (2013): Confidence distribution, the frequentist distribution estimator of a parameter: A Review Céline Cunen, 15/09/2014 Outline of Article ● Introduction ● The concept of Confidence Distribution (CD) ● A classical Definition and the History of the CD Concept ● A modern definition and interpretation ● Illustrative examples ● Basic parametric examples ● Significant (p-value) functions ● Bootstrap distributions ● Likelihood functions ● Asymptotically third-order accurate confidence distributions ● CD, Bootstrap, Fiducial and Bayesian approaches ● CD-random variable, Bootstrap estimator and fiducial-less interpretation ● CD, fiducial distribution and Belief function ● CD and Bayesian inference ● Inferences using a CD ● Confidence Interval ● Point estimation ● Hypothesis testing ● Optimality (comparison) of CDs ● Combining CDs from independent sources ● Combination of CDs and a unified framework for Meta-Analysis ● Incorporation of Expert opinions in clinical trials ● CD-based new methodologies, examples and applications ● CD-based likelihood caluculations ● Confidence curve ● CD-based simulation methods ● Additional examples and applications of CD-developments ● Summary CD: a sample-dependent distribution that can represent confidence intervals of all levels for a parameter of interest ● CD: a broad concept = covers all approaches that can build confidence intervals at all levels Interpretation ● A distribution on the parameter space ● A Distribution estimator = contains information for many types of
    [Show full text]
  • Fisher's Fiducial Argument and Bayes Theorem
    R. A. Fisher's Fiducial Argument and Bayes' Theorem Teddy Seidenfeld Statistical Science, Vol. 7, No. 3. (Aug., 1992), pp. 358-368. Stable URL: http://links.jstor.org/sici?sici=0883-4237%28199208%297%3A3%3C358%3ARAFFAA%3E2.0.CO%3B2-G Statistical Science is currently published by Institute of Mathematical Statistics. Your use of the JSTOR archive indicates your acceptance of JSTOR's Terms and Conditions of Use, available at http://www.jstor.org/about/terms.html. JSTOR's Terms and Conditions of Use provides, in part, that unless you have obtained prior permission, you may not download an entire issue of a journal or multiple copies of articles, and you may use content in the JSTOR archive only for your personal, non-commercial use. Please contact the publisher regarding any further use of this work. Publisher contact information may be obtained at http://www.jstor.org/journals/ims.html. Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission. The JSTOR Archive is a trusted digital repository providing for long-term preservation and access to leading academic journals and scholarly literature from around the world. The Archive is supported by libraries, scholarly societies, publishers, and foundations. It is an initiative of JSTOR, a not-for-profit organization with a mission to help the scholarly community take advantage of advances in technology. For more information regarding JSTOR, please contact [email protected]. http://www.jstor.org Tue Mar 4 10:35:54 2008 Statistical Science 1992, Vol.
    [Show full text]
  • Improper Priors & Fiducial Inference
    IMPROPER PRIORS & FIDUCIAL INFERENCE Gunnar Taraldsen & Bo Henry Lindqvist Norwegian University of Science and Technology The 4th Bayesian, Fiducial and Frequentist Workshop Harvard University May 2017 1 / 22 Abstract The use of improper priors flourish in applications and is as sucha central part of contemporary statistics. Unfortunately, this is most often presented without a theoretical basis: “Improper priors are just limits of proper priors ... ” We present ingredients in a mathematical theory for statistics which generalize the axioms of Kolmogorov so that improper priors are included. A particular by-product is an elimination of the famous marginalization paradoxes in Bayesian and structural inference. Secondly, we demonstrate that structural and fiducial inference can be formulated naturally in this theory of conditional probability spaces. A particular by-product is then a proof of conditions which ensure coincidence between a Bayesian posterior and the fiducial distribution. The concept of a conditional fiducial model is introduced, and the interpretation of the fiducial distribution is discussed. It isin particular explained that the information given by the prior distribution in Bayesian analysis is replaced by the information given by the fiducial relation in fiducial inference. 2 / 22 Table of Contents Introduction Statistics with improper priors Fiducial inference 3 / 22 The Large Picture 4 / 22 A motivating problem gives it all I Initial problem: Generate data X = χ(U; θ) conditionally given a sufficient statistic T = τ(U; θ) = t. I Tentative solution: Adjust parameter value θ for simulated data so that the sufficient statistic is kept fixed equal to t (Trotter-Tukey, 1956; Engen-Lillegård, Biometrika 1997).
    [Show full text]
  • GENERALIZED FIDUCIAL INFERENCE for LOGISTIC GRADED RESPONSE MODELS Yang Liu
    psychometrika doi: 10.1007/s11336-017-9554-0 GENERALIZED FIDUCIAL INFERENCE FOR LOGISTIC GRADED RESPONSE MODELS Yang Liu UNIVERSITY OF CALIFORNIA, MERCED Jan Hannig THE UNIVERSITY OF NORTH CAROLINA, CHAPEL HILL Samejima’s graded response model (GRM) has gained popularity in the analyses of ordinal response data in psychological, educational, and health-related assessment. Obtaining high-quality point and interval estimates for GRM parameters attracts a great deal of attention in the literature. In the current work, we derive generalized fiducial inference (GFI) for a family of multidimensional graded response model, implement a Gibbs sampler to perform fiducial estimation, and compare its finite-sample performance with several commonly used likelihood-based and Bayesian approaches via three simulation studies. It is found that the proposed method is able to yield reliable inference even in the presence of small sample size and extreme generating parameter values, outperforming the other candidate methods under investigation. The use of GFI as a convenient tool to quantify sampling variability in various inferential procedures is illustrated by an empirical data analysis using the patient-reported emotional distress data. Key words: generalized fiducial inference, confidence interval, Markov chain Monte Carlo, Bernstein–von Mises theorem, item response theory, graded response model, bifactor model. 1. Introduction Ordinal rating scales frequently appear in psychological, educational, and health-related measurement. For instance, Likert-type items are routinely used to elicit responses to, e.g., the degree to which a statement can be concurred with, the frequency of substance use, or the severity of a disease’s interference on daily activities, etc. Another example, more common in proficiency assessments, is assigning partial credits to constructed responses that agree only in part with the answer key, or that reflect progressive levels of mastery.
    [Show full text]
  • Should the Widest Cleft in Statistics - How and Why Fisher Oppos Ed Neyman and Pearson
    School of Economics and Management TECHNICAL UNIVERSITY OF LISBON Department of Economics Carlos Pestana Barros & Nicolas Peypoch Francisco Louçã A Comparative Analysis of Productivity Change in Italian and Portuguese Airports Should The Widest Cleft in Statistics - How and Why Fisher oppos ed Neyman and Pearson WP 02/2008/DE/UECE WP 006/2007/DE _________________________________________________________ WORKING PAPERS ISSN Nº 0874-4548 The Widest Cleft in Statistics - How and Why Fisher opposed Neyman and Pearson Francisco Louçã (UECE-ISEG, UTL, Lisbon) [email protected] Abstract The paper investigates the “widest cleft”, as Savage put it, between frequencists in the foundation of modern statistics: that opposing R.A. Fisher to Jerzy Neyman and Egon Pearson. Apart from deep personal confrontation through their lives, these scientists could not agree on methodology, on definitions, on concepts and on tools. Their premises and their conclusions widely differed and the two groups they inspired ferociously opposed in all arenas of scientific debate. As the abyss widened, with rare exceptions economists remained innocent of this confrontation. The introduction of probability in economics occurred in fact after these ravaging battles began, even if they were not as public as they became in the 1950s. In any case, when Haavelmo, in the 1940s, suggested a reinterpretation of economics according to the probability concepts, he chose sides and inscribed his concepts in the Neyman-Pearson tradition. But the majority of the profession indifferently used tools developed by each of the opposed groups of statisticians, and many puzzled economists chose to ignore the debate. Economics became, as a consequence, one of the experimental fields for “hybridization”, a synthesis between Fisherian and Neyman-Pearsonian precepts, defined as a number of practical proceedings for statistical testing and inference that were developed notwithstanding the original authors, as an eventual convergence between what they considered to be radically irreconcilable.
    [Show full text]
  • Generalized Fiducial Inference for Normal Linear Mixed Models
    The Annals of Statistics 2012, Vol. 40, No. 4, 2102–2127 DOI: 10.1214/12-AOS1030 c Institute of Mathematical Statistics, 2012 GENERALIZED FIDUCIAL INFERENCE FOR NORMAL LINEAR MIXED MODELS By Jessi Cisewski1 and Jan Hannig Carnegie Mellon University and University of North Carolina at Chapel Hill While linear mixed modeling methods are foundational concepts introduced in any statistical education, adequate general methods for interval estimation involving models with more than a few vari- ance components are lacking, especially in the unbalanced setting. Generalized fiducial inference provides a possible framework that ac- commodates this absence of methodology. Under the fabric of gener- alized fiducial inference along with sequential Monte Carlo methods, we present an approach for interval estimation for both balanced and unbalanced Gaussian linear mixed models. We compare the pro- posed method to classical and Bayesian results in the literature in a simulation study of two-fold nested models and two-factor crossed designs with an interaction term. The proposed method is found to be competitive or better when evaluated based on frequentist crite- ria of empirical coverage and average length of confidence intervals for small sample sizes. A MATLAB implementation of the proposed algorithm is available from the authors. 1. Introduction. Inference on parameters of normal linear mixed models has an extensive history; see Khuri and Sahai (1985) for a survey of vari- ance component methodology, or Chapter 2 of Searle, Casella and McCul- loch (1992) for a summary. There are many inference methods for variance components such as ANOVA-based methods [Burdick and Graybill (1992), Hernandez, Burdick and Birch (1992), Hernandez and Burdick (1993), Je- arXiv:1211.1208v1 [stat.ME] 6 Nov 2012 yaratnam and Graybill (1980)], maximum likelihood estimation (MLE) and restricted maximum likelihood (REML) [Hartley and Rao (1967), Searle, Casella and McCulloch (1992)] along with Bayesian methods [Gelman (2006), Received September 2011; revised July 2012.
    [Show full text]
  • The P-Value Function and Statistical Inference
    The American Statistician ISSN: 0003-1305 (Print) 1537-2731 (Online) Journal homepage: https://www.tandfonline.com/loi/utas20 The p-value Function and Statistical Inference D. A. S. Fraser To cite this article: D. A. S. Fraser (2019) The p-value Function and Statistical Inference, The American Statistician, 73:sup1, 135-147, DOI: 10.1080/00031305.2018.1556735 To link to this article: https://doi.org/10.1080/00031305.2018.1556735 © 2019 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group. Published online: 20 Mar 2019. Submit your article to this journal Article views: 1983 View Crossmark data Citing articles: 1 View citing articles Full Terms & Conditions of access and use can be found at https://www.tandfonline.com/action/journalInformation?journalCode=utas20 THE AMERICAN STATISTICIAN 2019, VOL. 73, NO. S1, 135–147: Statistical Inference in the 21st Century https://doi.org/10.1080/00031305.2019.1556735 The p-value Function and Statistical Inference D. A. S. Fraser Department of Statistical Sciences, University of Toronto, Toronto, Canada ABSTRACT ARTICLE HISTORY This article has two objectives. The first and narrower is to formalize the p-value function, which records Received March 2018 all possible p-values, each corresponding to a value for whatever the scalar parameter of interest is for the Revised November 2018 problem at hand, and to show how this p-value function directly provides full inference information for any corresponding user or scientist. The p-value function provides familiar inference objects: significance KEYWORDS Accept–Reject; Ancillarity; levels, confidence intervals, critical values for fixed-level tests, and the power function at all values ofthe Box–Cox; Conditioning; parameter of interest.
    [Show full text]
  • Statistical Inference from Data to Simple Hypotheses
    Statistical inference from data to simple hypotheses Jason Grossman Australian National University DRAFT 2011 Please don't cite this in its current form without permission. I am impressed also, apart from prefabricated examples of black and white balls in an urn, with how baffling the problem has always been of arriving at any explicit theory of the empirical confirmation of a synthetic statement. (Quine 1980, pp. 41–42) Typeset in Belle 12/19 using Plain TEX. CONTENTS Front Matter .......................i Chapter 1. Prologue ....................1 1. Evaluating inference procedures . 3 One option: Frequentism . 5 Another option: factualism . 6 Statistical inference is in trouble . 8 2. A simple example . 9 3. What this book will show . 16 4. Why philosophers need to read this book . 21 PART I: THE STATE OF PLAY IN STATISTICAL INFERENCE Chapter 2. Definitions and Axioms . 27 1. Introduction . 27 2. The scope of this book . 28 Four big caveats . 29 Hypotheses . 32 Theories of theory change . 34 3. Basic notation . 37 An objection to using X . 39 Non-parametric statistics . 41 4. Conditional probability as primitive . 43 5. Exchangeability and multisets . 45 i Exchangeability . 46 Multisets . 50 6. Merriment . 54 7. Jeffrey conditioning . 57 8. The words “Bayesian” and “Frequentist” . 59 9. Other preliminary considerations . 64 Chapter 3. Catalogue I: Bayesianism . 67 1. Introduction . 67 2. Bayesianism in general . 73 Bayesian confirmation theory . 79 3. Subjective Bayesianism . 83 The uniqueness property of Subjective Bayesianism . 85 4. Objective Bayesianism . 86 Restricted Bayesianism . 87 Empirical Bayesianism . 88 Conjugate Ignorance Priors I: Jeffreys . 90 Conjugate Ignorance Priors II: Jaynes . 96 Robust Bayesianism .
    [Show full text]