Statistical Inference: the Big Picture1 Robert E

Total Page:16

File Type:pdf, Size:1020Kb

Statistical Inference: the Big Picture1 Robert E Statistical Science 2011, Vol. 26, No. 1, 1–9 DOI: 10.1214/10-STS337 © Institute of Mathematical Statistics, 2011 Statistical Inference: The Big Picture1 Robert E. Kass Abstract. Statistics has moved beyond the frequentist-Bayesian controver- sies of the past. Where does this leave our ability to interpret results? I suggest that a philosophy compatible with statistical practice, labeled here statisti- cal pragmatism, serves as a foundation for inference. Statistical pragmatism is inclusive and emphasizes the assumptions that connect statistical models with observed data. I argue that introductory courses often mischaracterize the process of statistical inference and I propose an alternative “big picture” depiction. Key words and phrases: Bayesian, confidence, frequentist, statistical edu- cation, statistical pragmatism, statistical significance. 1. INTRODUCTION ple, these caricatures forfeit an opportunity to articulate a fundamental attitude of statistical practice. The protracted battle for the foundations of statis- Most modern practitioners have, I think, an open- tics, joined vociferously by Fisher, Jeffreys, Neyman, minded view about alternative modes of inference, but Savage and many disciples, has been deeply illuminat- are acutely aware of theoretical assumptions and the ing, but it has left statistics without a philosophy that matches contemporary attitudes. Because each camp many ways they may be mistaken. I would suggest that took as its goal exclusive ownership of inference, each it makes more sense to place in the center of our log- was doomed to failure. We have all, or nearly all, ical framework the match or mismatch of theoretical moved past these old debates, yet our textbook expla- assumptions with the real world of data. This, it seems nations have not caught up with the eclecticism of sta- to me, is the common ground that Bayesian and fre- tistical practice. quentist statistics share; it is more fundamental than ei- The difficulties go both ways. Bayesians have de- ther paradigm taken separately; and as we strive to fos- nied the utility of confidence and statistical signifi- ter widespread understanding of statistical reasoning, it cance, attempting to sweep aside the obvious success is more important for beginning students to appreciate of these concepts in applied work. Meanwhile, for their the role of theoretical assumptions than for them to re- part, frequentists have ignored the possibility of infer- cite correctly the long-run interpretation of confidence ence about unique events despite their ubiquitous oc- intervals. With the hope of prodding our discipline to currence throughout science. Furthermore, interpreta- right a lingering imbalance, I attempt here to describe tions of posterior probability in terms of subjective be- the dominant contemporary philosophy of statistics. lief, or confidence in terms of long-run frequency, give students a limited and sometimes confusing view of the 2. STATISTICAL PRAGMATISM nature of statistical inference. When used to introduce the expression of uncertainty based on a random sam- I propose to call this modern philosophy statistical pragmatism. I think it is based on the following atti- tudes: Robert E. Kass is Professor, Department of Statistics, Center for the Neural Basis of Cognition and Machine 1. Confidence, statistical significance, and posterior Learning Department, Carnegie Mellon University, probability are all valuable inferential tools. Pittsburgh, Pennsylvania 15213, USA (e-mail: 2. Simple chance situations, where counting argu- [email protected]). 1Discussed in 1012.14/11-STS337C, 1012.14/11-STS337A, ments may be based on symmetries that generate 1012.14/11-STS337D and 1012.14/11-STS337B; rejoinder at equally likely outcomes (six faces on a fair die; 52 1012.14/11-STS337REJ. cards in a shuffled deck), supply basic intuitions 1 2 R. E. KASS about probability. Probability may be built up to im- portant but less immediately intuitive situations us- ing abstract mathematics, much the way real num- bers are defined abstractly based on intuitions com- ing from fractions. Probability is usefully calibrated in terms of fair bets: another way to say the proba- bility of rolling a 3 with a fair die is 1/6isthat5to 1 odds against rolling a 3 would be a fair bet. 3. Long-run frequencies are important mathemati- FIG.1. The big picture of statistical inference. Statistical pro- cally, interpretively, and pedagogically. However, it cedures are abstractly defined in terms of mathematics but are used, in conjunction with scientific models and methods, to explain is possible to assign probabilities to unique events, observable phenomena. This picture emphasizes the hypothetical including rolling a 3 with a fair die or having a con- link between variation in data and its description using statistical fidence interval cover the true mean, without con- models. sidering long-run frequency. Long-run frequencies may be regarded as consequences of the law of large between models and data is a core component of both numbers rather than as part of the definition of prob- the art of statistical practice and the science of statis- ability or confidence. tical methodology. The purpose of Figure 1 is to shift 4. Similarly, the subjective interpretation of posterior the grounds for discussion. probability is important as a way of understanding Note, in particular, that data should not be confused Bayesian inference, but it is not fundamental to its with random variables. Random variables live in the use: in reporting a 95% posterior interval one need theoretical world. When we say things like, “Let us as- not make a statement such as, “My personal proba- sume the data are normally distributed” and we pro- bility of this interval covering the mean is 0.95.” ceed to make a statistical inference, we do not need 5. Statistical inferences of all kinds use statistical to take these words literally as asserting that the data models, which embody theoretical assumptions. As form a random sample. Instead, this kind of language illustrated in Figure 1, like scientific models, sta- is a convenient and familiar shorthand for the much tistical models exist in an abstract framework; to weaker assertion that, for our specified purposes, the distinguish this framework from the real world in- variability of the data is adequately consistent with habited by data we may call it a “theoretical world.” variability that would occur in a random sample. This Random variables, confidence intervals, and poste- linguistic amenity is used routinely in both frequentist rior probabilities all live in this theoretical world. and Bayesian frameworks. Historically, the distinction When we use a statistical model to make a statisti- between data and random variables, the match of the cal inference we implicitly assert that the variation model to the data, was set aside, to be treated as a exhibited by data is captured reasonably well by separate topic apart from the foundations of inference. the statistical model, so that the theoretical world But once the data themselves were considered random corresponds reasonably well to the real world. Con- variables, the frequentist-Bayesian debate moved into clusions are drawn by applying a statistical infer- the theoretical world: it became a debate about the ence technique, which is a theoretical construct, to best way to reason from random variables to inferences some real data. Figure 1 depicts the conclusions as about parameters. This was consistent with develop- straddling the theoretical and real worlds. Statisti- ments elsewhere. In other parts of science, the distinc- cal inferences may have implications for the real tion between quantities to be measured and their the- world of new observable phenomena, but in scien- oretical counterparts within a mathematical theory can tific contexts, conclusions most often concern scien- be relegated to a different subject—to a theory of er- tific models (or theories), so that their “real world” rors. In statistics, we do not have that luxury, and it implications (involving new data) are somewhat in- seems to me important, from a pragmatic viewpoint, to direct (the new data will involve new and different bring to center stage the identification of models with experiments). data. The purpose of doing so is that it provides dif- The statistical models in Figure 1 could involve large ferent interpretations of both frequentist and Bayesian function spaces or other relatively weak probabilistic inference, interpretations which, I believe, are closer to assumptions. Careful consideration of the connection the attitude of modern statistical practitioners. STATISTICAL INFERENCE 3 (A) (B) FIG.2. (A)BARS fits to a pair of peri-stimulus time histograms displaying neural firing rate of a particular neuron under two alternative experimental conditions.(B)The two BARS fits are overlaid for ease of comparison. A familiar practical situation where these issues arise monkey. The data came from a study ultimately re- is binary regression. A classic example comes from ported by Rollenhagen and Olson (2005), which in- a psychophysical experiment conducted by Hecht, vestigated the differential response of individual neu- Schlaer and Pirenne (1942), who investigated the sen- rons under two experimental conditions. Figure 2 dis- sitivity of the human visual system by constructing an plays BARS fits under the two conditions. One way apparatus that would emit flashes of light at very low to quantify the discrepancy between the fits is to esti- intensity in a darkened room. Those authors presented mate the drop in firing rate from peak (the maximal fir- light of varying intensities repeatedly to several sub- ing rate) to the trough immediately following the peak jects and determined, for each intensity, the proportion in each condition. Let us call these peak minus trough of times each subject would respond that he or she had differences, under the two conditions, φ1 and φ2.Us- seen a flash of light. For each subject the resulting data ing BARS, DiMatteo, Genovese and Kass reported a are repeated binary observations (“yes” perceived ver- posterior mean of φˆ1 − φˆ2 = 50.0 with posterior stan- sus “no” did not perceive) at each of many intensities dard deviation (±20.8).
Recommended publications
  • The Interpretation of Probability: Still an Open Issue? 1
    philosophies Article The Interpretation of Probability: Still an Open Issue? 1 Maria Carla Galavotti Department of Philosophy and Communication, University of Bologna, Via Zamboni 38, 40126 Bologna, Italy; [email protected] Received: 19 July 2017; Accepted: 19 August 2017; Published: 29 August 2017 Abstract: Probability as understood today, namely as a quantitative notion expressible by means of a function ranging in the interval between 0–1, took shape in the mid-17th century, and presents both a mathematical and a philosophical aspect. Of these two sides, the second is by far the most controversial, and fuels a heated debate, still ongoing. After a short historical sketch of the birth and developments of probability, its major interpretations are outlined, by referring to the work of their most prominent representatives. The final section addresses the question of whether any of such interpretations can presently be considered predominant, which is answered in the negative. Keywords: probability; classical theory; frequentism; logicism; subjectivism; propensity 1. A Long Story Made Short Probability, taken as a quantitative notion whose value ranges in the interval between 0 and 1, emerged around the middle of the 17th century thanks to the work of two leading French mathematicians: Blaise Pascal and Pierre Fermat. According to a well-known anecdote: “a problem about games of chance proposed to an austere Jansenist by a man of the world was the origin of the calculus of probabilities”2. The ‘man of the world’ was the French gentleman Chevalier de Méré, a conspicuous figure at the court of Louis XIV, who asked Pascal—the ‘austere Jansenist’—the solution to some questions regarding gambling, such as how many dice tosses are needed to have a fair chance to obtain a double-six, or how the players should divide the stakes if a game is interrupted.
    [Show full text]
  • Bayesian Inference in Linguistic Analysis Stephany Palmer 1
    Bayesian Inference in Linguistic Analysis Stephany Palmer 1 | Introduction: Besides the dominant theory of probability, Frequentist inference, there exists a different interpretation of probability. This other interpretation, Bayesian inference, factors in prior knowledge of the situation of interest and lends itself to allow strongly worded conclusions from the sample data. Although having not been widely used until recently due to technological advancements relieving the computationally heavy nature of Bayesian probability, it can be found in linguistic studies amongst others to update or sustain long-held hypotheses. 2 | Data/Information Analysis: Before performing a study, one should ask themself what they wish to get out of it. After collecting the data, what does one do with it? You can summarize your sample with descriptive statistics like mean, variance, and standard deviation. However, if you want to use the sample data to make inferences about the population, that involves inferential statistics. In inferential statistics, probability is used to make conclusions about the situation of interest. 3 | Frequentist vs Bayesian: The classical and dominant school of thought is frequentist probability, which is what I have been taught throughout my years learning statistics, and I assume is the same throughout the American education system. In taking a sample, one is essentially trying to estimate an unknown parameter. When interpreting the result through a confidence interval (CI), it is not allowed to interpret that the actual parameter lies within the CI, so you have to circumvent and say that “we are 95% confident that the actual value of the unknown parameter lies within the CI, that in infinite repeated trials, 95% of the CI’s will contain the true value, and 5% will not.” Bayesian probability says that only the data is real, and as the unknown parameter is abstract, thus some potential values of it are more plausible than others.
    [Show full text]
  • The Bayesian Approach to Statistics
    THE BAYESIAN APPROACH TO STATISTICS ANTHONY O’HAGAN INTRODUCTION the true nature of scientific reasoning. The fi- nal section addresses various features of modern By far the most widely taught and used statisti- Bayesian methods that provide some explanation for the rapid increase in their adoption since the cal methods in practice are those of the frequen- 1980s. tist school. The ideas of frequentist inference, as set out in Chapter 5 of this book, rest on the frequency definition of probability (Chapter 2), BAYESIAN INFERENCE and were developed in the first half of the 20th century. This chapter concerns a radically differ- We first present the basic procedures of Bayesian ent approach to statistics, the Bayesian approach, inference. which depends instead on the subjective defini- tion of probability (Chapter 3). In some respects, Bayesian methods are older than frequentist ones, Bayes’s Theorem and the Nature of Learning having been the basis of very early statistical rea- Bayesian inference is a process of learning soning as far back as the 18th century. Bayesian from data. To give substance to this statement, statistics as it is now understood, however, dates we need to identify who is doing the learning and back to the 1950s, with subsequent development what they are learning about. in the second half of the 20th century. Over that time, the Bayesian approach has steadily gained Terms and Notation ground, and is now recognized as a legitimate al- ternative to the frequentist approach. The person doing the learning is an individual This chapter is organized into three sections.
    [Show full text]
  • Statistical Science Meets Philosophy of Science Part 2: Shallow Versus Deep Explorations
    RMM Vol. 3, 2012, 71–107 Special Topic: Statistical Science and Philosophy of Science Edited by Deborah G. Mayo, Aris Spanos and Kent W. Staley http://www.rmm-journal.de/ Deborah Mayo Statistical Science Meets Philosophy of Science Part 2: Shallow versus Deep Explorations Abstract: Inability to clearly defend against the criticisms of frequentist methods has turned many a frequentist away from venturing into foundational battlegrounds. Conceding the dis- torted perspectives drawn from overly literal and radical expositions of what Fisher, Ney- man, and Pearson ‘really thought’, some deny they matter to current practice. The goal of this paper is not merely to call attention to the howlers that pass as legitimate crit- icisms of frequentist error statistics, but also to sketch the main lines of an alternative statistical philosophy within which to better articulate the roles and value of frequentist tools. 1. Comedy Hour at the Bayesian Retreat Overheard at the comedy hour at the Bayesian retreat: Did you hear the one about the frequentist. “who defended the reliability of his radiation reading, despite using a broken radiometer, on the grounds that most of the time he uses one that works, so on average he’s pretty reliable?” or “who claimed that observing ‘heads’ on a biased coin that lands heads with probability .05 is evidence of a statistically significant improve- ment over the standard treatment of diabetes, on the grounds that such an event occurs with low probability (.05)?” Such jests may work for an after-dinner laugh, but if it turns out that, despite being retreads of ‘straw-men’ fallacies, they form the basis of why some statis- ticians and philosophers reject frequentist methods, then they are not such a laughing matter.
    [Show full text]
  • Hypothesis Testing and the Boundaries Between Statistics and Machine Learning
    Hypothesis Testing and the boundaries between Statistics and Machine Learning The Data Science and Decisions Lab, UCLA April 2016 Statistics vs Machine Learning: Two Cultures Statistical Inference vs Statistical Learning Statistical inference tries to draw conclusions on the data generation model Data Generation X Process Y Statistical learning just tries to predict Y The Data Science and Decisions Lab, UCLA 2 Statistics vs Machine Learning: Two Cultures Leo Breiman, “Statistical Modeling: The Two Cultures,” Statistical Science, 2001 The Data Science and Decisions Lab, UCLA 3 Descriptive vs. Inferential Statistics • Descriptive statistics: describing a data sample (sample size, demographics, mean and median tendencies, etc) without drawing conclusions on the population. • Inferential (inductive) statistics: using the data sample to draw conclusions about the population (conclusion still entail uncertainty = need measures of significance) • Statistical Hypothesis testing is an inferential statistics approach but involves descriptive statistics as well! The Data Science and Decisions Lab, UCLA 4 Statistical Inference problems • Inferential Statistics problems: o Point estimation Infer population Population properties o Interval estimation o Classification and clustering Sample o Rejecting hypotheses o Selecting models The Data Science and Decisions Lab, UCLA 5 Frequentist vs. Bayesian Inference • Frequentist inference: • Key idea: Objective interpretation of probability - any given experiment can be considered as one of an infinite sequence of possible repetitions of the same experiment, each capable of producing statistically independent results. • Require that the correct conclusion should be drawn with a given (high) probability among this set of experiments. The Data Science and Decisions Lab, UCLA 6 Frequentist vs. Bayesian Inference • Frequentist inference: Population Data set 1 Data set 2 Data set N .
    [Show full text]
  • Statistical Inference: Paradigms and Controversies in Historic Perspective
    Jostein Lillestøl, NHH 2014 Statistical inference: Paradigms and controversies in historic perspective 1. Five paradigms We will cover the following five lines of thought: 1. Early Bayesian inference and its revival Inverse probability – Non-informative priors – “Objective” Bayes (1763), Laplace (1774), Jeffreys (1931), Bernardo (1975) 2. Fisherian inference Evidence oriented – Likelihood – Fisher information - Necessity Fisher (1921 and later) 3. Neyman- Pearson inference Action oriented – Frequentist/Sample space – Objective Neyman (1933, 1937), Pearson (1933), Wald (1939), Lehmann (1950 and later) 4. Neo - Bayesian inference Coherent decisions - Subjective/personal De Finetti (1937), Savage (1951), Lindley (1953) 5. Likelihood inference Evidence based – likelihood profiles – likelihood ratios Barnard (1949), Birnbaum (1962), Edwards (1972) Classical inference as it has been practiced since the 1950’s is really none of these in its pure form. It is more like a pragmatic mix of 2 and 3, in particular with respect to testing of significance, pretending to be both action and evidence oriented, which is hard to fulfill in a consistent manner. To keep our minds on track we do not single out this as a separate paradigm, but will discuss this at the end. A main concern through the history of statistical inference has been to establish a sound scientific framework for the analysis of sampled data. Concepts were initially often vague and disputed, but even after their clarification, various schools of thought have at times been in strong opposition to each other. When we try to describe the approaches here, we will use the notions of today. All five paradigms of statistical inference are based on modeling the observed data x given some parameter or “state of the world” , which essentially corresponds to stating the conditional distribution f(x|(or making some assumptions about it).
    [Show full text]
  • Frequentist Interpretation of Probability
    Frequentist Inference basis the data can then be used to select d so as to Symposium of Mathematics, Statistics, and Probability. Uni­ obtain a desirable balance between these two aspects. versity of California Press, Vol. I, pp. 187-95 Criteria for selecting d (and making analogous choices Tukey J W 1960 Conclusions versus decisions. Technology 2: within a family of proposed models in other problems) 423- 33 von Mises R 1928 Wahrscheinlichkeit, Statistik and Wahrheit. have been developed by Akaike, Mallows, Schwarz, Springer, Wien, Austria and others (for more details, see, for example, Linhart WaldA 1950 Statistical Decision Functions. Wiley, New York and Zucchini 1986): Wilson E B 1952 An Introduction to Scientific Research. McGraw Bayesian solutions to this problem differ in a Hill, New York number of ways, reflecting whether we assume that the true model belongs to the hypothesized class of models P. J. Bickel and E. L. Lehmann (e.g., is really a polynomial) or can merely be approxi­ mated arbitrarily closely by such models. For more on this topic see Shao (1997). See also: Estimation: Point and Interval; Robustness in Statistics. Frequentist Interpretation of Probability If the outcome of an event is observed in a large number of independent repetitions of the event under Bibliography roughly the same conditions, then it is a fact of the real Bickel P, Doksum K 2000 Mathematical Statistics, 2nd edn. world that the frequency of the outcome stabilizes as Prentice Hall, New York, Vol. I the number of repetitions increases. If another long Blackwell D, Dubins L 1962 Merging of opinions with increasing sequence of such events is observed, the frequency of information.
    [Show full text]
  • The Role of Randomization in Bayesian and Frequentist Design of Clinical Trial Berchialla Paola1, Gregori Dario2, Baldi Ileana2
    The Role of Randomization in Bayesian and Frequentist Design of Clinical Trial Berchialla Paola1, Gregori Dario2, Baldi Ileana2 1Department of Clinical and Biological Sciences, University of Torino, Via Santena 5bis, 10126, Torino, Italy 2Unit of Biostatistics, Epidemiology and Public Health, Department of Cardiac, Thoracic and Vascular Sciences, University of Padova, Via Loredan 18, 35131, Padova, Italy Aknowledgements This research was supported by the University of Torino, Grant no. BERP_RILO_17_01. Corresponding author Paola Berchialla, PhD Dept. of Clinical and Biological Sciences University of Torino Via Santena 5 bis 10126 Torino e-mail: [email protected] 1 Abstract A key role in inference is played by randomization, which has been extensively used in clinical trials designs. Randomization is primarily intended to prevent the source of bias in treatment allocation by producing comparable groups. In the frequentist framework of inference, randomization allows also for the use of probability theory to express the likelihood of chance as a source for the difference of end outcome. In the Bayesian framework, its role is more nuanced. The Bayesian analysis of clinical trials can afford a valid rationale for selective controls, pointing out a more limited role for randomization than it is generally accorded. This paper is aimed to offer a view of randomization from the perspective of both frequentist and Bayesian statistics and discussing the role of randomization also in theoretical decision models. Keywords. Clinical Trials; Bayesian Inference; Frequentist Inference; Randomization Introduction 2 The origin of randomization in experimental design can be dated back to its application in psychophysics research in the late nineteenth century.
    [Show full text]
  • PHIL 8670 (Fall, 2015): Philosophy of Statistics
    PHIL 8670 (Fall, 2015): Philosophy of Statistics Samuel C. Fletcher September 12, 2015 Primary Email: scfl[email protected] Office Phone: 612-625-4325 Office Hours: 754 Heller Hall, Tuesdays 3 pm { 4 pm, and by appointment Class Period: 731 Heller Hall, Tuesdays 4 pm { 6 pm Course Description Statistical methods form a significant part of the methodology of every scientific discipline. This seminar will survey a number of central philosophical and conceptual issues in statistics. After a brief review of elementary probability theory and its interpretation, we will consider at length both the classical and Bayesian schools of statistics: their formulations, and arguments pro and con. We will then consider a third school, likelihoodism, that is sometimes considered a compromise between classical and Bayesian methodologies and has gained favor among some philosophers of science. This leads to the likelihood principle, a natural focal point in the debate between these schools on the nature of evidence. The remainder of the seminar will be split between topics on statistical modeling (model verification and model selection), connections between the philosophy of statistics and more traditional topics in the philosophy of science (simplicity, induction and confirmation) and the relation between statistics and causation (randomness, randomization and causal inference). Evaluation Each week, there will be a semi-formal presentation of that week's reading assignment aimed at stimulating a critical discussion. For the first meeting (or two) I will be responsible for presenting, and for each of the remaining meetings, one or two students will be responsible. Students not presenting during a given week are responsible instead for submitting a one- paragraph question about a philosophical/conceptual issue arising in that week's readings to those presenting (and me).
    [Show full text]
  • Fusion Confusion? Comments on Nancy Reid: “BFF Four–Are We Converging?”
    Fusion Confusion? Comments on Nancy Reid: “BFF Four–Are we Converging?” Deborah G. Mayo The Fourth Bayesian, Fiducial and Frequentist Workshop (BFF4): Harvard University May 2, 2017 <1> I’m delighted to Be part of a workshop linking statistics and philosophy of statistics! I thank the organizers for inviting me. Nancy Reid’s “BFF Four–Are we Converging?” gives numerous avenues for discussion She zeroes in on obstacles to fusion: Confusion or disagreement on the nature of probability and its use in statistical inference <2> From Nancy Reid: Nature of probability probability to describe physical haphazard variability • probabilities represent features of the “real” world in idealized form • suBject to empirical test and improvement • conclusions of statistical analysis expressed in terms of interpretaBle parameters • enhanced understanding of the data generating process probability to describe the uncertainty of knowledge • measures rational, supposedly impersonal, degree of Belief given relevant information (Jeffreys) • measures a particular person’s degree of Belief, suBject typically to some constraints of self-consistency … • often linked with personal decision-making <3> • As is common, she laBels the second “epistemological” But a key question for me is: what’s relevant for a normative epistemology, for an account of what’s warranted/unwarranted to infer <4> Reid quite rightly asks: • in what sense are confidence distribution functions, significance functions, structural or fiducial probaBilities to be interpreted? • empirically? degree of Belief? • literature is not very clear <5> Reid: We may avoid the need for a different version of proBaBility by appeal to a notion of calibration (Cox 2006, Reid & Cox 2015) • This is my central focus I approach this indirectly, with analogy between philosophy of statistics and statistics <6> Carnap: Bayesians as Popper: Frequentists (N-P/Fisher) Can’t solve induction But can Build logics of induction or confirmation theories (e.g., Carnap 1962).
    [Show full text]
  • Principles of Statistical Inference
    Principles of Statistical Inference In this important book, D. R. Cox develops the key concepts of the theory of statistical inference, in particular describing and comparing the main ideas and controversies over foundational issues that have rumbled on for more than 200 years. Continuing a 60-year career of contribution to statistical thought, Professor Cox is ideally placed to give the comprehensive, balanced account of the field that is now needed. The careful comparison of frequentist and Bayesian approaches to inference allows readers to form their own opinion of the advantages and disadvantages. Two appendices give a brief historical overview and the author’s more personal assessment of the merits of different ideas. The content ranges from the traditional to the contemporary. While specific applications are not treated, the book is strongly motivated by applications across the sciences and associated technologies. The underlying mathematics is kept as elementary as feasible, though some previous knowledge of statistics is assumed. This book is for every serious user or student of statistics – in particular, for anyone wanting to understand the uncertainty inherent in conclusions from statistical analyses. Principles of Statistical Inference D.R. COX Nuffield College, Oxford CAMBRIDGE UNIVERSITY PRESS Cambridge, New York, Melbourne, Madrid, Cape Town, Singapore, São Paulo Cambridge University Press The Edinburgh Building, Cambridge CB2 8RU, UK Published in the United States of America by Cambridge University Press, New York www.cambridge.org Information on this title: www.cambridge.org/9780521866736 © D. R. Cox 2006 This publication is in copyright. Subject to statutory exception and to the provision of relevant collective licensing agreements, no reproduction of any part may take place without the written permission of Cambridge University Press.
    [Show full text]
  • A Tutorial on Conducting and Interpreting a Bayesian ANOVA in JASP
    A Tutorial on Conducting and Interpreting a Bayesian ANOVA in JASP Don van den Bergh∗1, Johnny van Doorn1, Maarten Marsman1, Tim Draws1, Erik-Jan van Kesteren4, Koen Derks1,2, Fabian Dablander1, Quentin F. Gronau1, Simonˇ Kucharsk´y1, Akash R. Komarlu Narendra Gupta1, Alexandra Sarafoglou1, Jan G. Voelkel5, Angelika Stefan1, Alexander Ly1,3, Max Hinne1, Dora Matzke1, and Eric-Jan Wagenmakers1 1University of Amsterdam 2Nyenrode Business University 3Centrum Wiskunde & Informatica 4Utrecht University 5Stanford University Abstract Analysis of variance (ANOVA) is the standard procedure for statistical inference in factorial designs. Typically, ANOVAs are executed using fre- quentist statistics, where p-values determine statistical significance in an all-or-none fashion. In recent years, the Bayesian approach to statistics is increasingly viewed as a legitimate alternative to the p-value. How- ever, the broad adoption of Bayesian statistics {and Bayesian ANOVA in particular{ is frustrated by the fact that Bayesian concepts are rarely taught in applied statistics courses. Consequently, practitioners may be unsure how to conduct a Bayesian ANOVA and interpret the results. Here we provide a guide for executing and interpreting a Bayesian ANOVA with JASP, an open-source statistical software program with a graphical user interface. We explain the key concepts of the Bayesian ANOVA using two empirical examples. ∗Correspondence concerning this article should be addressed to: Don van den Bergh University of Amsterdam, Department of Psychological Methods Postbus 15906, 1001 NK Amsterdam, The Netherlands E-Mail should be sent to: [email protected]. 1 Ubiquitous across the empirical sciences, analysis of variance (ANOVA) al- lows researchers to assess the effects of categorical predictors on a continuous outcome variable.
    [Show full text]