Fisher Was Right Ronald C
Total Page:16
File Type:pdf, Size:1020Kb
View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Digital Commons@Wayne State University Journal of Modern Applied Statistical Methods Volume 9 | Issue 1 Article 2 5-1-2010 Fisher Was Right Ronald C. Serlin University of Wisconsin - Madison, [email protected] Follow this and additional works at: http://digitalcommons.wayne.edu/jmasm Part of the Applied Statistics Commons, Social and Behavioral Sciences Commons, and the Statistical Theory Commons Recommended Citation Serlin, Ronald C. (2010) "Fisher Was Right," Journal of Modern Applied Statistical Methods: Vol. 9: Iss. 1, Article 2. Available at: http://digitalcommons.wayne.edu/jmasm/vol9/iss1/2 This Invited Article is brought to you for free and open access by the Open Access Journals at DigitalCommons@WayneState. It has been accepted for inclusion in Journal of Modern Applied Statistical Methods by an authorized administrator of DigitalCommons@WayneState. Journal of Modern Applied Statistical Methods Copyright © 2010 JMASM, Inc. May 2010, Vol. 9, No. 1, 2-8 1538 – 9472/10/$95.00 INVITED ARTICLES Fisher Was Right Ronald C. Serlin University of Wisconsin-Madison Invited address presented to the Educational Statistician’s Special Interest Group at the annual meeting of the American Educational Research Association, Denver, May 1, 2010. Key words: Fisher. Introduction Early Years: Up to 1922 I would like once again to thank you for Fisher held two chairs in genetics, at awarding me this honor last year. Given the University College in London and then at scholars between whom I am sandwiched, the Cambridge but, surprisingly, was never a first honoree, Ingram Olkin, and next year’s, professor of statistics. Regarding Fisher’s Joel Levin, I must try very hard to act as though accomplishments in statistics, Savage (1976) the committee did not make a serious mistake commented that it would be easier to list the few with my nomination. Tonight, I’d like to focus topics in which he was not interested. “In the art on some of the work of R. A. Fisher, who would of calculating explicit sampling distributions, have been 120 years old now, to make a couple Fisher led statistics out of its infancy, and he of points of my own. I hope that some of what I may never have been excelled in this skill” (p. say will give you the same feeling of fun in the 449). discovery of something neat and surprising as I There is much, of course, about which experienced. Fisher was right. Despite his shunning the concept of Type II errors, Fisher (1928) was the first to provide formulas for the noncentral Chi- square, t, and F distributions. (The symbol F was introduced by Snedecor in honor of Fisher, “for Ronald Serlin is Professor Emeritus in the which officiousness,” according to Savage, Department of Educational Psychology at the “Fisher seems never to have forgiven him” (p. School of Education and is in the Department of 449)). There once existed a fair amount of Biostatistics and Medical Informatics in the disagreement regarding how to count degrees of Medical School. Email: [email protected]. freedom in a contingency table, with Karl 2 SERLIN Pearson (among others) claiming rc − 1 and series of examples. She stated that compared to Fisher (1922) correcting to (r − 1)(c − 1). Fisher, the use of the minimum Chi-square method of of course, was right here. Fisher was a pioneer in fit, other approaches were arbitrary, including nonparametric statistics, having suggested the what she termed “the Gaussian ‘best’ value,” (p. use of the sign test in place of the t-test in certain 262) the maximum likelihood approach from designs, and having introduced what he called error theory that Fisher had supported in a paper exact tests to avoid the assumption of normality he wrote as an undergraduate student in 1912. in many circumstances. According to Stigler (2005), in response to a According to Stigler (2005, p. 33), of letter and manuscript that Fisher submitted to Fisher’s 97 publications from 1912 to 1920, 91 Biometrika, Karl Pearson as editor told Fisher were in the Eugenics Review, two were on that he had to demonstrate the logic of genetics related to eugenics, two were papers maximum likelihood, to justify it being better published in The Messenger of Mathematics, and than Smith’s approach. For a while Fisher could the other two (in 1915 and 1920) were on not respond. mathematical statistics. I’ll focus briefly on the The basis for Fisher’s reply came, 1915 and 1920 papers, as described by Stigler possibly by accident (Stigler, 2005), in the late (2005, 2006). spring of 1919. Fisher was considering the Mathematically, the 1915 derivation of relative merits of two alternative estimates of the the distribution of the sample correlation standard deviation of a normal distribution: one coefficient was the kind of work to which we all was based on the mean absolute deviation, the strive. Fisher found the distribution, expressions other the maximum likelihood solution. He had for moments, transformations (r-to-z) and considered combining the two estimates in some distributional relationships (including his earlier way but instead discovered that the whole of the work on the Student’s t-distribution), information regarding σ, which a sample expressions for the bias of r, and the maximum provides, is summed up in the value of the likelihood estimator of ρ. maximum likelihood estimator. Not only did it have a smaller standard deviation, it was, in a Right Nice Stuff word, sufficient. This type of work led Neyman (1951), On November 17, 1921, Fisher read a in his review of Fisher’s Contributions to paper to the Royal Society of London entitled Mathematical Statistics (1950), to describe On the Mathematical Foundations of Fisher as “a very able ‘manipulative’ Theoretical Statistics. The paper opened with a mathematician” (p. 406). The Contributions set of definitions that were, in 1921, entirely contain prefatory comments by Fisher on the new to statistical theory, but which are now various papers. For the 1915 paper, Fisher wrote familiar; they include consistency, efficiency, “Here the method of defining a sample by the estimation, likelihood, optimum, and coordinates of a point in Euclidean hyperspace sufficiency. Stigler (2005) pointed out that not in was introduced...” (p. 87). Unfortunately, the list is “…another, even more basic statistical according to Neyman (1951), representing the concept: It is in this paper of Fisher’s that the sample by a point in space was used for a similar word ‘parameter’ is first used in the modern purpose by Karl Pearson in 1900 and - Neyman statistical sense” (p. 32). Stigler notes that the suspected - had probably been used even before word parameter appears 57 times. that; thus, Fisher was wrong in this regard. According to Fisher, a consistent During the year following the estimate is called efficient if it is asymptotically publication of Fisher’s article on the correlation normal and if it has the minimum asymptotic coefficient, Kirstine Smith (1916), working at variance (Neyman, 1951). In his 1908 paper, Karl Pearson’s laboratory, published an article however, Edgeworth expressed the idea that suggesting that when fitting a frequency curve maximum likelihood estimates are always with grouped data, the constants should be efficient and made several attempts to prove his estimated using a minimum Chi-square criterion. conjecture. The proofs, however, “…of the She illustrated the use of this criterion through a efficiency of maximum likelihood estimates 3 FISHER WAS RIGHT offered both by Edgeworth and by Fisher are from the null hypothesis being true” (p. 17). inaccurate, and the assertion, taken in its full Thus, Fisher felt that one could not commit a generality, is false” (Neyman, 1951, p. 407). So Type II error, because one never drew a Fisher was wrong in the assertion, the proof, and conclusion on the basis of a non-rejection of the in not giving Edgeworth some credit for priority. null hypothesis. As he wrote (Fisher, 1973), “To Summarizing Fisher’s work, Neyman a practical man, also, who rejects a hypothesis, it (1951) wrote, “…three major concepts were is, of course, a matter of indifference with what introduced by Fisher and consistently probability he might be led to accept the propagandized by him in a number of hypothesis falsely, for in his case he is not publications. These are mathematical likelihood accepting it” (pp. 41-42). Some rightness to this as a measure of the confidence in a hypothesis, is evident. sufficient statistics, and fiducial probability,” (p. Fisher always desired to establish a 407) all employed by Fisher in the service of correct theory of statistical inference. According scientific induction. to Kempthorne (1976) “Fisher really did think that one could develop by logical reasoning a Inference probability distribution for one’s knowledge of a Fisher (1947) felt that “the null physical constant” (p. 496). Fisher, as Neyman hypothesis is never proved or established, but is (1951) pointed out, seemed proud to have possibly disproved, in the course of formulated a measure of rational belief. Thus, experimentation. Every experiment may be said Fisher (1973) wrote that the level of significance to exist only in order to give the facts a chance “in such cases fulfils the conditions of a measure of disproving the null hypothesis” (p. 16). of the rational grounds for the disbelief it Regarding the rate of error to assign to an engenders” (p. 43). Similarly, Fisher (1925a) incorrect rejection of the null hypothesis, Fisher had observed that “if the value of P so calculated wrote (1926) that “it is convenient to draw the turned out to be a small quantity such as 0.01, line at about the level at which we can say: we should conclude with some confidence that ‘Either there is something in the treatment, or a the hypothesis was not in fact true of the coincidence has occurred such as does not occur population actually sampled” (p.