Bayesian Inference for Heterogeneous Event Counts
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
F:\RSS\Me\Society's Mathemarica
School of Social Sciences Economics Division University of Southampton Southampton SO17 1BJ, UK Discussion Papers in Economics and Econometrics Mathematics in the Statistical Society 1883-1933 John Aldrich No. 0919 This paper is available on our website http://www.southampton.ac.uk/socsci/economics/research/papers ISSN 0966-4246 Mathematics in the Statistical Society 1883-1933* John Aldrich Economics Division School of Social Sciences University of Southampton Southampton SO17 1BJ UK e-mail: [email protected] Abstract This paper considers the place of mathematical methods based on probability in the work of the London (later Royal) Statistical Society in the half-century 1883-1933. The end-points are chosen because mathematical work started to appear regularly in 1883 and 1933 saw the formation of the Industrial and Agricultural Research Section– to promote these particular applications was to encourage mathematical methods. In the period three movements are distinguished, associated with major figures in the history of mathematical statistics–F. Y. Edgeworth, Karl Pearson and R. A. Fisher. The first two movements were based on the conviction that the use of mathematical methods could transform the way the Society did its traditional work in economic/social statistics while the third movement was associated with an enlargement in the scope of statistics. The study tries to synthesise research based on the Society’s archives with research on the wider history of statistics. Key names : Arthur Bowley, F. Y. Edgeworth, R. A. Fisher, Egon Pearson, Karl Pearson, Ernest Snow, John Wishart, G. Udny Yule. Keywords : History of Statistics, Royal Statistical Society, mathematical methods. -
School of Social Sciences Economics Division University of Southampton Southampton SO17 1BJ, UK
School of Social Sciences Economics Division University of Southampton Southampton SO17 1BJ, UK Discussion Papers in Economics and Econometrics Professor A L Bowley’s Theory of the Representative Method John Aldrich No. 0801 This paper is available on our website http://www.socsci.soton.ac.uk/economics/Research/Discussion_Papers ISSN 0966-4246 Key names: Arthur L. Bowley, F. Y. Edgeworth, , R. A. Fisher, Adolph Jensen, J. M. Keynes, Jerzy Neyman, Karl Pearson, G. U. Yule. Keywords: History of Statistics, Sampling theory, Bayesian inference. Professor A. L. Bowley’s Theory of the Representative Method * John Aldrich Economics Division School of Social Sciences University of Southampton Southampton SO17 1BJ UK e-mail: [email protected] Abstract Arthur. L. Bowley (1869-1957) first advocated the use of surveys–the “representative method”–in 1906 and started to conduct surveys of economic and social conditions in 1912. Bowley’s 1926 memorandum for the International Statistical Institute on the “Measurement of the precision attained in sampling” was the first large-scale theoretical treatment of sample surveys as he conducted them. This paper examines Bowley’s arguments in the context of the statistical inference theory of the time. The great influence on Bowley’s conception of statistical inference was F. Y. Edgeworth but by 1926 R. A. Fisher was on the scene and was attacking Bayesian methods and promoting a replacement of his own. Bowley defended his Bayesian method against Fisher and against Jerzy Neyman when the latter put forward his concept of a confidence interval and applied it to the representative method. * Based on a talk given at the Sample Surveys and Bayesian Statistics Conference, Southampton, August 2008. -
The Contrasting Strategies of Almroth Wright and Bradford Hill to Capture the Nomenclature of Controlled Trials
6 Whose Words are they Anyway? The Contrasting Strategies of Almroth Wright and Bradford Hill to Capture the Nomenclature of Controlled Trials When Almroth Wright (1861–1947) began to endure the experience common to many new fathers, of sleepless nights caused by his crying infant, he characteristically applied his physiological knowledge to the problem in a deductive fashion. Reasoning that the child’s distress was the result of milk clotting in its stomach, he added citric acid to the baby’s bottle to prevent the formation of clots. The intervention appeared to succeed; Wright published his conclusions,1 and citrated milk was eventually advocated by many paediatricians in the management of crying babies.2 By contrast, Austin Bradford Hill (1897–1991) approached his wife’s insomnia in an empirical, inductive manner. He proposed persuading his wife to take hot milk before retiring, on randomly determined nights, and recording her subsequent sleeping patterns. Fortunately for domestic harmony, he described this experiment merely as an example – actually to perform it would, he considered, be ‘exceedingly rash’.3 These anecdotes serve to illustrate the very different approaches to therapeutic experiment maintained by the two men. Hill has widely been credited as the main progenitor of the modern randomised controlled trial (RCT).4 He was principally responsible for the design of the first two published British RCTs, to assess the efficacy of streptomycin in tuberculosis5 and the effectiveness of whooping cough vaccine6 and, in particular, was responsible for the introduction of randomisation into their design.7 Almroth Wright remained, throughout his life, an implacable opponent of this ‘statistical method’, preferring what he perceived as the greater certainty of a ‘crucial experiment’ performed in the laboratory. -
“It Took a Global Conflict”— the Second World War and Probability in British
Keynames: M. S. Bartlett, D.G. Kendall, stochastic processes, World War II Wordcount: 17,843 words “It took a global conflict”— the Second World War and Probability in British Mathematics John Aldrich Economics Department University of Southampton Southampton SO17 1BJ UK e-mail: [email protected] Abstract In the twentieth century probability became a “respectable” branch of mathematics. This paper describes how in Britain the transformation came after the Second World War and was due largely to David Kendall and Maurice Bartlett who met and worked together in the war and afterwards worked on stochastic processes. Their interests later diverged and, while Bartlett stayed in applied probability, Kendall took an increasingly pure line. March 2020 Probability played no part in a respectable mathematics course, and it took a global conflict to change both British mathematics and D. G. Kendall. Kingman “Obituary: David George Kendall” Introduction In the twentieth century probability is said to have become a “respectable” or “bona fide” branch of mathematics, the transformation occurring at different times in different countries.1 In Britain it came after the Second World War with research on stochastic processes by Maurice Stevenson Bartlett (1910-2002; FRS 1961) and David George Kendall (1918-2007; FRS 1964).2 They also contributed as teachers, especially Kendall who was the “effective beginning of the probability tradition in this country”—his pupils and his pupils’ pupils are “everywhere” reported Bingham (1996: 185). Bartlett and Kendall had full careers—extending beyond retirement in 1975 and ‘85— but I concentrate on the years of setting-up, 1940-55. -
“I Didn't Want to Be a Statistician”
“I didn’t want to be a statistician” Making mathematical statisticians in the Second World War John Aldrich University of Southampton Seminar Durham January 2018 1 The individual before the event “I was interested in mathematics. I wanted to be either an analyst or possibly a mathematical physicist—I didn't want to be a statistician.” David Cox Interview 1994 A generation after the event “There was a large increase in the number of people who knew that statistics was an interesting subject. They had been given an excellent training free of charge.” George Barnard & Robin Plackett (1985) Statistics in the United Kingdom,1939-45 Cox, Barnard and Plackett were among the people who became mathematical statisticians 2 The people, born around 1920 and with a ‘name’ by the 60s : the 20/60s Robin Plackett was typical Born in 1920 Cambridge mathematics undergraduate 1940 Off the conveyor belt from Cambridge mathematics to statistics war-work at SR17 1942 Lecturer in Statistics at Liverpool in 1946 Professor of Statistics King’s College, Durham 1962 3 Some 20/60s (in 1968) 4 “It is interesting to note that a number of these men now hold statistical chairs in this country”* Egon Pearson on SR17 in 1973 In 1939 he was the UK’s only professor of statistics * Including Dennis Lindley Aberystwyth 1960 Peter Armitage School of Hygiene 1961 Robin Plackett Durham/Newcastle 1962 H. J. Godwin Royal Holloway 1968 Maurice Walker Sheffield 1972 5 SR 17 women in statistical chairs? None Few women in SR17: small skills pool—in 30s Cambridge graduated 5 times more men than women Post-war careers—not in statistics or universities Christine Stockman (1923-2015) Maths at Cambridge. -
Alan Agresti
Historical Highlights in the Development of Categorical Data Analysis Alan Agresti Department of Statistics, University of Florida UF Winter Workshop 2010 CDA History – p. 1/39 Karl Pearson (1857-1936) CDA History – p. 2/39 Karl Pearson (1900) Philos. Mag. Introduces chi-squared statistic (observed − expected)2 X2 = X expected df = no. categories − 1 • testing values for multinomial probabilities (Monte Carlo roulette runs) • testing fit of Pearson curves • testing statistical independence in r × c contingency table (df = rc − 1) CDA History – p. 3/39 Karl Pearson (1904) Advocates measuring association in contingency tables by approximating the correlation for an assumed underlying continuous distribution • tetrachoric correlation (2 × 2, assuming bivariate normality) X2 2 • contingency coefficient 2 based on for testing q X +n X independence in r × c contingency table • introduces term “contingency” as a “measure of the total deviation of the classification from independent probability.” CDA History – p. 4/39 George Udny Yule (1871-1951) (1900) Philos. Trans. Royal Soc. London (1912) JRSS Advocates measuring association using odds ratio n11 n12 n21 n22 n n n n − n n θˆ = 11 22 Q = 11 22 12 21 = (θˆ − 1)/(θˆ + 1) n12n21 n11n22 + n12n21 “At best the normal coefficient can only be said to give us. a hypothetical correlation between supposititious variables. The introduction of needless and unverifiable hypotheses does not appear to me a desirable proceeding in scientific work.” (1911) An Introduction to the Theory of Statistics (14 editions) CDA History – p. 5/39 K. Pearson, with D. Heron (1913) Biometrika “Unthinking praise has been bestowed on a textbook which can only lead statistical students hopelessly astray.” . -
The Development of the MRC Statistical Unit, 1911-1948
Medical History, 2000, 44: 323-340 Medical Statistics, Patronage and the State: The Development of the MRC Statistical Unit, 1911-1948 EDWARD HIGGS* The development of medical statistics based on the concepts of probability and error-theory developed by Francis Galton, Karl Pearson, George Udny Yule, R A Fisher, and their colleagues, has had a profound impact on medical science in the present century. This has been associated especially with the activities of the Medical Research Council (MRC) Statistical Unit at the London School of Hygiene and Tropical Medicine (LSHTM). It was here in 1946 that the world's first statistically rigorous clinical trial was undertaken, and where in the 1950s Austin Bradford Hill and Richard Doll revealed a statistical relationship between smoking and lung- cancer. But how and why was this key institution founded, and why were its methods probabilistic?' One might assume that the sheer brilliance of the work produced by Pearson and his statistical followers was sufficient to win over the medical profession, and to attract funding for the MRC Unit. However, one might question the extent to which an association with Pearson, and with the mathematical technicalities of his statistics, immediately endeared the fledgling discipline of biostatistics to the medical com- munity. As J Rosser Matthews has shown recently, Pearsonian statistics were only haltingly accepted by medical scientists in Britain in the 1920s and 1930s, and were little understood by medical practitioners.2 How then was such a university-based infrastructure for medical statistics established? Ultimately, of course, the Statistical * Edward Higgs, DPhil, Department of History, the statistical work on smoking, see V Berridge, University of Essex, Wivenhoe Park, Colchester, 'Science and policy: the case of postwar British C04 3SQ. -
NATURE September 29, 1951 Vol. 16E
542 NATURE September 29, 1951 voL. 16e actually completed this year four months ahead of Fawley now assumes priority as the largest oil schedule. The scheme involved clearing and levelling refinery in Europe. It cost £37,500,000 to build. 450 acres of wooded gravel-land; construction of When in full operation it will produce 6,500,000 tons roads ; laying down a railway system to connect up of petroleum products a year ; more than a million with permanent way to Southampton ; erecting a gallons of motor spirit a day ; and nearly 30 per huge building, 800 ft. long, 180 ft. wide and occupying cent of present total demand for petroleum and more than 3 acres, first to house constructional products in the British Isles. Not least in these materials, ultimately to serve as maintenance and financially stringent times, this brilliantly planned machine shop; lay-out of a construction camp on and executed work will. effect a saving in foreign site for 750 workmen; erection of a huge concrete exchange of more than 2 million dollars a week. plant with output of 140 cubic yards per hour, one Someone concerned with the lavish series · of of the largest in the world ; installation of the main publications which a great occasion like this official .refinery units, in itself a colossal task of erecting and opening obviously presented had the foresight and linking-up the bewildering maze of pipes, tanks, sensitive imagination to commission H. E. Bates to stills, furnaces and huge towers that go to make up write a commemoration "Fawley Achievement", "as a modern oil refinery. -
Professor A. L. Bowley's Theory of the Representative Method *
Key names: Arthur L. Bowley, F. Y. Edgeworth, , R. A. Fisher, Adolph Jensen, J. M. Keynes, Jerzy Neyman, Karl Pearson, G. U. Yule. Keywords: History of Statistics, Sampling theory, Bayesian inference. Professor A. L. Bowley’s Theory of the Representative Method * John Aldrich Economics Division School of Social Sciences University of Southampton Southampton SO17 1BJ UK e-mail: [email protected] Abstract Arthur. L. Bowley (1869-1957) first advocated the use of surveys–the “representative method”–in 1906 and started to conduct surveys of economic and social conditions in 1912. Bowley’s 1926 memorandum for the International Statistical Institute on the “Measurement of the precision attained in sampling” was the first large-scale theoretical treatment of sample surveys as he conducted them. This paper examines Bowley’s arguments in the context of the statistical inference theory of the time. The great influence on Bowley’s conception of statistical inference was F. Y. Edgeworth but by 1926 R. A. Fisher was on the scene and was attacking Bayesian methods and promoting a replacement of his own. Bowley defended his Bayesian method against Fisher and against Jerzy Neyman when the latter put forward his concept of a confidence interval and applied it to the representative method. * Based on a talk given at the Sample Surveys and Bayesian Statistics Conference, Southampton, August 2008. I am grateful to Andrew Dale for his comments on an earlier draft. December 2008 1 Introduction “I think that if practical statistics has acquired something valuable in the represen- tative method, this is primarily due to Professor A. -
On the Nineteenth-Century Origins of Significance Testing and P-Hacking
On the nineteenth-century origins of significance testing and p-hacking Glenn Shafer, Rutgers University The Game-Theoretic Probability and Finance Project Working Paper #55 First posted July 18, 2019. Last revised June 11, 2020. Project web site: http://www.probabilityandfinance.com Abstract Although the names significance test, p-value, and confidence interval came into use only in the 20th century, the methods they name were already used and abused in the 19th century. Knowledge of this earlier history can help us evaluate some of the ideas for improving statistical testing and estimation currently being discussed. This article recounts first the development of statistical testing and estima- tion after Laplace's discovery of the central limit theorem and then the sub- sequent transmission of these ideas into the English-language culture of math- ematical statistics in the early 20th century. I argue that the earlier history casts doubt on the efficacy of many of the competing proposals for improving on significance tests and p-values and for forestalling abuses. Rather than fur- ther complicate the way we now teach statistics, we should leave aside most of the 20th-century embellishments and emphasize exploratory data analysis and the idea of testing probabilities by betting against them. 1 Introduction 1 2 Laplace's theorem 3 2.1 Laplace's discovery . .3 2.2 Direct and inverse probability . .4 2.3 Laplacean and Gaussian least squares . .5 2.4 Seeing p-hacking in France . .7 2.5 The disappearance of Laplace's theorem in France . .8 3 Practical certainty 8 3.1 La limite de l'´ecart ..........................9 3.2 Tables of the normal distribution . -
INTRODUCTION I. Preamble: Teatime at Rothamsted the Snapshot Here Reproduced Was First Published in the Staff Journal of Rothams
INTRODUCTION I. Preamble: Teatime at Rothamsted Fig. i Teatime at Rothamsted Experimental Station (around 1931). [MODIFIED IMAGE] Credits: Records of the Rothamsted Staff Harpenden (1931), p. 45. Copyright Rothamsted Research Ltd. The snapshot here reproduced was first published in the staff journal of Rothamsted Experimental Station in 1931 to celebrate the cheerful atmosphere of afternoon tea in the agricultural institution. The scientific staff, men and women together, mingle on the lawn warmed by the sunshine, among the tables set out for tea. The picture, presumably taken by the official photographer of the institution, portrays them deep in conversation or pleasantly smiling, smoking pipes and drinking tea. The original caption reminded the reader that the image depicted “the staff at 4:15pm any summer afternoon”.1 Rothamsted Experimental Station, now Rothamsted Research, has been a key institution of British agricultural science throughout its history. Its beginnings date back to the mid-nineteenth century, when John Bennet Lawes, a businessman engaged in the fertilizer industry, sponsored a series of long- term experiments on crops and fertilizers in the fields of his private estate, Rothamsted, located in the village of Harpenden in Hertfordshire.2 1 RES (1931c), p. 45. 2 On the history of Rothamsted Experimental Station and its role in British agricultural science see E. J. Russell (1966). Afternoon tea was a twentieth century addition to the routine of the station. It was instituted in 1906 as a courtesy to the botanist Winifred Brenchley, the first female member that entered in the Rothamsted scientific staff, and rapidly established itself as a daily social event for all the station research workers.3 During one such teatime gathering in the 1920s the statistician and geneticist Ronald Aylmer Fisher, a founding father of modern statistics and a Rothamsted staff member since 1919, took inspiration for the statistical planning of experiments from the case of a lady tasting tea. -
Rothamsted in the Making of Sir Ronald Fisher Sc.D., F.R.S
Rothamsted in the Making of Sir Ronald Fisher Sc.D., F.R.S. John Aldrich Economics Department University of Southampton Southampton SO17 1BJ UK e-mail: [email protected] Abstract In 1919 the agricultural station at Rothamsted hired Ronald Fisher (1890-1962) to analyse historic data on crop yields. For him it was the beginning of a spectacular career and for Rothamsted the beginning of a Statistics Department which became a force in world statistics. Fisher arrived with publications in mathematical statistics and genetics. These were established subjects but at Rothamsted between 1919 and -33 he created the new career of agricultural research statistician. The paper considers how Rothamsted, in the person of its Director Sir John Russell, nurtured this development while supporting Fisher’s continuing work in mathematical statistics and genetics. It considers too how people associated with Fisher at Rothamsted, including assistants Wishart, Irwin and Yates and voluntary workers Tippett, Hoblyn and Hotelling contributed to establishing Fisherian statistics. September 2019 1 Introduction In 1919 a 29 year-old Cambridge mathematics graduate with no established profession started at Rothamsted Experimental Station; he died in 1962 “the most famous statistician and mathematical biologist in the world”—thus Irwin et al. (1963: 159). Sir Ronald Fisher Sc.D., F.R.S. was Balfour Professor at Cambridge when he was knighted and had been Galton Professor at University College but the higher doctorate and Fellowship of the Royal Society came when he was at Rothamsted. Ronald Aylmer Fisher would be another Galton and Balfour Professor but at Rothamsted he was the first Chief Statistician or even statistician.