The Compass for Navigating a Data-Centric World

Total Page:16

File Type:pdf, Size:1020Kb

The Compass for Navigating a Data-Centric World Statistics: The Compass for Navigating a Data-Centric World Marie Davidian Department of Statistics North Carolina State University January 11, 2013 Available at http://statistics2013.org Statistics2013 Video Statistics2013 Video Available at http://statistics2013.org using . Statistics http://fivethirtyeight.blogs.nytimes.com/ Silver used a statistical model to combine the results of state-by-state polls, weighting them according their previous accuracy, and to simulate many elections and estimate probabilities of the outcome Triumph of the geeks Nate Silver predicted the outcome of the 2012 US presidential election in all 50 states http://fivethirtyeight.blogs.nytimes.com/ Silver used a statistical model to combine the results of state-by-state polls, weighting them according their previous accuracy, and to simulate many elections and estimate probabilities of the outcome Triumph of the geeks Nate Silver predicted the outcome of the 2012 US presidential election in all 50 states using . Statistics Triumph of the geeks Nate Silver predicted the outcome of the 2012 US presidential election in all 50 states using . Statistics http://fivethirtyeight.blogs.nytimes.com/ Silver used a statistical model to combine the results of state-by-state polls, weighting them according their previous accuracy, and to simulate many elections and estimate probabilities of the outcome Triumph of the geeks Others did, too. “Dynamic Bayesian forecasting of presidential elections in the states,” by Drew A. Linzer, Journal of the American Statistical Association, in press But the interest in statistics didn’t start with the US elections. Triumph of the geeks “Nate Silver-led statistics men crush pundits in election” – Bloomberg Businessweek “Nate Silver has made statistics sexy again” – Associated Press “Drew Linzer: The stats man who predicted Obama’s win” – BBC News Magazine “The allure of the statistics field grows” – Boston Globe Triumph of the geeks “Nate Silver-led statistics men crush pundits in election” – Bloomberg Businessweek “Nate Silver has made statistics sexy again” – Associated Press “Drew Linzer: The stats man who predicted Obama’s win” – BBC News Magazine “The allure of the statistics field grows” – Boston Globe But the interest in statistics didn’t start with the US elections. Statistics in the news New York Times, August 6, 2009 “I keep saying that the sexy job in the next 10 years will be statisticians” – Hal Varian, Chief Economist, Google Statistics in the news New York Times, January 26, 2012 “I went to parties and heard a little groan when people heard what I did. Now they’re all excited to meet me” – Rob Tibshirani, Department of Statistics, Stanford University Statistics in the news New York Times, February 11, 2012 “Statistics are interesting and fun. It’s cool now” – Andrew Gelman, Department of Statistics, Columbia University Statistics in the news The Wall Street Journal, December 28, 2012 Carl Bialik, The Numbers Guy Data • Administrative (e.g., tax records), government surveys • Genomic, meteorological, air quality, seismic, . • Electronic medical records, health care databases • Credit card transactions, point-of-sale, mobile phone • Online search, social networks • Polls, voter registration records A veritable tsunami/deluge/avalanche of data Data, data, and more data Why is there so much talk of statistics and statisticians? • Administrative (e.g., tax records), government surveys • Genomic, meteorological, air quality, seismic, . • Electronic medical records, health care databases • Credit card transactions, point-of-sale, mobile phone • Online search, social networks • Polls, voter registration records A veritable tsunami/deluge/avalanche of data Data, data, and more data Why is there so much talk of statistics and statisticians? Data Data, data, and more data Why is there so much talk of statistics and statisticians? Data • Administrative (e.g., tax records), government surveys • Genomic, meteorological, air quality, seismic, . • Electronic medical records, health care databases • Credit card transactions, point-of-sale, mobile phone • Online search, social networks • Polls, voter registration records A veritable tsunami/deluge/avalanche of data Demand 2011 McKinsey Global Institute report: Big data: The next frontier for innovation, competition, and productivity “A significant constraint. will be a shortage of . people with deep expertise in statistics and data mining. a talent gap of 140K - 190K positions in 2018 (in the US)” http://www.mckinsey.com/insights/mgi/research/technology and innovation/ big data the next frontier for innovation • However, Big Data does not automatically mean Big Information • Science, decision-making, and policy formulation require not only prediction and finding associations and patterns, but uncovering causal relationships • Which, as we’ll discuss later, is not so easy. Opportunities and challenges • Our ability to collect, store, access, and manipulate vast and complex data is ever-improving • The potential benefits to science and society of learning from these data are enormous Opportunities and challenges • Our ability to collect, store, access, and manipulate vast and complex data is ever-improving • The potential benefits to science and society of learning from these data are enormous • However, Big Data does not automatically mean Big Information • Science, decision-making, and policy formulation require not only prediction and finding associations and patterns, but uncovering causal relationships • Which, as we’ll discuss later, is not so easy. Big Data also supplies more raw material for statistical shenanigans and biased fact-finding excursions. It offers a high-tech twist on an old trick: I know the facts, now let’s find ’em. That is, says Rebecca Goldin, a mathematician at George Mason University, “one of the most pernicious uses of data.” Perils From “The Age of Big Data” With huge data sets and fine-grained measurement,. there is increased risk of “false discoveries.” The trouble with seeking a meaningful needle in massive haystacks of data, says Trevor Hastie, a statistics professor at Stanford, is that “many bits of straw look like needles.” Perils From “The Age of Big Data” With huge data sets and fine-grained measurement,. there is increased risk of “false discoveries.” The trouble with seeking a meaningful needle in massive haystacks of data, says Trevor Hastie, a statistics professor at Stanford, is that “many bits of straw look like needles.” Big Data also supplies more raw material for statistical shenanigans and biased fact-finding excursions. It offers a high-tech twist on an old trick: I know the facts, now let’s find ’em. That is, says Rebecca Goldin, a mathematician at George Mason University, “one of the most pernicious uses of data.” Statistics While Big Data have inspired considerable current interest in statistics, statistics has been fundamental in numerous areas of science, business, and government for decades Critical need Sound, objective methods for modeling, analysis, and interpretation While Big Data have inspired considerable current interest in statistics, statistics has been fundamental in numerous areas of science, business, and government for decades Critical need Sound, objective methods for modeling, analysis, and interpretation Statistics Critical need Sound, objective methods for modeling, analysis, and interpretation Statistics While Big Data have inspired considerable current interest in statistics, statistics has been fundamental in numerous areas of science, business, and government for decades • Statistical stories • Our data-rich future Roadmap • A brief history • Our data-rich future Roadmap • A brief history • Statistical stories Roadmap • A brief history • Statistical stories • Our data-rich future Statistics: The science of learning from data and of measuring, controlling, and communicating uncertainty The path to what is now the formal discipline of statistical science is long and winding. What is statistics? The path to what is now the formal discipline of statistical science is long and winding. What is statistics? Statistics: The science of learning from data and of measuring, controlling, and communicating uncertainty What is statistics? Statistics: The science of learning from data and of measuring, controlling, and communicating uncertainty The path to what is now the formal discipline of statistical science is long and winding. • But it was not until the the mid-1600s that the mathematical notions of probability began to be developed by (mainly) mathematicians and physicists (e.g., Blaise Pascal), often inspired by games of chance • The first formal attempt to summarize and learn from data was by John Graunt, who created a precursor to modern life tables used in demography • Christiaan Huygens was among the first to connect such data analysis to probability Origins – pre-1700 • Sporadic accounts of measurement and data collection and interpretation date back as early as 5 B.C. • The first formal attempt to summarize and learn from data was by John Graunt, who created a precursor to modern life tables used in demography • Christiaan Huygens was among the first to connect such data analysis to probability Origins – pre-1700 • Sporadic accounts of measurement and data collection and interpretation date back as early as 5 B.C. • But it was not until the the mid-1600s that the mathematical notions of probability began to be developed by (mainly) mathematicians and physicists (e.g., Blaise Pascal), often inspired by games of chance
Recommended publications
  • F:\RSS\Me\Society's Mathemarica
    School of Social Sciences Economics Division University of Southampton Southampton SO17 1BJ, UK Discussion Papers in Economics and Econometrics Mathematics in the Statistical Society 1883-1933 John Aldrich No. 0919 This paper is available on our website http://www.southampton.ac.uk/socsci/economics/research/papers ISSN 0966-4246 Mathematics in the Statistical Society 1883-1933* John Aldrich Economics Division School of Social Sciences University of Southampton Southampton SO17 1BJ UK e-mail: [email protected] Abstract This paper considers the place of mathematical methods based on probability in the work of the London (later Royal) Statistical Society in the half-century 1883-1933. The end-points are chosen because mathematical work started to appear regularly in 1883 and 1933 saw the formation of the Industrial and Agricultural Research Section– to promote these particular applications was to encourage mathematical methods. In the period three movements are distinguished, associated with major figures in the history of mathematical statistics–F. Y. Edgeworth, Karl Pearson and R. A. Fisher. The first two movements were based on the conviction that the use of mathematical methods could transform the way the Society did its traditional work in economic/social statistics while the third movement was associated with an enlargement in the scope of statistics. The study tries to synthesise research based on the Society’s archives with research on the wider history of statistics. Key names : Arthur Bowley, F. Y. Edgeworth, R. A. Fisher, Egon Pearson, Karl Pearson, Ernest Snow, John Wishart, G. Udny Yule. Keywords : History of Statistics, Royal Statistical Society, mathematical methods.
    [Show full text]
  • Two Principles of Evidence and Their Implications for the Philosophy of Scientific Method
    TWO PRINCIPLES OF EVIDENCE AND THEIR IMPLICATIONS FOR THE PHILOSOPHY OF SCIENTIFIC METHOD by Gregory Stephen Gandenberger BA, Philosophy, Washington University in St. Louis, 2009 MA, Statistics, University of Pittsburgh, 2014 Submitted to the Graduate Faculty of the Kenneth P. Dietrich School of Arts and Sciences in partial fulfillment of the requirements for the degree of Doctor of Philosophy University of Pittsburgh 2015 UNIVERSITY OF PITTSBURGH KENNETH P. DIETRICH SCHOOL OF ARTS AND SCIENCES This dissertation was presented by Gregory Stephen Gandenberger It was defended on April 14, 2015 and approved by Edouard Machery, Pittsburgh, Dietrich School of Arts and Sciences Satish Iyengar, Pittsburgh, Dietrich School of Arts and Sciences John Norton, Pittsburgh, Dietrich School of Arts and Sciences Teddy Seidenfeld, Carnegie Mellon University, Dietrich College of Humanities & Social Sciences James Woodward, Pittsburgh, Dietrich School of Arts and Sciences Dissertation Director: Edouard Machery, Pittsburgh, Dietrich School of Arts and Sciences ii Copyright © by Gregory Stephen Gandenberger 2015 iii TWO PRINCIPLES OF EVIDENCE AND THEIR IMPLICATIONS FOR THE PHILOSOPHY OF SCIENTIFIC METHOD Gregory Stephen Gandenberger, PhD University of Pittsburgh, 2015 The notion of evidence is of great importance, but there are substantial disagreements about how it should be understood. One major locus of disagreement is the Likelihood Principle, which says roughly that an observation supports a hypothesis to the extent that the hy- pothesis predicts it. The Likelihood Principle is supported by axiomatic arguments, but the frequentist methods that are most commonly used in science violate it. This dissertation advances debates about the Likelihood Principle, its near-corollary the Law of Likelihood, and related questions about statistical practice.
    [Show full text]
  • The Likelihood Principle
    1 01/28/99 ãMarc Nerlove 1999 Chapter 1: The Likelihood Principle "What has now appeared is that the mathematical concept of probability is ... inadequate to express our mental confidence or diffidence in making ... inferences, and that the mathematical quantity which usually appears to be appropriate for measuring our order of preference among different possible populations does not in fact obey the laws of probability. To distinguish it from probability, I have used the term 'Likelihood' to designate this quantity; since both the words 'likelihood' and 'probability' are loosely used in common speech to cover both kinds of relationship." R. A. Fisher, Statistical Methods for Research Workers, 1925. "What we can find from a sample is the likelihood of any particular value of r [a parameter], if we define the likelihood as a quantity proportional to the probability that, from a particular population having that particular value of r, a sample having the observed value r [a statistic] should be obtained. So defined, probability and likelihood are quantities of an entirely different nature." R. A. Fisher, "On the 'Probable Error' of a Coefficient of Correlation Deduced from a Small Sample," Metron, 1:3-32, 1921. Introduction The likelihood principle as stated by Edwards (1972, p. 30) is that Within the framework of a statistical model, all the information which the data provide concerning the relative merits of two hypotheses is contained in the likelihood ratio of those hypotheses on the data. ...For a continuum of hypotheses, this principle
    [Show full text]
  • Statistics Making an Impact
    John Pullinger J. R. Statist. Soc. A (2013) 176, Part 4, pp. 819–839 Statistics making an impact John Pullinger House of Commons Library, London, UK [The address of the President, delivered to The Royal Statistical Society on Wednesday, June 26th, 2013] Summary. Statistics provides a special kind of understanding that enables well-informed deci- sions. As citizens and consumers we are faced with an array of choices. Statistics can help us to choose well. Our statistical brains need to be nurtured: we can all learn and practise some simple rules of statistical thinking. To understand how statistics can play a bigger part in our lives today we can draw inspiration from the founders of the Royal Statistical Society. Although in today’s world the information landscape is confused, there is an opportunity for statistics that is there to be seized.This calls for us to celebrate the discipline of statistics, to show confidence in our profession, to use statistics in the public interest and to champion statistical education. The Royal Statistical Society has a vital role to play. Keywords: Chartered Statistician; Citizenship; Economic growth; Evidence; ‘getstats’; Justice; Open data; Public good; The state; Wise choices 1. Introduction Dictionaries trace the source of the word statistics from the Latin ‘status’, the state, to the Italian ‘statista’, one skilled in statecraft, and on to the German ‘Statistik’, the science dealing with data about the condition of a state or community. The Oxford English Dictionary brings ‘statistics’ into English in 1787. Florence Nightingale held that ‘the thoughts and purpose of the Deity are only to be discovered by the statistical study of natural phenomena:::the application of the results of such study [is] the religious duty of man’ (Pearson, 1924).
    [Show full text]
  • School of Social Sciences Economics Division University of Southampton Southampton SO17 1BJ, UK
    School of Social Sciences Economics Division University of Southampton Southampton SO17 1BJ, UK Discussion Papers in Economics and Econometrics Professor A L Bowley’s Theory of the Representative Method John Aldrich No. 0801 This paper is available on our website http://www.socsci.soton.ac.uk/economics/Research/Discussion_Papers ISSN 0966-4246 Key names: Arthur L. Bowley, F. Y. Edgeworth, , R. A. Fisher, Adolph Jensen, J. M. Keynes, Jerzy Neyman, Karl Pearson, G. U. Yule. Keywords: History of Statistics, Sampling theory, Bayesian inference. Professor A. L. Bowley’s Theory of the Representative Method * John Aldrich Economics Division School of Social Sciences University of Southampton Southampton SO17 1BJ UK e-mail: [email protected] Abstract Arthur. L. Bowley (1869-1957) first advocated the use of surveys–the “representative method”–in 1906 and started to conduct surveys of economic and social conditions in 1912. Bowley’s 1926 memorandum for the International Statistical Institute on the “Measurement of the precision attained in sampling” was the first large-scale theoretical treatment of sample surveys as he conducted them. This paper examines Bowley’s arguments in the context of the statistical inference theory of the time. The great influence on Bowley’s conception of statistical inference was F. Y. Edgeworth but by 1926 R. A. Fisher was on the scene and was attacking Bayesian methods and promoting a replacement of his own. Bowley defended his Bayesian method against Fisher and against Jerzy Neyman when the latter put forward his concept of a confidence interval and applied it to the representative method. * Based on a talk given at the Sample Surveys and Bayesian Statistics Conference, Southampton, August 2008.
    [Show full text]
  • The Contrasting Strategies of Almroth Wright and Bradford Hill to Capture the Nomenclature of Controlled Trials
    6 Whose Words are they Anyway? The Contrasting Strategies of Almroth Wright and Bradford Hill to Capture the Nomenclature of Controlled Trials When Almroth Wright (1861–1947) began to endure the experience common to many new fathers, of sleepless nights caused by his crying infant, he characteristically applied his physiological knowledge to the problem in a deductive fashion. Reasoning that the child’s distress was the result of milk clotting in its stomach, he added citric acid to the baby’s bottle to prevent the formation of clots. The intervention appeared to succeed; Wright published his conclusions,1 and citrated milk was eventually advocated by many paediatricians in the management of crying babies.2 By contrast, Austin Bradford Hill (1897–1991) approached his wife’s insomnia in an empirical, inductive manner. He proposed persuading his wife to take hot milk before retiring, on randomly determined nights, and recording her subsequent sleeping patterns. Fortunately for domestic harmony, he described this experiment merely as an example – actually to perform it would, he considered, be ‘exceedingly rash’.3 These anecdotes serve to illustrate the very different approaches to therapeutic experiment maintained by the two men. Hill has widely been credited as the main progenitor of the modern randomised controlled trial (RCT).4 He was principally responsible for the design of the first two published British RCTs, to assess the efficacy of streptomycin in tuberculosis5 and the effectiveness of whooping cough vaccine6 and, in particular, was responsible for the introduction of randomisation into their design.7 Almroth Wright remained, throughout his life, an implacable opponent of this ‘statistical method’, preferring what he perceived as the greater certainty of a ‘crucial experiment’ performed in the laboratory.
    [Show full text]
  • Frequentist and Bayesian Statistics
    Faculty of Life Sciences Frequentist and Bayesian statistics Claus Ekstrøm E-mail: [email protected] Outline 1 Frequentists and Bayesians • What is a probability? • Interpretation of results / inference 2 Comparisons 3 Markov chain Monte Carlo Slide 2— PhD (Aug 23rd 2011) — Frequentist and Bayesian statistics What is a probability? Two schools in statistics: frequentists and Bayesians. Slide 3— PhD (Aug 23rd 2011) — Frequentist and Bayesian statistics Frequentist school School of Jerzy Neyman, Egon Pearson and Ronald Fischer. Slide 4— PhD (Aug 23rd 2011) — Frequentist and Bayesian statistics Bayesian school “School” of Thomas Bayes P(D|H) · P(H) P(H|D)= P(D|H) · P(H)dH Slide 5— PhD (Aug 23rd 2011) — Frequentist and Bayesian statistics Frequentists Frequentists talk about probabilities in relation to experiments with a random component. Relative frequency of an event, A,isdefinedas number of outcomes consistent with A P(A)= number of experiments The probability of event A is the limiting relative frequency. Relative frequency 0.0 0.2 0.4 0.6 0.8 1.0 020406080100 n Slide 6— PhD (Aug 23rd 2011) — Frequentist and Bayesian statistics Frequentists — 2 The definition restricts the things we can add probabilities to: What is the probability of there being life on Mars 100 billion years ago? We assume that there is an unknown but fixed underlying parameter, θ, for a population (i.e., the mean height on Danish men). Random variation (environmental factors, measurement errors, ...) means that each observation does not result in the true value. Slide 7— PhD (Aug 23rd 2011) — Frequentist and Bayesian statistics The meta-experiment idea Frequentists think of meta-experiments and consider the current dataset as a single realization from all possible datasets.
    [Show full text]
  • Statistical Inference a Work in Progress
    Ronald Christensen Department of Mathematics and Statistics University of New Mexico Copyright c 2019 Statistical Inference A Work in Progress Springer v Seymour and Wes. Preface “But to us, probability is the very guide of life.” Joseph Butler (1736). The Analogy of Religion, Natural and Revealed, to the Constitution and Course of Nature, Introduction. https://www.loc.gov/resource/dcmsiabooks. analogyofreligio00butl_1/?sp=41 Normally, I wouldn’t put anything this incomplete on the internet but I wanted to make parts of it available to my Advanced Inference Class, and once it is up, you have lost control. Seymour Geisser was a mentor to Wes Johnson and me. He was Wes’s Ph.D. advisor. Near the end of his life, Seymour was finishing his 2005 book Modes of Parametric Statistical Inference and needed some help. Seymour asked Wes and Wes asked me. I had quite a few ideas for the book but then I discovered that Sey- mour hated anyone changing his prose. That was the end of my direct involvement. The first idea for this book was to revisit Seymour’s. (So far, that seems only to occur in Chapter 1.) Thinking about what Seymour was doing was the inspiration for me to think about what I had to say about statistical inference. And much of what I have to say is inspired by Seymour’s work as well as the work of my other professors at Min- nesota, notably Christopher Bingham, R. Dennis Cook, Somesh Das Gupta, Mor- ris L. Eaton, Stephen E. Fienberg, Narish Jain, F. Kinley Larntz, Frank B.
    [Show full text]
  • “It Took a Global Conflict”— the Second World War and Probability in British
    Keynames: M. S. Bartlett, D.G. Kendall, stochastic processes, World War II Wordcount: 17,843 words “It took a global conflict”— the Second World War and Probability in British Mathematics John Aldrich Economics Department University of Southampton Southampton SO17 1BJ UK e-mail: [email protected] Abstract In the twentieth century probability became a “respectable” branch of mathematics. This paper describes how in Britain the transformation came after the Second World War and was due largely to David Kendall and Maurice Bartlett who met and worked together in the war and afterwards worked on stochastic processes. Their interests later diverged and, while Bartlett stayed in applied probability, Kendall took an increasingly pure line. March 2020 Probability played no part in a respectable mathematics course, and it took a global conflict to change both British mathematics and D. G. Kendall. Kingman “Obituary: David George Kendall” Introduction In the twentieth century probability is said to have become a “respectable” or “bona fide” branch of mathematics, the transformation occurring at different times in different countries.1 In Britain it came after the Second World War with research on stochastic processes by Maurice Stevenson Bartlett (1910-2002; FRS 1961) and David George Kendall (1918-2007; FRS 1964).2 They also contributed as teachers, especially Kendall who was the “effective beginning of the probability tradition in this country”—his pupils and his pupils’ pupils are “everywhere” reported Bingham (1996: 185). Bartlett and Kendall had full careers—extending beyond retirement in 1975 and ‘85— but I concentrate on the years of setting-up, 1940-55.
    [Show full text]
  • Herman Otto Hartley, Distinguished Professor the Texas A&M University Department of Statistics Established the H. O. Hartley
    Edward George received a Ph.D. in Statistics from Stanford University in 1981; a Master’s in Applied Mathematics and Statistics from SUNY at Stony Brook in 1976; and an AB in Mathematics from Cornell University, in 1972. He was named full professor in the Department of Statistics at the University of Texas at Austin in 1992. From 1997-2001 he was the Ed and Molly Smith Chair in Business Administration. He then moved to the Wharton School of the University of Pennsylvania where he has been a Universal Furniture Professor since 2002. George was the Executive Editor Elect of Statistical Science in 2004 and Executive Editor from 2005-2007. He was guest co-editor of Research on Official Statistics in 2001 and guest co-editor of Applied Stochastic Models in Business and Industry in 2006. He is also serving on the Advisory Board for Quantitative Marketing and Economics. He is currently Associate Editor for Statistics Surveys, Bayesian Analysis, and Asia- Pacific Financial Markets and has organized or participated in the organization of 13 conferences. Herman Otto Hartley, Distinguished Professor The Texas A&M University Department of Statistics established the H. O. Hartley Memorial Lectures in 1988 to honor the memory of Prof. Herman Otto Hartley. Hartley accepted an appointment as Distinguished Professor at Texas A&M in 1963, founded the Texas A&M Institute of Statistics, and served as its director until his retirement in 1977. Hartley built his initial faculty of four into a group of 16, directed more than 30 doctoral students, and published over 75 papers during this period.
    [Show full text]
  • Major Greenwood and Clinical Trials
    From the James Lind Library Journal of the Royal Society of Medicine; 2017, Vol. 110(11) 452–457 DOI: 10.1177/0141076817736028 Major Greenwood and clinical trials Vern Farewell1 and Tony Johnson2 1MRC Biostatistics Unit, University of Cambridge, Cambridge CB2 0SR, UK 2MRC Clinical Trials Unit, Aviation House, London WC2B 6NH, UK Corresponding author: Vern Farewell. Email: [email protected] Introduction References to Greenwood’s work in clinical trials Major Greenwood (1880–1949) was the foremost medical statistician in the UK during the first Greenwood was a renowned epidemiologist who is not half of the 20th century.1 The son of a general usually associated with randomised clinical trials. At medical practitioner, he obtained a medical degree the time of his retirement, randomised clinical trials from the London Hospital in 1904 and then stu- were still under development and Peter Armitage has died statistics under Karl Pearson at University no recollection of Greenwood commenting on trials College London. Instead of general practice, he when Greenwood visited the London School of opted to pursue a career in medical research, first Hygiene and Tropical Medicine during his retirement under the physiologist Leonard Hill at the London (personal communication). Indeed, randomised clin- Hospital, where, in 1908, he set up the first depart- ical trials were a major component of the work of ment of medical statistics and gave the first lecture Austin Bradford Hill,5 who began his employment in course in the subject in 1909. He established the Greenwood’s department in 1923 and succeeded him second department of medical statistics in 1910 at as professor at the London School of Hygiene and the Lister Institute and established the second Tropical Medicine and director of the Medical course of lectures there.
    [Show full text]
  • “I Didn't Want to Be a Statistician”
    “I didn’t want to be a statistician” Making mathematical statisticians in the Second World War John Aldrich University of Southampton Seminar Durham January 2018 1 The individual before the event “I was interested in mathematics. I wanted to be either an analyst or possibly a mathematical physicist—I didn't want to be a statistician.” David Cox Interview 1994 A generation after the event “There was a large increase in the number of people who knew that statistics was an interesting subject. They had been given an excellent training free of charge.” George Barnard & Robin Plackett (1985) Statistics in the United Kingdom,1939-45 Cox, Barnard and Plackett were among the people who became mathematical statisticians 2 The people, born around 1920 and with a ‘name’ by the 60s : the 20/60s Robin Plackett was typical Born in 1920 Cambridge mathematics undergraduate 1940 Off the conveyor belt from Cambridge mathematics to statistics war-work at SR17 1942 Lecturer in Statistics at Liverpool in 1946 Professor of Statistics King’s College, Durham 1962 3 Some 20/60s (in 1968) 4 “It is interesting to note that a number of these men now hold statistical chairs in this country”* Egon Pearson on SR17 in 1973 In 1939 he was the UK’s only professor of statistics * Including Dennis Lindley Aberystwyth 1960 Peter Armitage School of Hygiene 1961 Robin Plackett Durham/Newcastle 1962 H. J. Godwin Royal Holloway 1968 Maurice Walker Sheffield 1972 5 SR 17 women in statistical chairs? None Few women in SR17: small skills pool—in 30s Cambridge graduated 5 times more men than women Post-war careers—not in statistics or universities Christine Stockman (1923-2015) Maths at Cambridge.
    [Show full text]