Depicting Error

Total Page:16

File Type:pdf, Size:1020Kb

Depicting Error Depicting Error Howard WAINER tainty. Despite our appreciation for the importance of this work, we will ignore how the uncertainty is calculated, and Communicating data without some measure of their preci- proceed assuming that, in whatever way was required, such sion can lead to misinterpretation and incorrect inferences. estimates are available and our task is simply to convey Several conventions for displaying error along with the data them as well as we can. they modify are described and illustrated. Alternatives are offered that seem to provide improvements in the effective 2. ERRORS IN TABLES communication of error as well as increasing their ease, and hence their likelihood, of use. These alternatives are illus- Let us begin our examination of the depiction of error trated principally with data from the National Assessment with a typical tabular display. Shown in Table 1 is a data of Educational Progress. table extracted from a much larger display. Omitted are 30 states and two territories as well as another variable (per- KEY WORDS: Display; Errors in graphs; Errors in ta- centage in each category) and its standard error. As abbre- bles. viated, this table still contains the elements important to our discussion without being unwieldy. The only deviation from standard format in this table is that the standard error 1. INTRODUCTION of each data entry is in a separate column rather than de- noted within parentheses adjacent to the data entry whose One hallmark of the statistically conscious investigator is his firm belief that however the survey, experiment, or observational program actually variability it characterizes. This was done primarily to ease turned out, it could have turned out somewhat differently. Holding such a text manipulation. There are three lines added to the bottom belief and taking appropriate actions make effective use of data possible. of this table. These lines are the point of this discussion. We need not always ask explicitly "How much differently?" but we should Showing the standard errors of statistics in this way cer- be aware of such questions. Most of us find uncertainty uncomfortable ... (but) ... each of us who deals with the analysis and interpretation of tainly satisfies requirement (1). The visual weight given to data must learn to cope with uncertainty. the numbers connoting error is the same as that which con- (Mosteller and Tukey 1968, p. 100) veys the data. This depiction does not however allow us to Whenever we discuss information we must also discuss summarize easily the structure of error. For example, we its accuracy. Effective display of data must: cannot tell if the size of the error is related to the profi- ciency, nor is it easy to see how much variation there is in 1. remind us that the data being displayed do contain the error over the various states or across the categories of some uncertainty, and then parental education. The answers to these kinds of questions 2. characterize the size of that uncertainty as it pertains are important for requirement (2). In this case, errors across to the inferences we have in mind, and in so doing, states are quite homogeneous, and so displaying the max- 3. help keep us from drawing incorrect conclusions imum value of the standard error as a summary allows us through the lack of a full appreciation of the precision of to extract a handy, conservative value from the error terms our knowledge. provided. Using such a summary term (rather than say a The examples chosen here focus principally on errors in mean or a median) will lower the likelihood of our declar- means, but the graphical ideas expressed should generalize ing states different when there is reasonable evidence that easily to other situations. Throughout this discussion we as- they might not be. This raises the second part of require- sume that estimates of precision are available, and thus the ment (2), "uncertainty as it pertains to the inferences we task is strictly one of effective display. This assumption is have in mind." What kinds of questions are likely to be admittedly a big one. Great efforts and much imagination used to query Table 1? have been expended in the search for the true uncertainty. When national data are reported broken down by states, it Modern statistics has gone far beyond characterizing error is natural to assume that the disaggregated data are meant by dividing the observed standard deviation by n. Meth- to be compared. Thus the standard errors are merely the ods of sensitivity analysis through resampling or multiple building blocks that are required to be able to construct the imputation are some attempts to measure the real uncer- standard error of the difference so that the statistical sig- nificance of the observed difference between state means can be ascertained. An upper bound on this standard er- Howard Wainer is Principal Research Scientist, Educational Testing ror is obtained by multiplying the maximum standard error Service, Princeton, NJ 08541. This work was supported by Contract R999B40013 from the National Center for Education Statistics to the Ed- by the 2. This result is reported on the first line of the ucational Testing Service, Howard Wainer, Principal Investigator. The au- section at the bottom of the table labeled "Errorterms for thor is pleased to acknowledge this help. He also thanks Alan MacEachren, comparisons." It is a handy rule-of-thumb for anyone wish- Eugene Johnson, Kinley Larntz, Robert Mislevy, Charles P. Nesko, Linda Steinberg, and John W. Tukey for their helpful comments on an earlier ing to make a single comparison. Thus suppose, for some draft. reason, we are interested in comparing the performance of ( 1996 American Statistical Association1 The American Statisticiani,May 1996, Vol. 50, No. 2 101 This content downloaded from 160.39.32.189 on Fri, 23 Jan 2015 07:43:12 AM All use subject to JSTOR Terms and Conditions Table 1. Average Mathematics Proficiency by Parents' Highest Level of Education Grade 8-1992. A revised display of these data is shown in Table 2. Some education Did not Graduated after Graduated finish I don't college high school high school high school know Public schools 0 se 0 se 0 se 0 se 0 se Nation 279 1.4 270 1.2 256 1.4 248 1.8 251 1.7 States Alabama 261 2.5 258 2.0 244 1.8 239 2.0 237 2.9 Arizona 277 1.5 270 1.5 256 1.6 245 2.5 248 2.7 Arkansas 264 1.9 264 1.7 248 1.6 246 2.4 245 2.7 California 275 2.0 266 2.1 251 2.1 241 2.2 240 2.9 Colorado 282 1.3 276 1.6 260 1.5 250 2.4 252 2.6 Connecticut 288 1.0 272 1.8 260 1.8 245 3.3 251 2.4 Delaware 274 1.3 268 2.3 251 1.7 248 4.0 248 3.4 District of Columbia 244 1.7 240 1.9 224 1.6 225 3.2 229 2.2 Florida 268 1.9 266 1.9 251 1.8 244 2.7 244 3.2 Georgia 271 2.1 264 1.7 250 1.3 244 2.2 245 2.6 Hawaii 267 1.5 266 1.9 246 1.8 242 3.5 246 2.1 Idaho 281 .9 278 1.3 268 1.4 254 2.3 254 2.8 Error Terms for Comparisons Max. Std. error of difference 3.5 3.4 3.5 6.4 6.4 40 Bonferroni (std. err. x 3.2) 11.3 11.0 11.3 20.7 20.7 820 Bonferroni (std. err. x 4.0) 14.0 13.6 14.0 25.6 25.6 0 is an estimate of the average proficiency in the state. Source Abstracted from Table 2.12 in The 1992 NAEP TrialState Assessment. Hawaii's 8th graders whose parents were college graduates that participated in the state assessment, we can declare any (267) with their counterparts in Delaware (274). We note differences greater than 11.3 points "statistically significant that their mean scores differ by 7. This is twice the max- beyond the .05 level." The 11.3 point decision rule is con- imum standard error of the difference of the means (3.5). servative (for most comparisons about 50% too large), but We can thus conclude that this observed difference is statis- it provides a safe and quick rule-of-thumb. tically significant beyond the nominal (a = .05) value. We If someone is interested in comparing each state with could have done this comparison for any single pair that all others there will be ("1) or 820 comparisons. To con- was of special interest to us. trol the type I error rate in this situation requires boosting Although the comparison of a particular pair of states the critical region still further (to 14.0). Such a figure is may be an occasional use for a data table like this, we sus- given next to the label "820 Bonferroni." We are uncer- pect that comparing one state (perhaps our own) with all tain of the usefulness of including such a figure, although others is a more likely use. To do this correctly we need it still works, because anyone who is really interested in to control the artifactual increases in the error rate due to making all 820 comparisons will probably want a some- making many comparisons. The most common way to as- what less conservative decision rule. Such a user of the sure that the type I error rate is controlled is boosting the data table would need to go back to the original standard size of the standard error sufficiently so that the overall er- errors and compute a more precise figure.
Recommended publications
  • Infovis and Statistical Graphics: Different Goals, Different Looks1
    Infovis and Statistical Graphics: Different Goals, Different Looks1 Andrew Gelman2 and Antony Unwin3 20 Jan 2012 Abstract. The importance of graphical displays in statistical practice has been recognized sporadically in the statistical literature over the past century, with wider awareness following Tukey’s Exploratory Data Analysis (1977) and Tufte’s books in the succeeding decades. But statistical graphics still occupies an awkward in-between position: Within statistics, exploratory and graphical methods represent a minor subfield and are not well- integrated with larger themes of modeling and inference. Outside of statistics, infographics (also called information visualization or Infovis) is huge, but their purveyors and enthusiasts appear largely to be uninterested in statistical principles. We present here a set of goals for graphical displays discussed primarily from the statistical point of view and discuss some inherent contradictions in these goals that may be impeding communication between the fields of statistics and Infovis. One of our constructive suggestions, to Infovis practitioners and statisticians alike, is to try not to cram into a single graph what can be better displayed in two or more. We recognize that we offer only one perspective and intend this article to be a starting point for a wide-ranging discussion among graphics designers, statisticians, and users of statistical methods. The purpose of this article is not to criticize but to explore the different goals that lead researchers in different fields to value different aspects of data visualization. Recent decades have seen huge progress in statistical modeling and computing, with statisticians in friendly competition with researchers in applied fields such as psychometrics, econometrics, and more recently machine learning and “data science.” But the field of statistical graphics has suffered relative neglect.
    [Show full text]
  • Reviews Edited by Beth Notzon and Edith Paal
    Reviews edited by Beth Notzon and Edith Paal People have been trying to display infor- dull statistician or economist: he dabbled mation graphically ever since our ancestors in fields as varied as engineering, journal- depicted hunts on the cave walls. From ism, and blackmail, the reader discovers. those early efforts through the graphs and There is essentially no limit to the types tables in modern scientific publications, of data display that can be influenced by the attempts have met with various degrees good design, as Wainer makes clear by of success, as Howard Wainer describes in the variety of examples he presents. The Graphic Discovery: A Trout in the Milk and college acceptance letter Wainer’s son Other Visual Adventures. received is cited as a successful display. The This book is no dry, academic tome word “YES,” which is really the only infor- about data. The conversational tone, well- mation the reader cares about in such a chosen illustrations, and enriching asides letter, is printed in large type in the middle combine to create a delightful presentation of the page, with two short, smaller-type of the highlights and lowlights of graphic lines of congratulatory text at the bottom. display. And good data displays will con- As Wainer points out, that tells readers tinue to be important in our current, data- what they want to know without making inundated society. them hunt for it. (He leaves unaddressed Like any good story, this one has its hero. what that school’s rejection letters looked Although writers have used pictures to like that year.
    [Show full text]
  • How Well Can We Equate Test Forms That Are Constructed by Examinees? Program Statistics Research
    DOCUMENT RESUME ED 385 579 TM 024 017 AUTHOR Wainer, Howard; And Others TITLE How Well Can We Equate Test Forms That Are Constructed by Examinees? Program Statistics Research. INSTITUTION Educational Testing Service, Princeton, N.J. REPORT NO ETS-RR-91-57; ETS-TR-91-15 PUB DATE Oct 91 NOTE 29p. PUB TYPE Reports Research/Technical (143) EDRS PRICE MF01/PCO2 Plus Postage. DESCRIPTORS *Adaptive Testing; Chemistry; ComparativeAnalysis; Computer Assisted Testing; *Constructed Response; Difficulty Level; *Equated Scores; *ItemResponse Theory; Models; Selection; Test Format; Testing; *Test Items IDENTIFIERS Advanced Placement Examinations (CEEB); *Unidimensionality (Tests) ABSTRACT When an examination consists, in wholeor in part, of constructed response items, it isa common-practice to allow the examinee to choose amonga variety of questions. This procedure is usually adopted so that the limited number ofitems that can be completed in the allotted time does not unfairlyaffect the examinee. This results in the de facto administrationof several different test forms, where the exact structure ofany particular form is determined by the examinee. When different formsare administered, a canon of good testing practice requires that thoseforms be equated to adjust for differences in their difficulty.When the items are chosen by the examinee, traditional equating proceduresdo not strictly apply. In this paper, how one might equate withan item response theory (IRT) framework is explored. The procedure isillustrated with data from the College Board's Advanced PlacementTest in Chemistry taken by a sample of 18,431 examinees. Comparablescores can be produced in the context of choice to the extent thatresponses may be characterized with a unidimensional IRT model.Seven tables and five figures illustrate the discussion.
    [Show full text]
  • Notices of the American Mathematical
    ISSN 0002-9920 Notices of the American Mathematical Society AMERICAN MATHEMATICAL SOCIETY Graduate Studies in Mathematics Series The volumes in the GSM series are specifically designed as graduate studies texts, but are also suitable for recommended and/or supplemental course reading. With appeal to both students and professors, these texts make ideal independent study resources. The breadth and depth of the series’ coverage make it an ideal acquisition for all academic libraries that of the American Mathematical Society support mathematics programs. al January 2010 Volume 57, Number 1 Training Manual Optimal Control of Partial on Transport Differential Equations and Fluids Theory, Methods and Applications John C. Neu FROM THE GSM SERIES... Fredi Tro˝ltzsch NEW Graduate Studies Graduate Studies in Mathematics in Mathematics Volume 109 Manifolds and Differential Geometry Volume 112 ocietty American Mathematical Society Jeffrey M. Lee, Texas Tech University, Lubbock, American Mathematical Society TX Volume 107; 2009; 671 pages; Hardcover; ISBN: 978-0-8218- 4815-9; List US$89; AMS members US$71; Order code GSM/107 Differential Algebraic Topology From Stratifolds to Exotic Spheres Mapping Degree Theory Matthias Kreck, Hausdorff Research Institute for Enrique Outerelo and Jesús M. Ruiz, Mathematics, Bonn, Germany Universidad Complutense de Madrid, Spain Volume 110; 2010; approximately 215 pages; Hardcover; A co-publication of the AMS and Real Sociedad Matemática ISBN: 978-0-8218-4898-2; List US$55; AMS members US$44; Española (RSME). Order code GSM/110 Volume 108; 2009; 244 pages; Hardcover; ISBN: 978-0-8218- 4915-6; List US$62; AMS members US$50; Ricci Flow and the Sphere Theorem The Art of Order code GSM/108 Simon Brendle, Stanford University, CA Mathematics Volume 111; 2010; 176 pages; Hardcover; ISBN: 978-0-8218- page 8 Training Manual on Transport 4938-5; List US$47; AMS members US$38; and Fluids Order code GSM/111 John C.
    [Show full text]
  • Curriculum Vitae for James J. Heckman
    September 13, 2021 James Joseph Heckman Department of Economics University of Chicago 1126 East 59th Street Chicago, Illinois 60637 Telephone: (773) 702-0634 Fax: (773) 702-8490 Email: [email protected] Personal Date of Birth: April 19, 1944 Place of Birth: Chicago, Illinois Education B.A. 1965 (Math) Colorado College (summa cum laude) M.A. 1968 (Econ) Princeton University Ph.D. 1971 (Econ) Princeton University Dissertation “Three Essays on Household Labor Supply and the Demand for Market Goods.” Sponsors: S. Black, H. Kelejian, A. Rees Graduate and Undergraduate Academic Honors Phi Beta Kappa Woodrow Wilson Fellow NDEA Fellow NIH Fellow Harold Willis Dodds Fellow Post-Graduate Honors Honorary Degrees and Professorships Doctor Honoris Causa, Vienna University of Economics and Business, Vienna, Austria. Jan- uary, 2017. Doctor of Social Sciences Honoris Causa, Lignan University, Hong Kong, China. November, 2015. Honorary Doctorate of Science (Economics), University College London. September, 2013. Doctor Honoris Causis, Pontifical University, Santiago, Chile. August, 2009. Doctor Honoris Causis, University of Montreal.´ May 2004. 1 September 13, 2021 Doctor Honoris Causis, Bard College, May 2004. Doctor Honoris Causis, UAEM, Mexico. January 2003. Doctor Honoris Causis, University of Chile, Fall 2002. Honorary Doctor of Laws, Colorado College, 2001. Honorary Professor, Jinan University, Guangzhou, China, June, 2014. Honorary Professor, Renmin University, P. R. China, June, 2010. Honorary Professor, Beijing Normal University, P. R. China, June, 2010. Honorary Professor, Harbin Institute of Technology, P. R. China, October, 2007. Honorary Professor, Wuhan University, Wuhan, China, 2003. Honorary Professor, Huazhong University of Science and Technology, Wuhan, China, 2001. Honorary Professor, University of Tucuman, October, 1998.
    [Show full text]
  • COPYRIGHT NOTICE: Howard Wainer: Picturing the Uncertain World Is Published by Princeton University Press and Copyrighted, © 2009, by Princeton University Press
    COPYRIGHT NOTICE: Howard Wainer: Picturing the Uncertain World is published by Princeton University Press and copyrighted, © 2009, by Princeton University Press. All rights reserved. No part of this book may be reproduced in any form by any electronic or mechanical means (including photocopying, recording, or information storage and retrieval) without permission in writing from the publisher, except for reading and browsing via the World Wide Web. Users are not permitted to mount this file on any network servers. Follow links for Class Use and other Permissions. For more information send email to: [email protected] 1 The Most Dangerous Equation 1.1. INTRODUCTION What constitutes a dangerous equation? There are two obvious inter­ pretations: some equations are dangerous if you know the equation and others are dangerous if you do not. In this chapter I will not ex- plore the dangers an equation might hold because the secrets within its bounds open doors behind which lie terrible peril. Few would dis- agree that the obvious winner in this contest would be Einstein’s iconic equation E 5MC 2 (1.1) for it provides a measure of the enormous energy hidden within ordi- nary matter. Its destructive capability was recognized by Leo Szilard, who then instigated the sequence of events that culminated in the con- struction of atomic bombs. This is not, however, the direction I wish to pursue. Instead, I am interested in equations that unleash their danger, not when we know about them, but rather when we do not; equations that allow us to understand things clearly, but whose absence leaves us dangerously 5 ignorant.* There are many plausible candidates that seem to fill the bill.
    [Show full text]
  • Life Before EXCEL: Some Historical Statistical Graphs David Robinson THAMES CANCER REGISTRY
    Life before EXCEL: some historical statistical graphs David Robinson THAMES CANCER REGISTRY Introduction Francis Galton (1822-1911) - scientist, statistician and African explorer - was a cousin of Charles Darwin. He was born in Birmingham, the youngest of In this age of the personal computer, where complex pictorial representations of eight children of a prominent Quaker family. He studied medicine at King’s data can be produced at the press of a button or the click of a mouse, we tend College London and mathematics at Cambridge. He was a man of diverse to take statistical graphics very much for granted, and it is difficult to imagine talents, and was responsible (amongst many other projects) for the first weather how revolutionary the first simple charts were. What follows is a description of map, a theory of anticyclones, and the system for classifying fingerprints still in some of the most influential of these early attempts to display data, and the use today. He wrote a treatise on how to make the perfect cup of tea, and stories and characters behind them. published a ‘beauty map’ of the British Isles – based on the number of attractive women he encountered on his travels. (London came out with the highest score; Aberdeen the lowest.) Among his many statistical achievements, he is credited with the introduction of the standard deviation, the correlation coefficient and the regression line. His discovery of the phenomenon of ‘regression to the mean’ is illustrated in his famous chart (Figure 7) shown at the Royal Institution Lecture of 1877. Among Figure 2 his protégés was the great statistician Karl Pearson.
    [Show full text]
  • A Brief History of Data Visualization
    A Brief History of Data Visualization Michael Friendly Psychology Department and Statistical Consulting Service York University 4700 Keele Street, Toronto, ON, Canada M3J 1P3 in: Handbook of Computational Statistics: Data Visualization. See also BIBTEX entry below. BIBTEX: @InCollection{Friendly:06:hbook, author = {M. Friendly}, title = {A Brief History of Data Visualization}, year = {2006}, publisher = {Springer-Verlag}, address = {Heidelberg}, booktitle = {Handbook of Computational Statistics: Data Visualization}, volume = {III}, editor = {C. Chen and W. H\"ardle and A Unwin}, pages = {???--???}, note = {(In press)}, } © copyright by the author(s) document created on: March 21, 2006 created from file: hbook.tex cover page automatically created with CoverPage.sty (available at your favourite CTAN mirror) A brief history of data visualization Michael Friendly∗ March 21, 2006 Abstract It is common to think of statistical graphics and data visualization as relatively modern developments in statistics. In fact, the graphic representation of quantitative information has deep roots. These roots reach into the histories of the earliest map-making and visual depiction, and later into thematic cartography, statistics and statistical graphics, medicine, and other fields. Along the way, developments in technologies (printing, reproduction) mathematical theory and practice, and empirical observation and recording, enabled the wider use of graphics and new advances in form and content. This chapter provides an overview of the intellectual history of data visualization from medieval to modern times, describing and illustrating some significant advances along the way. It is based on a project, called the Milestones Project, to collect, catalog and document in one place the important developments in a wide range of areas and fields that led to mod- ern data visualization.
    [Show full text]
  • A Brief History of Data Visualization
    ABriefHistory II.1 of Data Visualization Michael Friendly 1.1 Introduction ........................................................................................ 16 1.2 Milestones Tour ................................................................................... 17 Pre-17th Century: Early Maps and Diagrams ............................................. 17 1600–1699: Measurement and Theory ..................................................... 19 1700–1799: New Graphic Forms .............................................................. 22 1800–1850: Beginnings of Modern Graphics ............................................ 25 1850–1900: The Golden Age of Statistical Graphics................................... 28 1900–1950: The Modern Dark Ages ......................................................... 37 1950–1975: Rebirth of Data Visualization ................................................. 39 1975–present: High-D, Interactive and Dynamic Data Visualization ........... 40 1.3 Statistical Historiography .................................................................. 42 History as ‘Data’ ...................................................................................... 42 Analysing Milestones Data ...................................................................... 43 What Was He Thinking? – Understanding Through Reproduction.............. 45 1.4 Final Thoughts .................................................................................... 48 16 Michael Friendly It is common to think of statistical graphics and data
    [Show full text]
  • Course Syllabus Page 1
    Seminar in Criminology Research and Analysis – Crim 7301 Fall 2016 Syllabus Course Information Time: Thursdays, 1:00 to 3:45 Class Location: GR 3.206 Professor Contact Information Dr. Andrew P. Wheeler – but call me Andy! Email: [email protected] Office Hours: Tuesday and Friday, 9:00 to 11:00, Office is GR 3.530 The quickest way to reach me is via email. I am frequently in my office, so feel free to stop by whenever (knock if the door is closed). Otherwise you can email to set up an appointment time. Course Pre-requisites, Co-requisites, and/or Other Restrictions This is a PhD course. You will have been expected to have completed EPPS 6310 (Research Design I), EPPS 6313 (Intro to Quant. Methods), and EPPS 6316 (Applied Regression). Course Description The course is an advanced course on quantitative research designs often used in criminology and criminal justice research. This course includes both lecture and time in the computer lab. The format of the course will often be I give a lecture (that includes student discussion of particular topics), and then save time for a computer lab (conducting statistical analysis) in the latter half of class. Student Learning Objectives/Outcomes By the end of the course you will be able to: Learn how regression equations can answer causal questions Understand when to use common quasi-experimental research designs -- such as propensity score matching and difference-in-differences. Have run practical examples of conducting those research designs in statistical software Know the basics of making a reproducible statistical analysis Gained introductory exposure to group based trajectory modelling, social network analysis, and machine learning applications in criminology Required Textbooks and Materials There will be two required texts for the course.
    [Show full text]
  • A Sampling of Statistical Problems Encountered at the Educational Testing Service
    DOCUMENT RESUME ED 388 715 TM 024 162 AUTHOR Wainer, Howard; And Others TITLE A Sampling of Statistical Problems Encountered at the Educational Testing Service. Program Statistics Research Technical Report No. 92-26. INSTITUTION Educational Testing Service, Princeton, N.J. REPORT NO ETS-RR-92-68 PUB DATE Nov 92 NOTE 25p. PUB TYPE Reports Evaluative/Feasibility (142) EDRS PRICE MF01/PC01 Plus Postage. DESCRIPTORS Adaptive Testing; *Bayesian Statistics; *Educational Research; Estimation (Mathematics); Longitudinal Studies; National Surveys; Researchers; Research Methodology; *Research Problems; *Response Rates (Questionnaires); Statistical Analysis; *Test Theory; *True Scores IDENTIFIERS Educational Testing Service; *National Assessment of Educational Progress ABSTRACT Four researchers at the Educational Testing Service describe what they consider some of the most vexing research problems they face. While these problems are not completely statistical, they all have major statistical components. Following the introduction (section 1),in section 2, "Problems with the Simultaneous Estimation of Many True Scores," Charles Lewis describes a technical problem that occurs in taking a Bayesian approach to traditional test theory. In the third section, "Test Theory Reconceived," Robert J. Mislevy explains problems involved in reconceiving old approaches and new theories. Section 4, "Allowing Examinee Choice in Exams" by Howard Wainer, discusses the general problem of nonignorable nonresponse in the circumstance in which examinees choose to answer only a small number of test items from a larger sample. The fifth section, "Some Statistical Issues Facing NAEP," by Eugene G. Johnson, describes the inferences that are occurring within the National Assessment of Educational Progress due to nonignorable response. (Contains 28 references.)(SLD) *********************************************************************** Reproductions supplied by EDRS are the best that can be made * from the original document.
    [Show full text]
  • A History of Data Visualization and Graphic Communication
    AHistoryof Data Visualization and Graphic Communication Michael Friendly Howard Wainer harvarduniversitypress Cambridge, Massachusetts, and London, England 2021 Contents Introduction 1 1 In the Beginning ... 10 2 TheFirstGraphGotItRight 29 3TheBirthofData 44 4 Vital Statistics: William Farr, John Snow, and Cholera 66 5 The Big Bang: William Playfair, the Father of Modern Graphics 95 6 The Origin and Development of the Scatterplot 121 7 The Golden Age of Statistical Graphics 158 8 Escaping Flatland 185 9 Visualizing Time and Space 199 10 Graphs as Poetry 231 Learning More 251 Notes 259 References 277 Acknowledgments 291 Index 293 Color illustrations follow page 230 Introduction The only new thing in the world is the history you don’t know. —harry s. truman,quotedbyDavidMcCulloch We live on islands surrounded by seas of data. Some call it “big data.” In these seas live various species of observable phenomena. Ideas, hypotheses, explanations, and graphics also roam in the seas of data and can clarify the waters or allow unsupported species to die. These creatures thrive on visual explanation and scientific proof. Over time new varieties of graphical species arise, prompted by new problems and inner visions of the fishers in the seas of data. Whether we’re aware of this or not, data are a part of almost every area of our lives. As individuals, fitness trackers and blood sugar meters let us moni- tor our health. Online bank dashboards let us view our spending patterns and track financial goals. As members of society, we read stories of outbreaks of wildfires in California or extreme weather events and wonder if these are mere anomalies or conclusive evidence for climate change.
    [Show full text]