Visions of the Past, Present and Future of Statistical Graphics Apart1
Total Page:16
File Type:pdf, Size:1020Kb
Visions of the Past, Present and Future of Statistical Graphics apart1 Visions of The Past, Present and Future Visions of the Past of Statistical Graphics (An Ideo-Graphic and Idiosyncratic View) The only new thing in the world is the history you don’t know. Harry S. Truman The Milestones Project 16 The Golden Age of Statistical Graphics Sex: Male 14 1198 1493 12 Re-Visions of Minard 10 8 6 Attract 16 14 12 10 8 6 Photo 16 Admit?: No Admit?: Yes 14 12 10 557 1278 8 6 Subjsex Sex: Female 1:High 2:Med 3:Low Female Male Female Male Michael Friendly York University American Psychological Association August, 2003 APA 2003 1 Michael Friendly Visions of the Past, Present and Future of Statistical Graphics milestone1 Visions of the Past, Present and Future of Statistical Graphics milestone1 Milestones Project: Roots of Data Visualization Milestones Project: Goals Cartography Comprehensive catalog of historical developments in all fields related to data early map-making geo-measurement thematic cartography visualization GIS, geo-visualization→ → collect detailed bibliography, images, cross-references, web links, etc. Statistics, statistical thinking → probability theory distributions estimation 220 milestone items (6200 BC – present) → → statistical models diagnostic plots interactive graphics 240 images, portraits → → Data collection 140 web links (biographies, commentary) early recording devices 250 references “statistics” (numbers of the state): population, mortality census, surveys economic, social, moral, medical, . statistics → enable researchers to study themes, antecedants, influences, trends, etc. → Visual thinking Web version: http://www.math.yorku.ca/SCS/Gallery/milestone/ geometry, functions, mechanical diagrams, EDA Technology Present form: hyperlinked, chronological listing (HTML, PDF) Next: searchable by subject, content, author, country, etc. (LAT X XML) paper, printing, lithography, computing, displays, . E → APA 2003 2 Michael Friendly APA 2003 3 Michael Friendly Visions of the Past, Present and Future of Statistical Graphics milestone1 Visions of the Past, Present and Future of Statistical Graphics milestone1 APA 2003 4 Michael Friendly APA 2003 5 Michael Friendly Visions of the Past, Present and Future of Statistical Graphics milestone1 Visions of the Past, Present and Future of Statistical Graphics milestone1 APA 2003 6 Michael Friendly APA 2003 7 Michael Friendly Visions of the Past, Present and Future of Statistical Graphics milestone1 Visions of the Past, Present and Future of Statistical Graphics milestone1 APA 2003 8 Michael Friendly APA 2003 9 Michael Friendly Visions of the Past, Present and Future of Statistical Graphics milestone1 Visions of the Past, Present and Future of Statistical Graphics milestone1 Beginning of Modern Data Graphics: 1800–1849 Playfair’s linear arithmetic (1780–1800): line plot, pie chart, etc. Adolphe Quetelet (1835) ,“average man” as central tendency in a normal curve. Moral, social and medical statistics collected systematically (1820–) Dupin: distributions of years of schooling; prostitutes in Paris. APA 2003 10 Michael Friendly APA 2003 11 Michael Friendly Visions of the Past, Present and Future of Statistical Graphics milestone1 Visions of the Past, Present and Future of Statistical Graphics golden The Golden Age of Statistical Graphics Snow: map of cholera cases (Aug 31–Sep 8, 1854) Broad Street pump. → APA 2003 12 Michael Friendly APA 2003 13 Michael Friendly Visions of the Past, Present and Future of Statistical Graphics golden Visions of the Past, Present and Future of Statistical Graphics golden cf. Water in Walkerton: Outbreak of E. coli contamination (May 16–22, 2000) 6 “The Best Statistical Graphic Ever Produced” → died, > 2000 ill. Source: undetermined until Jan. 2001 No one thought to make a map! E-J Marey (1878): “defies the pen of the historian by its brutal eloquence”. Funkhouser (1937): Minard, the Playfair of France. Tufte (1983): “multivariate complexity integrated so gently that viewers are hardly aware that they are looking into a world of six dimensions ... the best statistical graphic ever produced.” APA 2003 14 Michael Friendly APA 2003 15 Michael Friendly Visions of the Past, Present and Future of Statistical Graphics golden Visions of the Past, Present and Future of Statistical Graphics album Flow maps as visual tools Movement of people and goods was a consistent theme of most of Minard’s work Data represented both visually and numerically Extensive legends, describing how the information should be understood and interpreted Visual engineer for France: the dawn of globalization, emergence of the modern French state. APA 2003 16 Michael Friendly APA 2003 17 Michael Friendly Visions of the Past, Present and Future of Statistical Graphics album Visions of the Past, Present and Future of Statistical Graphics album Minard’s graphic inventions Population represented by squares, area population ∼ Visual center of gravity used to choose location for new post office Carte figurative et approximative du mouvement des voyageurs sur les principal chemin de fer de l’Europe en 1862 (1865) [ENPC: 5862/C351] APA 2003 18 Michael Friendly APA 2003 19 Michael Friendly Visions of the Past, Present and Future of Statistical Graphics album Visions of the Past, Present and Future of Statistical Graphics march The March Re-visited March on Moscow was part of a pair, along with Hannibal’s campaign Aug. 1869: Prussian army invades, Minard flees to Bordeau Personal meaning: horrors of war, the human cost of thirst for military glory. APA 2003 20 Michael Friendly APA 2003 21 Michael Friendly Visions of the Past, Present and Future of Statistical Graphics album Visions of the Past, Present and Future of Statistical Graphics album L’Album de Statistique Graphique Why the Golden Age? The pinnacle of the Golden Age of Graphics 18 volumes published 1879–1899 Statistics as a discipline: Les Chevaliers des Album 1st International Statistics Congress (1853) [Quetelet] 3rd ISC: Expo. & standardization of graphical methods (Vienna, 1857) 1889: Gross receipts in theaters in Paris, 1848-1889 la Societ´ e´ de statistique de Paris (1860) Royal Statistical Society (1860) Expansion of industrialization, trade, transport government initiatives in data → collection and analysis. Statistics: Numbers of the State Ministry of Public Works (France): Statistical Bureau (Emile´ Chasson) Similar efforts in Germany, Switzerland, etc. U.S. Census Bureau (Edward Walker)— first US census (1860) APA 2003 22 Michael Friendly APA 2003 23 Michael Friendly Visions of the Past, Present and Future of Statistical Graphics album Visions of the Past, Present and Future of Statistical Graphics jsm APA 2003 24 Michael Friendly APA 2003 25 Michael Friendly Visions of the Past, Present and Future of Statistical Graphics jsm Visions of the Past, Present and Future of Statistical Graphics apart2 Visions of the Present Look not mournfully into the past. It comes not back again. Wisely improve the present. It is thine. Henry Wadsworth Longfellow Graphical methods for categorical data Fourfold displays Mosaic displays Diagnostic plots for GLIMs Graphical principles: Rendering and effect ordering Corrgrams Effect ordering for data display Other innovations JMP— Graphs as first-place objects; graphic scripting VISTA— dynamic graphics (spreadplots), workmaps ggobi R— interconnectivity Graphical→ excellence: e.g., linked micromaps (Dan Carr) God is in the details •NVIZN— Grammar of Graphics JAVA → APA 2003 26 Michael Friendly APA 2003 27 Michael Friendly Visions of the Past, Present and Future of Statistical Graphics show Visions of the Past, Present and Future of Statistical Graphics berkeley Fourfold displays for 2 2 tables × Quarter circles: radius √nij area frequency Independence: Adjoining∼ quadrants⇒ align∼ Graphical methods for categorical data Odds ratio: ratio of areas of diagonally≈ opposite cells Confidence rings: Visual test of H0 : θ = 1 adjoining rings overlap Visualizing Categorical Data (Friendly, 2000) Sex: Male ↔ 1198 1493 Goals: Develop graphical methods comparable to those used for quantitative data • Make them available and accessible in SAS Software • Visualizing odds ratios— Fourfold displays Visual fitting for loglinear models— Mosaic displays Visualizing model diagnostics for GLIMs— Influence plots Admit?: No Multi-variable overviews— Mosaic matrices Admit?: Yes See: http://www.math.yorku.ca/SCS/vcd/ 557 1278 Sex: Female Confidence rings do not overlap: θ = 1 6 APA 2003 28 Michael Friendly APA 2003 29 Michael Friendly Visions of the Past, Present and Future of Statistical Graphics berkeley Visions of the Past, Present and Future of Statistical Graphics berkeley Department: A Department: B Department: C Sex: Male Sex: Male Sex: Male 512 313 353 207 120 205 Admit?: No Admit?: No Admit?: No Fourfold displays for 2 2 k tables Admit?: Yes Admit?: Yes Admit?: Yes 89 19 17 8 202 391 × × Sex: Female Sex: Female Sex: Female Data had been pooled over departments Department: D Department: E Department: F Sex: Male Sex: Male Sex: Male 138 279 53 138 22 351 Stratified analysis: one fourfold display for each department Each 2 2 table standardized to equate marginal frequencies × Shading: highlight departments for which Ha : θi = 1 6 Admit?: No Admit?: No Admit?: No Admit?: Yes Admit?: Yes Admit?: Yes 131 244 94 299 24 317 Sex: Female Sex: Female Sex: Female Only one department (A) shows association; θA = 0.349 women (0.349)−1 = 2.86 times as likely as men to be admitted. → APA 2003 30 Michael Friendly APA 2003 31 Michael Friendly