To Draw a Tree

Total Page:16

File Type:pdf, Size:1020Kb

To Draw a Tree To Draw a Tree Pat Hanrahan Computer Science Department Stanford University Motivation Hierarchies File systems and web sites Organization charts Categorical classifications Similiarity and clustering Branching processes Genealogy and lineages Phylogenetic trees Decision processes Indices or search trees Decision trees Tournaments Page 1 Tree Drawing Simple Tree Drawing Preorder or inorder traversal Page 2 Rheingold-Tilford Algorithm Information Visualization Page 3 Tree Representations Most Common … Page 4 Tournaments! Page 5 Second Most Common … Lineages Page 6 http://www.royal.gov.uk/history/trees.htm Page 7 Demonstration Saito-Sederberg Genealogy Viewer C. Elegans Cell Lineage [Sulston] Page 8 Page 9 Page 10 Page 11 Evolutionary Trees [Haeckel] Page 12 Page 13 [Agassiz, 1883] 1989 Page 14 Chapple and Garofolo, In Tufte [Furbringer] Page 15 [Simpson]] [Gould] Page 16 Tree of Life [Haeckel] [Tufte] Page 17 Janvier, 1812 “Graphical Excellence is nearly always multivariate” Edward Tufte Page 18 Phenograms to Cladograms GeneBase Page 19 http://www.gwu.edu/~clade/spiders/peet.htm Page 20 Page 21 The Shape of Trees Page 22 Patterns of Evolution Page 23 Hierachical Databases Stolte and Hanrahan, Polaris, InfoVis 2000 Page 24 Generalization • Aggregation • Simplification • Filtering Abstraction Hierarchies Datacubes Star and Snowflake Schemes Page 25 Memory & Code Cache misses for a procedure for 10 million cycles White = not run Grey = no misses Red = # misses y-dimension is source code x-dimension is cycles (time) Memory & Code zooming on y zooms from fileprocedurelineassembly code zooming on x increases time resolution down to one cycle per bar Page 26 Themes Cognitive Principles for Design Congruence Principle: The structure and content of the external representation should correspond to the desired structure and content of the internal representation. Apprehension Principle: The structure and content of the external representation should be readily and accurately perceived and comprehended. [B. Tversky in press] Page 27 Metaphors Survey the space of designs Best metaphors have been refined over time (and have stood the test of time) Need to understand and appreciate what makes metaphors effective It takes both creativity and experience, algorithms and aesthetics, to create new metaphors. Often one element is lacking “Graphical excellence is multivariate,” therefore mix your metaphors Depiction How to map information to graphics Central problem in visualization Formal systems needed Automation and composition Need high-level tools to make this easy Key theorists: Bertin, Cleveland, Mackinlay, MacEachren, Wilkinson Page 28 Challenge Best visualizations designed by humans Computer mediated communication is becoming ubiquitous Therefore: Visualizations are regressing Challenge: Develop algorithms that produce high-quality, effective designs Acknowledgements Tamara Munzner Chris Stolte Diane Tang Maneesh Agrawala Barbara Tversky Page 29.
Recommended publications
  • Evolution of the Infographic
    EVOLUTION OF THE INFOGRAPHIC: Then, now, and future-now. EVOLUTION People have been using images and data to tell stories for ages—long before the days of the Internet, smartphones, and Excel. In fact, the history of infographics pre-dates the web by more than 30,000 years with the earliest forms of these visuals being cave paintings that helped early humans find food, resources, and shelter. But as technology has advanced, so has our ability to tell meaningful stories. Here’s a look into the evolution of modern infographics—where they’ve been, how they’ve evolved, and where they’re headed. Then: Printed, static infographics The 20th Century introduced the infographic—a staple for how we communicate, visualize, and share information today. Early on, these print graphics married illustration and data to communicate information in a revolutionary way. ADVANTAGE Design elements enable people to quickly absorb information previously confined to long paragraphs of text. LIMITATION Static infographics didn’t allow for deeper dives into the data to explore granularities. Hoping to drill down for more detail or context? Tough luck—what you see is what you get. Source: http://www.wired.co.uk/news/archive/2012-01/16/painting- by-numbers-at-london-transport-museum INFOGRAPHICS THROUGH THE AGES DOMO 03 Now: Web-based, interactive infographics While the first wave of modern infographics made complex data more consumable, web-based, interactive infographics made data more explorable. These are everywhere today. ADVANTAGE Everyone looking to make data an asset, from executives to graphic designers, are now building interactive data stories that deliver additional context and value.
    [Show full text]
  • Domain-Specific Programming Systems
    Lecture 22: Domain-Specific Programming Systems Parallel Computer Architecture and Programming CMU 15-418/15-618, Spring 2020 Slide acknowledgments: Pat Hanrahan, Zach Devito (Stanford University) Jonathan Ragan-Kelley (MIT, Berkeley) Course themes: Designing computer systems that scale (running faster given more resources) Designing computer systems that are efficient (running faster under constraints on resources) Techniques discussed: Exploiting parallelism in applications Exploiting locality in applications Leveraging hardware specialization (earlier lecture) CMU 15-418/618, Spring 2020 Claim: most software uses modern hardware resources inefficiently ▪ Consider a piece of sequential C code - Call the performance of this code our “baseline performance” ▪ Well-written sequential C code: ~ 5-10x faster ▪ Assembly language program: maybe another small constant factor faster ▪ Java, Python, PHP, etc. ?? Credit: Pat Hanrahan CMU 15-418/618, Spring 2020 Code performance: relative to C (single core) GCC -O3 (no manual vector optimizations) 51 40/57/53 47 44/114x 40 = NBody 35 = Mandlebrot = Tree Alloc/Delloc 30 = Power method (compute eigenvalue) 25 20 15 10 5 Slowdown (Compared to C++) Slowdown (Compared no data no 0 data no Java Scala C# Haskell Go Javascript Lua PHP Python 3 Ruby (Mono) (V8) (JRuby) Data from: The Computer Language Benchmarks Game: CMU 15-418/618, http://shootout.alioth.debian.org Spring 2020 Even good C code is inefficient Recall Assignment 1’s Mandelbrot program Consider execution on a high-end laptop: quad-core, Intel Core i7, AVX instructions... Single core, with AVX vector instructions: 5.8x speedup over C implementation Multi-core + hyper-threading + AVX instructions: 21.7x speedup Conclusion: basic C implementation compiled with -O3 leaves a lot of performance on the table CMU 15-418/618, Spring 2020 Making efficient use of modern machines is challenging (proof by assignments 2, 3, and 4) In our assignments, you only programmed homogeneous parallel computers.
    [Show full text]
  • Minard's Chart of Napolean's Campaign
    Minard's Chart of Napolean's Campaign Charles Joseph Minard (French: [minaʁ]; 27 March 1781 – 24 October 1870) was a French civil engineer recognized for his significant contribution in the field of information graphics in civil engineering and statistics. Minard was, among other things, noted for his representation of numerical data on geographic maps. Charles Minard's map of Napoleon's disastrous Russian campaign of 1812. The graphic is notable for its representation in two dimensions of six types of data: the number of Napoleon's troops; distance; temperature; the latitude and longitude; direction of travel; and location relative to specific dates.[2] Wikipedia (n.d.). Charles Minard's 1869 chart showing the number of men in Napoleon’s 1812 Russian campaign army, their movements, as well as the temperature they encountered on the return path. File:Minard.png. https:// en.wikipedia.org/wiki/File:Minard.png The original description in French accompanying the map translated to English:[3] Drawn by Mr. Minard, Inspector General of Bridges and Roads in retirement. Paris, 20 November 1869. The numbers of men present are represented by the widths of the colored zones in a rate of one millimeter for ten thousand men; these are also written beside the zones. Red designates men moving into Russia, black those on retreat. — The informations used for drawing the map were taken from the works of Messrs. Thiers, de Ségur, de Fezensac, de Chambray and the unpublished diary of Jacob, pharmacist of the Army since 28 October. Recognition Modern information
    [Show full text]
  • Capítulo 1 Análise De Sentimentos Utilizando Técnicas De Classificação Multiclasse
    XII Simpósio Brasileiro de Sistemas de Informação De 17 a 20 de maio de 2016 Florianópolis – SC Tópicos em Sistemas de Informação: Minicursos SBSI 2016 Sociedade Brasileira de Computação – SBC Organizadores Clodis Boscarioli Ronaldo dos Santos Mello Frank Augusto Siqueira Patrícia Vilain Realização INE/UFSC – Departamento de Informática e Estatística/ Universidade Federal de Santa Catarina Promoção Sociedade Brasileira de Computação – SBC Patrocínio Institucional CAPES – Coordenação de Aperfeiçoamento de Pessoal de Nível Superior CNPq - Conselho Nacional de Desenvolvimento Científico e Tecnológico FAPESC - Fundação de Amparo à Pesquisa e Inovação do Estado de Santa Catarina Catalogação na fonte pela Biblioteca Universitária da Universidade Federal de Santa Catarina S612a Simpósio Brasileiro de Sistemas de Informação (12. : 2016 : Florianópolis, SC) Anais [do] XII Simpósio Brasileiro de Sistemas de Informação [recurso eletrônico] / Tópicos em Sistemas de Informação: Minicursos SBSI 2016 ; organizadores Clodis Boscarioli ; realização Departamento de Informática e Estatística/ Universidade Federal de Santa Catarina ; promoção Sociedade Brasileira de Computação (SBC). Florianópolis : UFSC/Departamento de Informática e Estatística, 2016. 1 e-book Minicursos SBSI 2016: Tópicos em Sistemas de Informação Disponível em: http://sbsi2016.ufsc.br/anais/ Evento realizado em Florianópolis de 17 a 20 de maio de 2016. ISBN 978-85-7669-317-8 1. Sistemas de recuperação da informação Congressos. 2. Tecnologia Serviços de informação Congressos. 3. Internet na administração pública Congressos. I. Boscarioli, Clodis. II. Universidade Federal de Santa Catarina. Departamento de Informática e Estatística. III. Sociedade Brasileira de Computação. IV. Título. CDU: 004.65 Prefácio Dentre as atividades de Simpósio Brasileiro de Sistemas de Informação (SBSI) a discussão de temas atuais sobre pesquisa e ensino, e também sua relação com a indústria, é sempre oportunizada.
    [Show full text]
  • The Mission of IUPUI Is to Provide for Its Constituents Excellence in  Teaching and Learning;  Research, Scholarship, and Creative Activity; and  Civic Engagement
    INFO H517 Visualization Design, Analysis, and Evaluation Department of Human-Centered Computing Indiana University School of Informatics and Computing, Indianapolis Fall 2016 Section No.: 35557 Credit Hours: 3 Time: Wednesdays 12:00 – 2:40 PM Location: IT 257, Informatics & Communications Technology Complex 535 West Michigan Street, Indianapolis, IN 46202 [map] First Class: August 24, 2016 Website: http://vis.ninja/teaching/h590/ Instructor: Khairi Reda, Ph.D. in Computer Science (University of Illinois, Chicago) Assistant Professor, Human–Centered Computing Office Hours: Mondays, 1:00-2:30PM, or by Appointment Office: IT 581, Informatics & Communications Technology Complex 535 West Michigan Street, Indianapolis, IN 46202 [map] Phone: (317) 274-5788 (Office) Email: [email protected] Website: http://vis.ninja/ Prerequisites: Prior programming experience in a high-level language (e.g., Java, JavaScript, Python, C/C++, C#). COURSE DESCRIPTION This is an introductory course in design and evaluation of interactive visualizations for data analysis. Topics include human visual perception, visualization design, interaction techniques, and evaluation methods. Students develop projects to create their own web- based visualizations and develop competence to undertake independent research in visualization and visual analytics. EXTENDED COURSE DESCRIPTION This course introduces students to interactive data visualization from a human-centered perspective. Students learn how to apply principles from perceptual psychology, cognitive science, and graphics design to create effective visualizations for a variety of data types and analytical tasks. Topics include fundamentals of human visual perception and cognition, 1 2 graphical data encoding, visual representations (including statistical plots, maps, graphs, small-multiples), task abstraction, interaction techniques, data analysis methods (e.g., clustering and dimensionality reduction), and evaluation methods.
    [Show full text]
  • Computational Photography
    Computational Photography George Legrady © 2020 Experimental Visualization Lab Media Arts & Technology University of California, Santa Barbara Definition of Computational Photography • An R & D development in the mid 2000s • Computational photography refers to digital image-capture cameras where computers are integrated into the image-capture process within the camera • Examples of computational photography results include in-camera computation of digital panoramas,[6] high-dynamic-range images, light- field cameras, and other unconventional optics • Correlating one’s image with others through the internet (Photosynth) • Sensing in other electromagnetic spectrum • 3 Leading labs: Marc Levoy, Stanford (LightField, FrankenCamera, etc.) Ramesh Raskar, Camera Culture, MIT Media Lab Shree Nayar, CV Lab, Columbia University https://en.wikipedia.org/wiki/Computational_photography Definition of Computational Photography Sensor System Opticcs Software Processing Image From Shree K. Nayar, Columbia University From Shree K. Nayar, Columbia University Shree K. Nayar, Director (lab description video) • Computer Vision Laboratory • Lab focuses on vision sensors; physics based models for vision; algorithms for scene interpretation • Digital imaging, machine vision, robotics, human- machine interfaces, computer graphics and displays [URL] From Shree K. Nayar, Columbia University • Refraction (lens / dioptrics) and reflection (mirror / catoptrics) are combined in an optical system, (Used in search lights, headlamps, optical telescopes, microscopes, and telephoto lenses.) • Catadioptric Camera for 360 degree Imaging • Reflector | Recorded Image | Unwrapped [dance video] • 360 degree cameras [Various projects] Marc Levoy, Computer Science Lab, Stanford George Legrady Director, Experimental Visualization Lab Media Arts & Technology University of California, Santa Barbara http://vislab.ucsb.edu http://georgelegrady.com Marc Levoy, Computer Science Lab, Stanford • Light fields were introduced into computer graphics in 1996 by Marc Levoy and Pat Hanrahan.
    [Show full text]
  • Surfacing Visualization Mirages
    Surfacing Visualization Mirages Andrew McNutt Gordon Kindlmann Michael Correll University of Chicago University of Chicago Tableau Research Chicago, IL Chicago, IL Seattle, WA [email protected] [email protected] [email protected] ABSTRACT In this paper, we present a conceptual model of these visual- Dirty data and deceptive design practices can undermine, in- ization mirages and show how users’ choices can cause errors vert, or invalidate the purported messages of charts and graphs. in all stages of the visual analytics (VA) process that can lead These failures can arise silently: a conclusion derived from to untrue or unwarranted conclusions from data. Using our a particular visualization may look plausible unless the an- model we observe a gap in automatic techniques for validating alyst looks closer and discovers an issue with the backing visualizations, specifically in the relationship between data data, visual specification, or their own assumptions. We term and chart specification. We address this gap by developing a such silent but significant failures visualization mirages. We theory of metamorphic testing for visualization which synthe- describe a conceptual model of mirages and show how they sizes prior work on metamorphic testing [92] and algebraic can be generated at every stage of the visual analytics process. visualization errors [54]. Through this combination we seek to We adapt a methodology from software testing, metamorphic alert viewers to situations where minor changes to the visual- testing, as a way of automatically surfacing potential mirages ization design or backing data have large (but illusory) effects at the visual encoding stage of analysis through modifications on the resulting visualization, or where potentially important to the underlying data and chart specification.
    [Show full text]
  • Using Visualization to Understand the Behavior of Computer Systems
    USING VISUALIZATION TO UNDERSTAND THE BEHAVIOR OF COMPUTER SYSTEMS A DISSERTATION SUBMITTED TO THE DEPARTMENT OF COMPUTER SCIENCE AND THE COMMITTEE ON GRADUATE STUDIES OF STANFORD UNIVERSITY IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF DOCTOR OF PHILOSOPHY Robert P. Bosch Jr. August 2001 c Copyright by Robert P. Bosch Jr. 2001 All Rights Reserved ii I certify that I have read this dissertation and that in my opinion it is fully adequate, in scope and quality, as a dissertation for the degree of Doctor of Philosophy. Dr. Mendel Rosenblum (Principal Advisor) I certify that I have read this dissertation and that in my opinion it is fully adequate, in scope and quality, as a dissertation for the degree of Doctor of Philosophy. Dr. Pat Hanrahan I certify that I have read this dissertation and that in my opinion it is fully adequate, in scope and quality, as a dissertation for the degree of Doctor of Philosophy. Dr. Mark Horowitz Approved for the University Committee on Graduate Studies: iii Abstract As computer systems continue to grow rapidly in both complexity and scale, developers need tools to help them understand the behavior and performance of these systems. While information visu- alization is a promising technique, most existing computer systems visualizations have focused on very specific problems and data sources, limiting their applicability. This dissertation introduces Rivet, a general-purpose environment for the development of com- puter systems visualizations. Rivet can be used for both real-time and post-mortem analyses of data from a wide variety of sources. The modular architecture of Rivet enables sophisticated visualiza- tions to be assembled using simple building blocks representing the data, the visual representations, and the mappings between them.
    [Show full text]
  • Visual Analytics Application
    Selecting a Visual Analytics Application AUTHORS: Professor Pat Hanrahan Stanford University CTO, Tableau Software Dr. Chris Stolte VP, Engineering Tableau Software Dr. Jock Mackinlay Director, Visual Analytics Tableau Software Name Inflation: Visual Analytics Visual analytics is becoming the fastest way for people to explore and understand data of any size. Many companies took notice when Gartner cited interactive data visualization as one of the top five trends transforming business intelligence. New conferences have emerged to promote research and best practices in the area, including VAST (Visual Analytics Science & Technology), organized by the 100,000 member IEEE. Technologies based on visual analytics have moved from research into widespread use in the last five years, driven by the increased power of analytical databases and computer hardware. The IT departments of leading companies are increasingly recognizing the need for a visual analytics standard. Not surprisingly, everywhere you look, software companies are adopting the terms “visual analytics” and “interactive data visualization.” Tools that do little more than produce charts and dashboards are now laying claim to the label. How can you tell the cleverly named from the genuine? What should you look for? It’s important to know the defining characteristics of visual analytics before you shop. This paper introduces you to the seven essential elements of true visual analytics applications. Figure 1: There are seven essential elements of a visual analytics application. Does a true visual analytics application also include standard analytical features like pivot-tables, dashboards and statistics? Of course—all good analytics applications do. But none of those features captures the essence of what visual analytics is bringing to the world’s leading companies.
    [Show full text]
  • Interactive Visualization of Large Graphs and Networks
    INTERACTIVE VISUALIZATION OF LARGE GRAPHS AND NETWORKS A DISSERTATION SUBMITTED TO THE DEPARTMENT OF COMPUTER SCIENCE AND THE COMMITTEE ON GRADUATE STUDIES OF STANFORD UNIVERSITY IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF DOCTOR OF PHILOSOPHY Tamara Munzner June 2000 c 2000 by Tamara Munzner All Rights Reserved ii I certify that I have read this dissertation and that in my opinion it is fully adequate, in scope and quality, as a dissertation for the degree of Doctor of Philosophy. Pat Hanrahan (Principal Adviser) I certify that I have read this dissertation and that in my opinion it is fully adequate, in scope and quality, as a dissertation for the degree of Doctor of Philosophy. Marc Levoy I certify that I have read this dissertation and that in my opinion it is fully adequate, in scope and quality, as a dissertation for the degree of Doctor of Philosophy. Terry Winograd I certify that I have read this dissertation and that in my opinion it is fully adequate, in scope and quality, as a dissertation for the degree of Doctor of Philosophy. Stephen North (AT&T Research) Approved for the University Committee on Graduate Studies: iii Abstract Many real-world domains can be represented as large node-link graphs: backbone Internet routers connect with 70,000 other hosts, mid-sized Web servers handle between 20,000 and 200,000 hyperlinked documents, and dictionaries contain millions of words defined in terms of each other. Computational manipulation of such large graphs is common, but previous tools for graph visualization have been limited to datasets of a few thousand nodes.
    [Show full text]
  • The Visual Display of Quantitative Information, 2E
    2 Lecture 34 of 41 Lecture Outline Visualization, Part 1 of 3: Reading for Last Class: Chapter 15, Eberly 2e; Ray Tracing Handout Data (Quantities & Evidence) Reading for Today: Tufte Handout Reading for Next Class: Ray Tracing Handout Last Time: Ray Tracing 2 of 2 William H. Hsu Stochastic & distributed RT Department of Computing and Information Sciences, KSU Stochastic (local) vs. distributed (nonlocal) randomization “Softening” shadows, reflection, transparency KSOL course pages: http://bit.ly/hGvXlH / http://bit.ly/eVizrE Public mirror web site: http://www.kddresearch.org/Courses/CIS636 Hybrid global illumination: RT with progressive refinement radiosity Instructor home page: http://www.cis.ksu.edu/~bhsu Today: Visualization Part 1 of 3 – Scientific, Data, Information Vis What is visualization? Readings: Tufte 1: The Visual Display of Quantitative Information, 2 e Last class: Chapter 15, Eberly 2e – see http://bit.ly/ieUq45; Ray Tracing Handout Basic statistical & scientific visualization techniques Today: Tufte Handout 1 Next class: Ray Tracing Handout Graphical integrity vs. lie factor (“How to lie with statisticsvis”) Wikipedia, Visualization: http://bit.ly/gVxRFp Graphical excellence vs. chartjunk Wikipedia, Data Visualization: http://bit.ly/9icAZk Data-ink, data-ink ratio (& “data-pixels”) CIS 536/636 Computing & Information Sciences CIS 536/636 Computing & Information Sciences Lecture 34 of 41 Lecture 34 of 41 Introduction to Computer Graphics Kansas State University Introduction to Computer Graphics Kansas State
    [Show full text]
  • Visualisation and Its Importance in the Analysis of Big Data Stefano De Francisci
    Visualisation and its importance in the analysis of Big Data Stefano De Francisci THE CONTRACTOR IS ACTING UNDER A FRAMEWORK CONTRACT CONCLUDED WITH THE COMMISSION Eurostat Outline Introduction Visual cognition process Graphic information processing Big Data visualization Storytelling Eurostat Outline Introduction Visual cognition process Graphic information processing Big Data visualization Storytelling Eurostat To be learned #1: Data-ink ratio “Above all else show the data” (Edward Tufte) http://www.infovis-wiki.net/index.php/Data-Ink_Ratio Eurostat To be learned #2: Visual information-seeking mantra “Overview first, zoom and filter, then details-on-demand” (Ben Shneiderman) 1. Overview 2. Zoom 3. Filter 4. Details-on-demand 5. Relate 6. History 7. Extracts Eurostat To be learned #3: Storytelling, or the weaving of a narrative “Don't just get to the point – start with it” (David Marder) Gapminder «We all have important stories to (Hans Rosling) tell. Some stories involve numbers» «Data are not intrinsically boring. Neither are intrinsically interesting.» (S. Few) NoiItalia (Istat) «Behind the scene stands the Young people and storyteller, but behind the storyteller the jobs crisis in stands a community of memory» numbers (H. Arendt) (Oecd) Eurostat Why we need to visualize information? «By visualizing information, we turn it Not to get lost in into a landscape that you can explore with the data your eyes, a sort of information map. And when you’re lost in information, an information map is kind of useful.» (D. McCandless) http://www.brainyquote.com/quotes/quotes/d/davidmccan630550.html Visualization of [Big] Data gives you a way So… we can quote «to find things that you had no theory Baudelaire: about and no statistical models to identify, «Le beau est toujours but with visualization it jumps right out at bizarre» you and says, ‘This is bizarre’ » (B.
    [Show full text]