99-008.Pdf (1.908Mb Application/Pdf)
Total Page:16
File Type:pdf, Size:1020Kb
A Framework for Information Visualization Spreadsheets Technical Report Department of Computer Science and Engineering University of Minnesota 4-192 EECS Building 200 Union Street SE Minneapolis, MN 55455-0159 USA TR 99-008 A Framework for Information Visualization Spreadsheets Ed Huai-hsin Chi March 09, 1999 UNIVERSITY OF MINNESOTA This is to certify that I have examined this copy of a doctoral thesis by Ed Huai-hsin Chi and have found that it is complete and satisfactory in all respects, and that any and all revisions required by the final examining committee have been made. John T. Riedl Name of Faculty Advisor Signature of Faculty Advisor March 8, 1999 Date GRADUATE SCHOOL A Framework for Information Visualization Spreadsheets ATHESIS SUBMITTED TO THE FACULTY OF THE GRADUATE SCHOOL OF THE UNIVERSITY OF MINNESOTA BY Ed Huai-hsin Chi IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF DOCTOR OFPHILOSOPHY March 1999 c Copyright Ed Huai-hsin Chi 1999 Declaration. The ideas described in this dissertation is patent–pending in a U.S. patent application entitled “Visualization Spreadsheet” (University of Minnesota Docket Number 99009, Merchant & Gould Law Firm Reference Number 600.373-US-01). Some portion of this thesis was previous published in IEEE conference proceedings and magazines. As per IEEE Copyright Policy (http://www.computer.org/forminfo.htm), I retained the right to reuse any portion of these copyrighted works, without fee, and in any future works. However, I am required to give the following notice: Some portion of this work is c 1997, 1998 IEEE, reprinted from Proceedings of the Symposium on Information Visualization ’97, July 1998 issue of IEEE Computer Graphics and Applications, and Pro- ceedings of the Symposium on Information Visualization ’98 by permission. i ACKNOWLEDGMENTS I would like to specially thank my advisor, John T. Riedl, for his mentorship, support, advice, guidance, encouragement, and above all, his friendship. Choosing him as my advi- sor was one of the best choices I have made in my life. In the last several years, I have also been fortunate to have received great mentor- ship from Dr. Stuart K. Card at Xerox Palo Alto Research Center (Xerox PARC), and I thank him for his belief in my abilities. I also thank my co–workers James Pitkow, Rich Gossweiler, Peter Pirolli, and Jock Mackinlay at Xerox PARC for their contribution to my education and the Web visualization analysis work. I thank Professors John Carlis, Joseph Konstan, and Phillip Barry of the Computer Science Department, and Libby Shoop and Ernest Retzel at the Computational Biology Centers, who have all contributed interesting ideas and discussions. I thank Dr. Gordon Legge of the Psychology Department, who introduced me to visual perceptual psychology. I thank my family for their continual support. I would not have had the strength to complete this work without the love and support of my big sister, Susan, and my friends, Chad, Brook, Ahna, and Kate. Completing a Ph. D. is like running a marathon. The ups and the downs of its various stages is impossible to endure without support from family and friends. I am forever grateful of their belief in me. ii DEDICATION To my family: my Mother, Sophia Hsiu-chieh Chen, my Father, Charles Chiu-lang Chi, my Sister, Su-shan Susan Chi, my Brother, Tim Ssu-yuan Chi, To my friends: my Soul Brothers, Chad Potter Juncker, and Brook Winston Kabanuk, my two Caring Friends, Ahnnie and Katie. Journey 1992–1998 You only see me through distorted lens. The purple iris is no longer purple. Through your torn lens, a distortion is all that we have. Around the corner I wept, and straight ahead I carved. Carved a path as I kept my tired soul on the road. As one phase blends into another, I find comfort in the silence of my heartbeat. Holding her in my arms, she rests. Unfolding souls in charm, I rest. iii Abstract Information has become interactive. Information visualization is the design and cre- ation of interactive graphic depictions of information by combining principles in the dis- ciplines of graphic design, cognitive science, and interactive computer graphics. As the volume and complexity of the data increases, users require more powerful visualization tools that allow them to more effectively explore large abstract datasets. This thesis seeks to make information visualization more accessible to potential users by creating a “Visualization Spreadsheet”, where each cell can contain an entire set of data represented using interactive graphics. Just as a numeric spreadsheet enables exploration of numbers, a visualization spreadsheet enables exploration of visual forms of informa- tion. Unlike numeric spreadsheets, which store only simple data elements and formulas in each cell, a cell in the Visualization Spreadsheet can hold an entire abstract data set, selection criteria, viewing specifications, and other information needed for a full-fledged information visualization. Similarly, intra-cell and inter-cell operations are far more com- plex, stretching beyond simple arithmetic and string operations to encompass a range of domain-specific operators. The complexity of operations and interactions requires a visualization framework that is easily understandable to both end-users and visualization designers. This thesis devel- ops and discusses the general utility of a novel visualization framework, and validates the framework by applying it to various visualization techniques and showing several systems that illustrate some of these research issues. We show that the spreadsheet approach facili- tates certain visual user tasks that are more difficult using other approaches. The underlying approach in our work allows domain experts to define new data types and data operations, and enables visualization experts to incorporate new visualizations, viewing parameters, and view operations. iv Keywords: Information Visualization, Visualization, Spreadsheet, Interactive Graphics, Visualization Systems, User Interface, User Interactions, Visualization Framework, Visu- alization Model, Operators, View/value, Data State Model, Date Flow Model, Visualiza- tion Design, Application Extensibility, Sensemaking, Biological Sequence Analysis, Al- gorithm Visualization, World-Wide Web, Content Usage Structure Analysis, Information Ecologies. v TABLE OF CONTENTS List of Figures ix List of Tables xi Chapter 1: Introduction 1 1.1 Motivation................................... 1 1.2 VisualizationSpreadsheetDefined ...................... 4 1.3 CognitiveAdvantagesoftheVisualizationSpreadsheet ........... 5 1.3.1 Visual Information Processing .................... 5 1.3.2 Sensemaking, Information Ecology and Foraging Theory . 6 1.3.3 Levels of Knowledge ..... .................... 7 1.3.4 Advantages of Visual Sensemaking in the Visualization Spreadsheet 8 1.4 ResearchVision................................ 9 1.5 ResearchApproach.............................. 11 1.6 Overview ................................... 13 Chapter 2: Related Work 15 2.1 ReferenceModelforInformationVisualization............... 15 2.2 Spreadsheet.................................. 18 2.3 Summary ................................... 23 Chapter 3: Illustrated Principles, Scenarios and Detailed Case Study 25 3.1 Original Data Domain: Genetic Sequence Similarity ... ......... 26 vi 3.2 OtherDataDomains ............................. 31 3.2.1 Time-series Matrices ..... .................... 32 3.2.2 Algorithm Visualization ... .................... 33 3.3 IllustratedPrinciples ............................. 33 3.3.1 Derive Comparison Data Sets .................... 34 3.3.2 Apply Operators in Parallel . .................... 40 3.3.3 Extract Multiple Visual Features Simultaneously . ......... 44 3.3.4 Create Analysis Templates .. .................... 46 3.3.5 Update Automatically via Dependency Links ... ......... 48 3.3.6 Mapping Value to Structure using Custom Layouts ......... 50 3.3.7 Use Both Direct Manipulation and Command Languages . 51 3.4 Detailed Case Study: Web Analysis Visualization Spreadsheet . 54 3.4.1 Visualization of Web Space . .................... 54 3.4.2 Real-World Analysis Scenarios ... ................ 55 3.5 Summary ................................... 63 Chapter 4: Information Visualization Operator Framework 67 4.1 TheNeedforanOperatorFramework .................... 68 4.1.1 Problems from End-Users’ Perspective ...... ......... 69 4.1.2 Problems from Designers’ Perspective ................ 70 4.1.3 Operator Framework Helps Users and Designers . ......... 72 4.2 Fundamental Properties of Operators . .................... 73 4.2.1 View versus Value ...... .................... 74 4.2.2 Operational versus Functional Similarity ..... ......... 74 4.3 ADataStateModelforVisualizationOperators............... 76 4.3.1 Visualization Pipeline .... .................... 76 4.3.2 Data State Model ........................... 78 vii 4.3.3 Example: Web Analysis in the Visualization Framework . 81 4.4 ClassificationofOperatorsusingtheFramework............... 85 4.4.1 Example: Web Analysis Visualization Operators . ......... 88 4.5 PropertiesoftheFramework ......................... 88 4.5.1 View versus Value ...... .................... 90 4.5.2 Applicability of Operators .. .................... 92 4.5.3 Operator-Centric Approach . .................... 93 4.5.4 Direct Manipulation ..... .................... 94 4.5.5 Implementation Choices ... .................... 95 4.6 Discussion................................... 96 4.6.1 Three Classes of Users of this Framework ..... ........