Information Visualization and Visual Analytics

Total Page:16

File Type:pdf, Size:1020Kb

Information Visualization and Visual Analytics Information Visualization and Visual Analytics Pekka Wartiainen University of Jyv¨askyl¨a pekka.wartiainen@jyu.fi 23.4.2014 Outline Objectives Introduction Visual Analytics Information Visualization Our research Summary Learning objectives I To understand the definition of visual analytics. I To be aware with visual analytics approach in problem solving. I To understand the basics of data visualization. Motivation I Raw data has no value in itself, only the extracted information has value I Time and money are wasted and opportunities are lost I Success depends on availability of the right information I Visual analytics aims at making data and information processing transparent I Visual analytics combines the strengths of humans and computers An historical perspective on visual analytics I Early visual analytics: exploratory data analysis I Visual data exploration and visual data mining I First book of visual analytics: Illuminating the Path, 2005 I Some earlier systems exhibited the characteristics of visual analytics I CoCo system for improving silicon chips, 1990 Past few years I VisMaster is an European Coordination Action Project I Web-page: I URL: Visual-Analytics.EU I Book: I URL: Mastering the information age - solving problems with visual analytics I YouTube video: I URL: Inria - Vismaster, visual analytics Visual analytics Definition Visual analytics combines automated analysis techniques with interactive visualisations for an effective understanding, reasoning and decision making on the basis of very large and complex datasets. Timeline Application of visual analytics I First application area was security I Many major application areas I physics, astronomy, medicine, climate, . Example: business intelligence I Financial market generates large amounts of data on a daily basis {> extremely high data volumes over the years I More than 300 million VISA credit card transactions per day I Multiple perspectives and assumptions for analysis I history, current situation, monitoring, forecasting, recurring situations Visual analytics { Coordinated Graph Visualization Visual support for the simulation of climate models provided by CGV, a highly interactive graph visualization system. Visual analytics { NFlowVis Analysis of a distributed network attack. The visual analytics process Process model of visual analytics. Building blocks of visual analytics research Visual analytics integrates science and technology from many disciplines. Evaluation I Evaluation include techniques, methods, modes and theories as well as software tools I Challenge: often processing data from the real world I Evaluation involves users, tasks and data I Especially in the industry, the domain expert has the best knowledge {> Empirical evaluation I Evaluation criteria, e.g.: I effectiveness I efficiency I user satisfaction I Importance of documentation is emphasized Infrastructure I Visual analytics is both user-driven and data-driven I Current challenges: lack of interaction and dynamic data I Limitations of traditional data bases I Old fashioned `architectural reference model' I Big data solutions I Need for: I Fast imprecise answers with progressive refinement I Incremental re-computation, either in the data (e.g., some data has been changed) or in the analysis parameters I Steering the computation towards data regions that are of higher interest to the user. Data management { Why? I The big opportunity of the Information Age I Many obstacles need to be overcome I Heterogeneity of data sources I Different data types I Data streams I Working under pressure I Time consuming activities I Data management ensures data consistency and standards Data management { VA aspects I Data and semantic integration I Utilizing known processing methods I Data Warehousing, OLAP and Data Mining I Data reduction and abstraction I Data quality is crucial (cf. GIGO model) I Visual techniques for exploring data Space and time I In large systems, space and time are essential {> complexity increases I Space and time are more than just numbers I Specific properties: I Dependencies between observations I Uncertainty I Scale I Time I Spatial approaches: Cartography, GIS, Geovisualization I Representation of time: visualization of time-related data and time itself I Interactive visualizations I Big data cases { dimension reduction Space and time { OECD eXplorer Allows to explore regional statistics data from OECD URL:Organisation for Economic Cooperation and Development Data mining I Humans are required in the data analysis process I New tools and methodologies are necessary to help experts extract relevant information I Limitations in KDD process and visualizations I Combination of multidisciplinary approaches I Pattern identification methods I Spatio-temporal data mining I Many software have been developed Perception and cognitive aspects { visualization I The human is at the heart of visual analytics human interaction, analysis, intuition, problem solving and visual perception. I Distinction between high and low-level vision I Humans do not have to remember everything but extract visual clues from the environment Pre-attentive processing makes items pop out the display automatically. Data visualization I Fast and understandable way to present data to a user I Data mining methods as pre-processing tools I Many visualization methods existing I JFreeChart I Google Charts I Remember how not to use visualization techniques I Dynamic behavior of the data sets special requirements I Data visualization is part of information visualization GUI design I Visual analytics has high demand for GUI I Scalable and interactive interface I General guidelines for different purposes I Windows, OS X, Android, . I Online solutions I Define target group before designing the GUI I Multidisciplinary research groups I Personalized user roles Common interaction I select : mark data items of interest, possible followed by another operation, I explore : show some other data e.g., panning, zoom, resampling, I reconfigure : rearrange the data spatially e.g., sort, change attribute assigned to axis, rotate (3D), slide, I encode : change visual appearance e.g., change type of representation (view), adjust colour/size/shape, I abstract/elaborate : show more or less detail e.g., details on demand, tooltips, geometric zoom, I filter : select or show data matching certain conditions, I connect : highlight related data items e.g., brushing (selection shown in multiple views). Using colors I Powerful element in visualization I Wrong usage of colors is disturbing I Color Usage Research Lab I NASA Ames research center I Ready made color palettes are solid alternatives Visual analytics in energy production I Application area: BFB boiler burning biomass I Co-operation with VTT, department of chemistry, and private companies I Funded by Regional Council of Central Finland I Time-series data measured from the different parts of the process I Context-sensitive framework approach I Matlab routines with Java GUI People included into process The human context of visual analytics. Summary I Visual analytics for multidisciplinary research problems I Visualization, data analysis, user interaction I Highly interactive interfaces I The whole process should be taken into account I Many challenges still existing, especially with big and dynamic data I Humans are part of the process References D. Keim, J. Kohlhammer, G. Ellis ja F. Mansmann, Mastering the Information Age: Solving Problems with Visual Analytics, Eurographics Association, Germany, 2010. P. J¨arvinen,K. Puolam¨aki,P. Siltanen ja M. Yliker¨al¨a,Visual Analytics, Technical report, VTT, Finland, 2009. P. Wartiainen, T. K¨arkk¨ainen,A. Heimb¨urger,ja S. Ayr¨am¨o.¨ Context-sensitive approach to dynamic visual analytics of energy production processes. In 22th European-Japanese Conference on Information Modelling and Knowledge Bases. MATFYZPRESS - Univerzity Karlovy, 2012..
Recommended publications
  • Enhancing Usability Evaluation of Web-Based Geographic Information
    Enhancing Usability Evaluation of Web-Based Geographic Information Systems (WebGIS) with Visual Analytics René Unrau Institute for Geoinformatics, University of Münster, Germany Christian Kray Institute for Geoinformatics, University of Münster, Germany Abstract Many websites nowadays incorporate geospatial data that users interact with, for example, to filter search results or compare alternatives. These web-based geographic information systems (WebGIS) pose new challenges for usability evaluations as both the interaction with classic interface elements and with map-based visualizations have to be analyzed to understand user behavior. This paper proposes a new scalable approach that applies visual analytics to logged interaction data with WebGIS, which facilitates the interactive exploration and analysis of user behavior. In order to evaluate our approach, we implemented it as a toolkit that can be easily integrated into existing WebGIS. We then deployed the toolkit in a user study (N=60) with a realistic WebGIS and analyzed users’ interaction in a second study with usability experts (N=7). Our results indicate that the proposed approach is practically feasible, easy to integrate into existing systems, and facilitates insights into the usability of WebGIS. 2012 ACM Subject Classification Human-centered computing → User studies; Human-centered computing → Usability testing Keywords and phrases map interaction, usability evaluation, visual analytics Digital Object Identifier 10.4230/LIPIcs.GIScience.2021.I.15 Supplementary Material The source code is publicly available at https://github.com/ReneU/ session-viewer and includes configuration instructions for use in other scenarios. 1 Introduction Geospatial data has become a critical backbone of many web services available today, such as search engines, online booking sites, or open data portals [29].
    [Show full text]
  • Catalogue Description
    INF 454: Data Visualization and User Interface Design Spring 2016 Syllabus Day/Times: TBD (4 Units) Location: TBD Instructor: Dr. Luciano Nocera Email: [email protected] Phone: (213) 740-9819 Office: PHE 412 Course TA: TBD Email: TBD Office Hours: TBD IT Support: TBD Email: TBD Office Hours: TBD Instructor’s Office Hours: TBD; other hours by appointment only. Students are advised to make appointments ahead of time in any event and be specific with the subject matter to be discussed. Students should also be prepared for their appointment by bringing all applicable materials and information. Catalogue Description One of the cornerstones of analytics is presenting the data to customers in a usable fashion. When considering the design of systems that will perform data analytic functions, both the interface for the user and the graphical depictions of data are of utmost importance, as it allows for more efficient and effective processing, leading to faster and more accurate results. To foster the best tools possible, it is important for designers to understand the principles of user interfaces and data visualization as the tools they build are used by many people - with technical and non-technical background - to perform their work. In this course, students will apply the fundamentals and techniques in a semester-long group project where they design, build and test a responsive application that runs on mobile devices and desktops and that includes graphical depictions of data for communication, analysis, and decision support. Short description: Foundational course focusing on the design, creation, understanding, application, and evaluation of data visualization and user interface design for communicating, interacting and exploring data.
    [Show full text]
  • Geotime As an Adjunct Analysis Tool for Social Media Threat Analysis and Investigations for the Boston Police Department Offeror: Uncharted Software Inc
    GeoTime as an Adjunct Analysis Tool for Social Media Threat Analysis and Investigations for the Boston Police Department Offeror: Uncharted Software Inc. 2 Berkeley St, Suite 600 Toronto ON M5A 4J5 Canada Business Type: Canadian Small Business Jurisdiction: Federally incorporated in Canada Date of Incorporation: October 8, 2001 Federal Tax Identification Number: 98-0691013 ATTN: Jenny Prosser, Contract Manager, [email protected] Subject: Acquiring Technology and Services of Social Media Threats for the Boston Police Department Uncharted Software Inc. (formerly Oculus Info Inc.) respectfully submits the following response to the Technology and Services of Social Media Threats RFP. Uncharted accepts all conditions and requirements contained in the RFP. Uncharted designs, develops and deploys innovative visual analytics systems and products for analysis and decision-making in complex information environments. Please direct any questions about this response to our point of contact for this response, Adeel Khamisa at 416-203-3003 x250 or [email protected]. Sincerely, Adeel Khamisa Law Enforcement Industry Manager, GeoTime® Uncharted Software Inc. [email protected] 416-203-3003 x250 416-708-6677 Company Proprietary Notice: This proposal includes data that shall not be disclosed outside the Government and shall not be duplicated, used, or disclosed – in whole or in part – for any purpose other than to evaluate this proposal. If, however, a contract is awarded to this offeror as a result of – or in connection with – the submission of this data, the Government shall have the right to duplicate, use, or disclose the data to the extent provided in the resulting contract. GeoTime as an Adjunct Analysis Tool for Social Media Threat Analysis and Investigations 1.
    [Show full text]
  • Volume Rendering
    Volume Rendering 1.1. Introduction Rapid advances in hardware have been transforming revolutionary approaches in computer graphics into reality. One typical example is the raster graphics that took place in the seventies, when hardware innovations enabled the transition from vector graphics to raster graphics. Another example which has a similar potential is currently shaping up in the field of volume graphics. This trend is rooted in the extensive research and development effort in scientific visualization in general and in volume visualization in particular. Visualization is the usage of computer-supported, interactive, visual representations of data to amplify cognition. Scientific visualization is the visualization of physically based data. Volume visualization is a method of extracting meaningful information from volumetric datasets through the use of interactive graphics and imaging, and is concerned with the representation, manipulation, and rendering of volumetric datasets. Its objective is to provide mechanisms for peering inside volumetric datasets and to enhance the visual understanding. Traditional 3D graphics is based on surface representation. Most common form is polygon-based surfaces for which affordable special-purpose rendering hardware have been developed in the recent years. Volume graphics has the potential to greatly advance the field of 3D graphics by offering a comprehensive alternative to conventional surface representation methods. The object of this thesis is to examine the existing methods for volume visualization and to find a way of efficiently rendering scientific data with commercially available hardware, like PC’s, without requiring dedicated systems. 1.2. Volume Rendering Our display screens are composed of a two-dimensional array of pixels each representing a unit area.
    [Show full text]
  • Introduction to Geospatial Data Visualization
    Introduction to Geospatial Data Visualization Lecturers: Viktor Lagutov, Katalin Szende, Joszef Laszlovsky. Ruben Mnatsakanian Duration: Fall term (September – December) Credits: 2 Course level: PhD / MA Maximum number of students: 15 Pre-requisites: none Software: GoogleEarthPro, qGIS, online mapping tools (e.g. GoogleMaps, ArcGISonline) Rapidly growing cross-disciplinary recognition and availability made Geospatial Methods in general, and Mapping, in particular, a popular approach in many research areas. Till recently, maps development had been a prerogative of cartographers and, later, experts in specialized mapping packages. Latest advances in hardware and software have opened this area to researchers in other disciplines and allowed them to enhance traditional research methods. The wide spectrum of such technologies and approaches is often referred as Geographic Information Systems (GIS) and includes, among others, mapping packages, geospatial analysis, crowdsourcing with mobile technologies, drones, online interactive data publishing. The geospatial literacy is becoming not an optional advantage for researchers and policy officers, but a basic requirement for many employers. The aim of the course is to develop basic understanding of spatially referenced data use and to explore potential applications of GIS in various research areas. The sessions provide both theoretical understanding and practical use of geospatial data and technologies for mapping societal and environmental phenomena. Students will learn basic features of GIS packages and the ways to utilize them for own research. The course is focused on practical skills in geospatial data visualization (mapping) and consists of • Theoretical sessions on principles of geospatial data visualization, cartography and GIS basics; • Practicals on learning GIS methods and getting mapping skills using free open source packages; • Supervised and independent students’ work on individual course projects.
    [Show full text]
  • Tree-Map: a Visualization Tool for Large Data
    TREE-MAP: A VISUALIZATION TOOL FOR LARGE DATA Mahipal Jadeja Kesha Shah DA-IICT DA-IICT Gandhinagar,Gujarat Gandhinagar,Gujarat India India Tel:+91-9173535506 Tel:+91-7405217629 [email protected] [email protected] ABSTRACT 1. INTRODUCTION Traditional approach to represent hierarchical data is to use Tree-Maps are used to present hierarchical information on directed tree. But it is impractical to display large (in terms 2-D[1] (or 3-D [2]) displays. Tree-maps offer many features: of size as well complexity) trees in limited amount of space. based upon attribute values users can specify various cate- In order to render large trees consisting of millions of nodes gories, users can visualize as well as manipulate categorized efficiently, the Tree-Map algorithm was developed. Even file information and saving of more than one hierarchy is also system of UNIX can be represented using Tree-Map. Defi- supported [3]. nition of Tree-Maps is recursive: allocate one box for par- Various tiling algorithms are known for tree-maps namely: ent node and children of node are drawn as boxes within Binary tree, mixed treemaps, ordered, slice and dice, squar- it. Practically, it is possible to render any tree within prede- ified and strip. Transition from traditional representation fined space using this technique. It has applications in many methods to Tree-Maps are shown below. In figure 1 given fields including bio-informatics, visualization of stock port- hierarchical data and equivalent tree representation of given folio etc. This paper supports Tree-Map method for data data are shown. One can consider nodes as sets, children integration aspect of knowledge graph.
    [Show full text]
  • Connected 2D and 3D Visualizations for the Interactive Exploration of Spatial Information
    CONNECTED 2D AND 3D VISUALIZATIONS FOR THE INTERACTIVE EXPLORATION OF SPATIAL INFORMATION S. Bleisch *, S. Nebiker FHNW, University of Applied Sciences Northwestern Switzerland, Institute of Geomatics Engineering, CH-4132 Muttenz, Switzerland - (susanne.bleisch, stephan.nebiker)@fhnw.ch KEY WORDS: Geovisualization, Three-dimensional representation, Interactive, Spatial Data Exploration, Virtual globe, Development ABSTRACT: This paper describes the concepts and the successful prototypal implementation of interactively connected 2D information visualizations and data displays in 3D virtual environments for the interactive exploration of spatial data and information. Virtual globes or earth viewers such as Google Earth have become very popular over the last few years. They are used for looking at holiday destinations but more importantly also for scientific visualizations. From a geovisualization point of view we might regard 3D data or information displays as yet another representation type that adds to the multitude of information visualization methods. Combining 3D views of data sets with traditional 2D displays offers the advantage of being able to use 3D if and when this type of representation is considered useful or effective for finding new insights into a data set. The traditional and newer displays of mainly 2D information visualization may be enhanced and new insights into the data may be generated by displays of the data in a 3D virtual environment. On the other hand, data in 3D displays might be better understood by simultaneously reading and querying connected 2D representations.The paper presents a prototypal implementation of the interactively connected visualizations of spatial information in 2D views and 3D virtual environments using the brushing technique.
    [Show full text]
  • Stories in Geotime
    Stories in GeoTime Ryan Eccles, Thomas Kapler, Robert Harper, William Wright Information Visualization ( 2008 ) © Stefan John 2008 Summer term 2008 1 Introduction • Story a powerful abstraction − Used by intelligence analysts to conceptualize threats and understand patterns − Part of the analytical process • Oculus Info’s GeoTime™ − Geo-temporal event visualization tool − Augmented with story system − Narratives, hypertext-linked visualizations, visual annotations, and pattern detection − Environment for analytic exploration and communication − Detects geo-temporal patterns − Integrates story narration to increase analytic sense-making cohesion Summer term 2008 2 1 Introduction • Assisting the analyst in: − Identifying, − Extracting, − Arranging, and − Presenting stories within the data • Story system − Lets analysts operate at story level − Higher level abstractions of data (behaviors and events) − Staying connected to the evidence − Developed in collaboration with analysts • Formal evaluation showed high utility and usability Summer term 2008 3 Overview • Storytelling • Related Work • Geo-Temporal Visualization in GeoTime • Stories in GeoTime • Evaluation Summer term 2008 4 2 Storytelling • First described in Aristotle’s Poetics − Objects of a tragedy (story): - Plot -> arrangements of incidents -Character - Thought -> processes of reasoning leading characters to their respective behavior • Narrative theory suggests: − People are essentially storytellers − Implicit ability to evaluate a story for: - Consistency -Detail - Structure Summer
    [Show full text]
  • Dashboards for SAS® Visual Analytics Laura Oliver, Experis Solutions
    SESUG Paper 256-2019 Dashboards for SAS® Visual Analytics Laura Oliver, Experis Solutions ABSTRACT Visual Analytics is a great tool to use to visualize and analyze data and create dashboard for others to review data. The designer layout has multiple sections with multiple parts to each section. This paper will provide an introduction to the designer tool available for Visual Analytics. INTRODUCTION This paper is a companion to the Hands on Workshop (HOW), Dashboards for SAS® Visual Analytics. This paper will give an overall view of the report designer environment for Visual Analytics. It will show where data and objects are added as well as give an introduction to how objects can be customized to enhance the user experience. It will also touch on how to filter data at the data source and object level. VISUAL ANALYTICS DESIGNER LAYOUT The Visual Analytics development environment consists of 3 main sections. The section on the left has tabs for Objects, Data and Imports. The middle section is the canvas where the report is built. The section on the right provides options to customize objects that have been added to the report from the Objects tab. Each object has its own set of properties, styles, etc. Figure 1 - VA Development Environment 1 DATA TAB The first step in creating a report is adding a data source. This is done on the Data tab using the Select a data source drop down. The Add Data Source dialog box will open, allowing a data source to be selected. All data sources must have been previously uploaded to the LASR server to be available for use in a report.
    [Show full text]
  • From Surface Rendering to Volume
    What is Computer Graphics? • Computational process of generating images from models and/or datasets using computers • This is called rendering (computer graphics was traditionally considered as a rendering method) • A rendering algorithm converts a geometric model and/or dataset into a picture Department of Computer Science CSE564 Lectures STNY BRK Center for Visual Computing STATE UNIVERSITY OF NEW YORK What is Computer Graphics? This process is also called scan conversion or rasterization How does Visualization fit in here? Department of Computer Science CSE564 Lectures STNY BRK Center for Visual Computing STATE UNIVERSITY OF NEW YORK Computer Graphics • Computer graphics consists of : 1. Modeling (representations) 2. Rendering (display) 3. Interaction (user interfaces) 4. Animation (combination of 1-3) • Usually “computer graphics” refers to rendering Department of Computer Science CSE564 Lectures STNY BRK Center for Visual Computing STATE UNIVERSITY OF NEW YORK Computer Graphics Components Department of Computer Science CSE364 Lectures STNY BRK Center for Visual Computing STATE UNIVERSITY OF NEW YORK Surface Rendering • Surface representations are good and sufficient for objects that have homogeneous material distributions and/or are not translucent or transparent • Such representations are good only when object boundaries are important (in fact, only boundary geometric information is available) • Examples: furniture, mechanical objects, plant life • Applications: video games, virtual reality, computer- aided design Department of
    [Show full text]
  • SAS Visual Analytics Tricks We Learned From
    SESUG Paper RV-58-2017 SAS® Visual Analytics Tricks We Learned from Reading Hundreds of SAS® Community Posts Tricia Aanderud, Ryan Kumpfmiller; Zencos Consulting Rob Collum, SAS Institute ABSTRACT After you know the basics of SAS Visual Analytics, you realize there are some situations that require unique strategies. Sometimes tables are not structured right or become too large for the environment. Maybe creating the right custom calculation for a dashboard can be confusing. Geospatial data is hard to work with if you haven’t ever used it before. We looked through 100s of SAS Communities posts for the most common questions. These solutions (and a few extras) were extracted from the newly released Introduction to SAS Visual Analytics book. INTRODUCTION Our goal in writing the Introduction to SAS Visual Analytics book was to create a really practical and useful book for users. As part of the research for the book, we read hundreds of posts in the SAS Communities: SAS Visual Analytics section. This was a great exercise because we could confirm some of the sticking points that we know users have and pick up some tips to emphasize in the book as well. This paper combines our favorite tips from the book and some other ones that we think are worth sharing. Note: All examples were done using SAS Visual Analytics version 7.3 WHAT IS SAS COMMUNITIES? SAS Communities is an open forum on the SAS website, https://communities.sas.com, which connects SAS users all over the world. With over a hundred thousand members, users can post questions about challenges they are currently having working with SAS products or provide answers to other users who are looking for advice.
    [Show full text]
  • Infovis and Statistical Graphics: Different Goals, Different Looks1
    Infovis and Statistical Graphics: Different Goals, Different Looks1 Andrew Gelman2 and Antony Unwin3 20 Jan 2012 Abstract. The importance of graphical displays in statistical practice has been recognized sporadically in the statistical literature over the past century, with wider awareness following Tukey’s Exploratory Data Analysis (1977) and Tufte’s books in the succeeding decades. But statistical graphics still occupies an awkward in-between position: Within statistics, exploratory and graphical methods represent a minor subfield and are not well- integrated with larger themes of modeling and inference. Outside of statistics, infographics (also called information visualization or Infovis) is huge, but their purveyors and enthusiasts appear largely to be uninterested in statistical principles. We present here a set of goals for graphical displays discussed primarily from the statistical point of view and discuss some inherent contradictions in these goals that may be impeding communication between the fields of statistics and Infovis. One of our constructive suggestions, to Infovis practitioners and statisticians alike, is to try not to cram into a single graph what can be better displayed in two or more. We recognize that we offer only one perspective and intend this article to be a starting point for a wide-ranging discussion among graphics designers, statisticians, and users of statistical methods. The purpose of this article is not to criticize but to explore the different goals that lead researchers in different fields to value different aspects of data visualization. Recent decades have seen huge progress in statistical modeling and computing, with statisticians in friendly competition with researchers in applied fields such as psychometrics, econometrics, and more recently machine learning and “data science.” But the field of statistical graphics has suffered relative neglect.
    [Show full text]