Ggobi : Interactive and Dynamic Data Visualization System

Ggobi : Interactive and Dynamic Data Visualization System

GGobi : Interactive and dynamic data visualization system Bioinformatics and Biostatistics Lab., Seoul National Univ. Seoul, Korea Eun-Kyung Lee 1 Outline z interactive and dynamic graphics z Exploratory data analysis and Data mining z What is GGobi? z Main features of GGobi z Demo with a couple of examples 2 Interactive vs. Dynamic Graphics z Interactive graphics - a user can actively manipulate the visual graphics by input devices and make changes based on the visual result. z Dynamic graphics - the visible graphics change on the computer screen without further user interaction --> Interactive and dynamic graphics 3 Interactive and dynamic graphical methods z Focusing - zooming, slicing, rescaling, reformatting z Arranging - rotation, grand tour, guided tour, manual tour z Linking - Linked brushing and identification 4 EDA vs. DM z Exploratory Data Analysis (EDA) - numerical or graphical detective work - a continuation of Tukey's idea to use graphics to find structure, general concepts, unexpected behavior, etc. in data sets by looking at the data. 5 EDA vs. DM z Data Mining (DM) - Data mining is exploratory data analysis with little or no human interaction using computationally feasible techniques - Wegman z Visual Data Mining (VDM) = DM + Statistical graphics 6 What is GGobi? z A direct descendent of XGobi z A data visualization system with interactive and dynamic methods for the manipulation of views of data. z provide various plots with multiple plotting windows system z use XML file format for data z can be easily extended, either by being embedded in other SW or by the addition of plugins z able to use in R (rggobi) 7 GGobi’smain features 1. Appearance z Use GTK+ z single session can support multiple plots z single process can support multiple independent session z support several types of plots z scatter plot, parallel coordinate plot, scatter plot matrix, time series plot, barchart z include interactive tools to specify and tune color maps z able to add variables on the fly z panning and zooming 8 GGobi’s Main features 2. Portability z runs under various platforms, like Linux, Windows or Mac. 3. Data format z XGobi : use several files (.dat, .col, .row, .glyphs, .colors, etc) z use XML z allow complex characteristics and relationships in data to be specified z multiple dataset can be entered in a single XML file and specifications can be included for linking them 9 GGobi’s Main features 4. Embedding in other SW z GGobi can be treated as a C library and directly embedded in other SW, then controlled using an application program interface (API) z This allows GGobi functionality to be integrated into one’s own stand-alone application and provide as an add-on to existing language and scripting environments 5. Extending with plugins z The plugin mechanism allows to provide add-on extensions to GGobi that are not part of the core design z data viewer, ggvis (MDS), Variogram Cloud, Save Display Description 10 GGobi : File z open z XML from files z XML from URL z CSV z New z open new session z Save z as XML : keep all the information including color, glyph, etc. z as CSV : keep only numeric data values 11 GGobi : Display z open new plot window z New Scatterplot Display z New Scatterplot Matrix z New Parallel coordinates Display z New Time Series z New Barchart 12 GGobi : View z 1D plot z XY plot : 2D plot z 1D tour : project data into 1D space z Rotation : use three variables z 2D tour : project data into 2D space z 2x1D tour : use 2 different 1D tour 13 GGobi : interaction z Scale z Brush z Identify z scale z EditEdges : add edges or add points in Display z MovePoints : move points in Display 14 GGobi : Tools z Variable Manipulation ; Variable Transformation z Sphering (PCA) z Variable jittering : prevent point mass viewing z Color Schemes; Automatic Brushing z Color & Glyph groups; Case Subsetting & Sampling z Missing Values z plugins z Data Viewer, ggvis(MDS), Variogram Cloud, Save Display Description 15 RGGobi z able to use GGobi in R z Link, including programming customized GUIs containing linked GGobi plots, writing new linking rules in S, and responding to GGobi events, create GGobi plugins written in R. 16 Example 1 : Restaurant Tipping z In early 1990, one waiter recorded information about each tip he received over a period of a few months working in one restaurant. He collected several variables; (n=244) z TOTBILL : total bill in dollars z TIP : tip in dollars z SEX : gender of the bill payer; male(0), female(1) z SMOKER : whether the party included smokers or not No(0), Yes(1) z DAY : days of week ; Thu(3), Fri(4), Sat(5), Sun(6) z TIME : lunch(0), dinner(1) z SIZE : size of the party 17 Example 2 : Italian Olive Oils z This data consists of the percentage composition of 8 fatty acids found in the lipid fraction of 572 Italian olive oils (n=572) z Region : South(1),North(2) or Sardinia(3) z Area : North-Apulia(1), Calabria(2), South Apulia(3), Sicily(4), Inland Sardinia(5), Costal Sardinia(6), East Liguria(7), West Liguria(8), and Umbria(9) z Palmitic z Palmitoleic z Stearic z Oleic z Linoleic z Linolenic z Arachidic z Eicosenoic 18 Italy North Italy 8 7 9 1 South 5 3 Italy Sardinia 6 2 4 19 Example 3 : Leukemia data n = 72 : # of observation p = 3571 ➝ p=40 : # of genes z acute myeloid leukemia(AML) : 25 cases z acute lymphoblastic leukemia(ALL) : 47 cases ◉ B-cell ALL : 38 cases ◉ T-cell ALL : 9 cases 20 exploRase z Visual data mining tools for microarray data and metabolic networks z Visual data analysis interface for microarray data and metabolic networks z Based on R and GGobi z Provide EDA tool using direct manipulation z analyze the connections between microarray data and metabolic pathway visually and interactively z combine statistical analysis results with interactive plots to improve the analysis. 21 exploRase 22 Discussion z Full marriage between GGobi’sdirect manipulation graphical environment and R’s familiar extensible environment for statistical data analysis Î powerful tool for visual data mining * all references and documents are on the web http://www.ggobi.org 23.

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    23 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us