Data Exploration and Analysis Guide

Oracle® Big Data Discovery: Data Exploration and Analysis Guide
Version 1.0.0 • Revision A • March 2015

Copyright and disclaimer

Copyright © 2003, 2015, Oracle and/or its affiliates. All rights reserved.

Oracle and Java are registered trademarks of Oracle and/or its affiliates. Other names may be trademarks of their respective owners. UNIX is a registered trademark of The Open Group.

This software and related documentation are provided under a license agreement containing restrictions on use and disclosure and are protected by intellectual property laws. Except as expressly permitted in your license agreement or allowed by law, you may not use, copy, reproduce, translate, broadcast, modify, license, transmit, distribute, exhibit, perform, publish or display any part, in any form, or by any means. Reverse engineering, disassembly, or decompilation of this software, unless required by law for interoperability, is prohibited.

The information contained herein is subject to change without notice and is not warranted to be error-free. If you find any errors, please report them to us in writing.

If this is software or related documentation that is delivered to the U.S. Government or anyone licensing it on behalf of the U.S. Government, the following notice is applicable:

U.S. GOVERNMENT END USERS: Oracle programs, including any operating system, integrated software, any programs installed on the hardware, and/or documentation, delivered to U.S. Government end users are "commercial computer software" pursuant to the applicable Federal Acquisition Regulation and agency-specific supplemental regulations. As such, use, duplication, disclosure, modification, and adaptation of the programs, including any operating system, integrated software, any programs installed on the hardware, and/or documentation, shall be subject to license terms and license restrictions applicable to the programs. No other rights are granted to the U.S. Government.
This software or hardware is developed for general use in a variety of information management applications. It is not developed or intended for use in any inherently dangerous applications, including applications that may create a risk of personal injury. If you use this software or hardware in dangerous applications, then you shall be responsible to take all appropriate fail-safe, backup, redundancy, and other measures to ensure its safe use. Oracle Corporation and its affiliates disclaim any liability for any damages caused by use of this software or hardware in dangerous applications.

This software or hardware and documentation may provide access to or information on content, products and services from third parties. Oracle Corporation and its affiliates are not responsible for and expressly disclaim all warranties of any kind with respect to third-party content, products, and services. Oracle Corporation and its affiliates will not be responsible for any loss, costs, or damages incurred due to your access to or use of third-party content, products, or services.
Table of Contents

Copyright and disclaimer
Preface
    About this guide
    Who should use this guide
    Conventions used in this document
    Contacting Oracle Customer Support

Part I: Introduction to Big Data Discovery
  Chapter 1: Overview
    Value of Using Big Data Discovery
    Your tasks within Big Data Discovery
    About the Workflow in Big Data Discovery
      Find
      Profile and Enrich Data
      Explore
      Transform
      Discover: Analyze and Obtain Results
      Filter, Use Guided Navigation, and Search
      Understand the Data Flow

Part II: Logging in and Managing Your Account
  Chapter 2: Getting Access to Big Data Discovery
    Options for access to Big Data Discovery
    Logging in to Big Data Discovery
  Chapter 3: Managing Your Big Data Discovery Account
    Changing your Big Data Discovery password
    Selecting the locale to use for Big Data Discovery
      About locales
      Configuring the locale for your user account
      Selecting a locale from the user menu
  Chapter 4: Using Big Data Discovery on an iPad

Part III: Searching, Filtering, and Navigating in Big Data Discovery
  Chapter 5: Using Refinements for Filtering
    About using refinements for filtering
    Using a value list to select refinements
    Using a range filter to select refinements
    Using date/time values for refinement
    About implicit refinements
  Chapter 6: Using the Big Data Discovery Search Feature
    About the Big Data Discovery search feature
    Displaying and using the results of a search
    Notes about keyword searches
  Chapter 7: Managing Selected Refinements
    About the Selected Refinements panel
    Removing refinements from the Selected Refinements panel

Part IV: Finding and Browsing Available Data Sets and Projects
  Chapter 8: About the Catalog
  Chapter 9: Filtering and Searching Data Sets and Projects
  Chapter 10: Browsing Information About Data Sets or Projects
  Chapter 11: About Tags
    Adding and removing tags for catalog items

Part V: Creating, Exploring, and Deleting Data Sets
  Chapter 12: Creating a New Data Set
    Loading data from a file
  Chapter 13: Exploring a Data Set
    About exploring a data set
    Selecting the data to explore
    Switching Explore between tile view and table view
    Filtering and sorting the attributes in Explore
    Working with an individual attribute in Explore
    Using the scratchpad
  Chapter 14: Deleting Data Sets
    Deleting a data set

Part VI: Creating and Configuring Projects
  Chapter 15: Creating a project
    About projects
    Creating a project using the Catalog
    Creating a project from within a data set
  Chapter 16: Managing Project Pages
    Adding a page to a project
    Renaming a page
    Changing the layout of a page
    Changing the data set binding for a page
    Deleting a page
  Chapter 17: Adding and Configuring Components
    Adding and deleting components on a page
    Renaming components
