IBM Cognos Analytics

Data Prep. and Modelling Working with your data Michael McGeein, CPA, CA

September 2018

IBM Business Analytics :: IBM Confidential :: © 2016 IBM Corporation Notices and disclaimers

Copyright © 2018 by International Business Machines Corporation (IBM). as illustrations of how those customers have used IBM products and No part of this document may be reproduced or transmitted in any form without the results they may have achieved. Actual performance, cost, savings or other written permission from IBM. results in other operating environments may vary.

U.S. Government Users Restricted Rights — use, duplication or disclosure References in this document to IBM products, programs, or services does not restricted by GSA ADP Schedule Contract with IBM. imply that IBM intends to make such products, programs or services available in all countries in which IBM operates or does business. Information in these presentations (including information relating to products that have not yet been announced by IBM) has been reviewed for accuracy as Workshops, sessions and associated materials may have been prepared by of the date of initial publication and could include unintentional technical or independent session speakers, and do not necessarily reflect the typographical errors. IBM shall have no responsibility to update this information. views of IBM. All materials and discussions are provided for informational This document is distributed “as is” without any warranty, either express or purposes only, and are neither intended to, nor shall constitute legal or other implied. In no event shall IBM be liable for any damage arising from the use of guidance or advice to any individual participant or their specific situation. this information, including but not limited to, loss of data, business interruption, loss of profit or loss of opportunity. IBM products and services are warranted It is the customer’s responsibility to insure its own compliance with legal according to the terms and conditions of the agreements under which they are requirements and to obtain advice of competent legal counsel as to provided. the identification and interpretation of any relevant laws and regulatory requirements that may affect the customer’s business and any actions IBM products are manufactured from new parts or new and used parts. the customer may need to take to comply with such laws. IBM does not In some cases, a product may not be new and may have been previously provide legal advice or represent or warrant that its services or products will installed. Regardless, our warranty terms apply.” ensure that the customer is in compliance with any law.

Any statements regarding IBM's future direction, intent or product plans are subject to change or withdrawal without notice.

Performance data contained herein was generally obtained in a controlled, isolated environments. Customer examples are presented

IBM Business Analytics :: IBM Confidential :: © 2018 IBM Corporation IBM Analytics University 2018 Notices and disclaimers continued Information concerning non-IBM products was obtained from the suppliers of Global Technology Services®, IBM ExperienceOne™, IBM SmartCloud®, IBM those products, their published announcements or other publicly available Social Business®, Information on Demand, ILOG, Maximo®, sources. IBM has not tested those products in connection with this publication MQIntegrator®, MQSeries®, Netcool®, OMEGAMON, OpenPower, and cannot confirm the accuracy of performance, compatibility or any other PureAnalytics™, PureApplication®, pureCluster™, PureCoverage®, PureData®, claims related to non-IBM products. Questions on the capabilities of non-IBM PureExperience®, PureFlex®, pureQuery®, pureScale®, products should be addressed to the suppliers of those products. IBM does not PureSystems®, QRadar®, Rational®, Rhapsody®, Smarter Commerce®, SoDA, warrant the quality of any third-party products, or the ability of any such third- SPSS, Sterling Commerce®, StoredIQ, Tealeaf®, Tivoli® Trusteer®, party products to interoperate with IBM’s products. IBM expressly disclaims all Unica®, urban{code}®, , WebSphere®, Worklight®, X-Force® and System warranties, expressed or implied, including but not limited to, the implied z® Z/OS, are trademarks of International Business Machines Corporation, warranties of merchantability and fitness for a particular, purpose. registered in many jurisdictions worldwide. Other product and service names might be trademarks of IBM or other companies. A current list of IBM The provision of the information contained herein is not intended to, and does trademarks is available on the Web at "Copyright and trademark information" at: not, grant any right or license under any IBM patents, copyrights, trademarks or www..com/legal/copytrade.shtml. other intellectual property right.

IBM, the IBM logo, ibm.com, Aspera®, Bluemix, Blueworks Live, CICS, Clearcase, Cognos®, DOORS®, Emptoris®, Enterprise Document Management System™, FASP®, FileNet®, Global Business Services®,

IBM Business Analytics :: IBM Confidential :: © 2018 IBM Corporation IBM Analytics University 2018 • Problem – Data prep. – few users want to do it

1 • 80% of time spent in analytics is “wrangling’ the data

• Cleaning Big Data: Most Time-Consuming, Least Enjoyable Data Science Task 2

Sloan Management / IBM IBV study 1

Forbes / CloudFlower data science study 2

IBM Business Analytics :: © 2018 IBM Corporation 4 Static reports Interactive reports Data Consumers Personalized 60% reports Dashboards & Stories Data Casual Users Power Users Explorers Data modules & 30% Data Analysts 8% ad hoc reports Packages & professional reports

Data Scientists 2%

Source Eckerson Group 2017 IBM Business Analytics :: IBM Confidential :: © 2018 IBM Corporation 5 Data modeling is preparing …

• A business-oriented presentation of one or more data sources ▪ Easily understood by decision makers

• The metadata building blocks for assembling reports, dashboards, and stories

IBM Business Analytics :: IBM Confidential :: © 2018 IBM Corporation Data modeling is shaping…

• Filtering data to only what’s needed

• Augmenting data with calculations

• Joining data sets together

• Cleaning and grouping data

• Setting metadata like aggregation and sorting to the best defaults for reports, dashboards and stories

IBM Business Analytics :: IBM Confidential :: © 2018 IBM Corporation Share and reuse

Shaping data in a dashboard only affects that dashboard

Changes to a data model affect all reports, dashboards, and stories based on it

Save a data model into a folder with suitable security permissions so others can use it as appropriate

IBM Business Analytics :: IBM Confidential :: © 2018 IBM Corporation Two types of data models in Cognos Analytics

Packages Data modules

• Oriented to IT users • Oriented to LoB users without alienating IT

• Desktop tools like Framework Manager • Web-based

• 10+ years of maturity • Modern user experience

• Maintained but not enhanced by IBM • Actively enhanced

• Automated intelligence in recognition that data modeling is a necessary evil ☺

IBM Business Analytics :: IBM Confidential :: © 2018 IBM Corporation Data Modules Investment Clean & Prep Usability areas

Aggregation Roll Relative Date ups support

IBM Business Analytics :: IBM Confidential :: © 2018 IBM Corporation 10 Data Preparation Split / Clean

IBM Business Analytics :: © 2018 IBM Corporation 11 Aggregation Roll ups

• Multi Fact / Multi Grain

IBM Business Analytics :: © 2018 IBM Corporation 12 Create table from SQL Usability Improvements

Tables and Set Operations

Expression Editor guidance

IBM Business Analytics :: IBM Confidential :: © 2018 IBM Corporation Relative Dates

• 2 Calendars shipping with 11.1

• Include Calendar in Data module

• Relate Date field(s) to calendar (Lookup Reference)

• Link measure to date field

• Instructions available to generate another calendar

Note: Currently, Months MUST begin on the first day of the month

IBM Business Analytics :: IBM Confidential :: © 2018 IBM Corporation Demo

• Split Columns

• Relative Dates

IBM Business Analytics :: IBM Confidential :: © 2018 IBM Corporation IBM Analytics University 2018 Wrap Up IBM Business Analytics :: IBM Confidential :: © 2018 IBM Corporation Data modelling improvements

• Flat file - multi tab support & append data

• Recommendation on relationships – join, blend

• Data preparation – split column, trim, convert date

• Relative dates - month to date, quarter to date(QTD), prior QTD

• Aggregation across grains –Multi grain analysis

• Ease of use – expression editor, SQL based tables, folders, format, filter

• Set operations – Union, Intersect, Except

• Security Filters

IBM Business Analytics :: IBM Confidential :: © 2018 IBM Corporation IBM Analytics University 2018