UNIVERSITY OF LJUBLJANA FACULTY OF ECONOMICS

MASTER'S THESIS

THE EXTENT OF AUTONOMOUS DATA ANALYSIS FOR NON-IT STAFF WITH SELF-SERVICE TOOLS

Ljubljana, October 10th, 2018 Christian Piechorowski

AUTHORSHIP STATEMENT

The undersigned Christian Piechorowski, a student at the University of Ljubljana, Faculty of Economics, (hereafter: FELU), author of this written final work of studies with the title The extent of autonomous data analysis for non-IT staff with self-service business intelligence tools, prepared under supervision of Prof. dr. Jurij Jaklič and co-supervision of Prof. Tiago Oliveira

DECLARE

1. this written final work of studies to be based on the results of my own research;

2. the printed form of this written final work of studies to be identical to its electronic form;

3. the text of this written final work of studies to be language-edited and technically in adherence with the FELU’s Technical Guidelines for Written Works, which means that I cited and / or quoted works and opinions of other authors in this written final work of studies in accordance with the FELU’s Technical Guidelines for Written Works;

4. to be aware of the fact that plagiarism (in written or graphical form) is a criminal offence and can be prosecuted in accordance with the Criminal Code of the Republic of Slovenia;

5. to be aware of the consequences a proven plagiarism charge based on this written final work could have for my status at the FELU in accordance with the relevant FELU Rules;

6. to have obtained all the necessary permits to use the data and works of other authors which are (in written or graphical form) referred to in this written final work of studies and to have clearly marked them;

7. to have acted in accordance with ethical principles during the preparation of this written final work of studies and to have, where necessary, obtained permission of the Ethics Committee;

8. my consent to use the electronic form of this written final work of studies for the detection of content similarity with other written works, using similarity detection software that is connected with the FELU Study Information System;

9. to transfer to the University of Ljubljana free of charge, non-exclusively, geographically and time-wise unlimited the right of saving this written final work of studies in the electronic form, the right of its reproduction, as well as the right of making this written final work of studies available to the public on the World Wide Web via the Repository of the University of Ljubljana;

10. my consent to publication of my personal data that are included in this written final work of studies and in this declaration, when this written final work of studies is published.

Ljubljana, ______ (Month in words / Day / Year, e.g. June 1st, 2012) Author’s signature: ______

TABLE OF CONTENTS

INTRODUCTION
1. ANALYSIS OF POTENTIAL BENEFITS OF SSBI
1.1 Basic Concepts of Business Intelligence
1.2 Business Intelligence Architecture
1.2.1 Operational Source System
1.2.2 Extract, Transform, Load
1.2.3 Online Analytical Processing
1.2.4 Business Intelligence Applications
1.3 Dimensional Data Modelling
1.4 Self-Service and Self-Service Business Intelligence
1.5 Self-Service Business Intelligence: Benefits and Shortcomings
1.5.1 Levels of Self-Service Business Intelligence
1.5.2 Data Governance and SSBI
1.5.3 Data Quality and its Dimensions
1.5.3.1 Accuracy
1.5.3.2 Completeness
1.5.3.3 Time-related Dimensions
1.5.3.4 Consistency
1.5.3.5 Other Considerations
1.6 Self-Service Business Intelligence and the importance of correct Ontology
2 METHODOLOGY, LIMITATIONS, AND EXECUTION OF THE EXPERIMENT
3 RESULTS
4 DISCUSSION
CONCLUSION AND OUTLOOK
REFERENCES

LIST OF TABLES

Table 1: Overall results averaged over the Demographic Data

LIST OF FIGURES

Figure 1: DW/BI Architecture
Figure 2: Entity Classification of the AdventureWorks database
Figure 3: Hierarchies in the Sales case
Figure 4: Database after the transformation
Figure 5: Levels of SSBI
Figure 6: Gartner Magic Quadrant
Figure 7: Sales by Territory (2011-2014)
Figure 8: Average Sales Amount per Customer by Territory (2011-2014)
Figure 9: Sales by Quarters (2011-2014)
Figure 10: Direct Sales and Sales through Resellers (2011-2014)
Figure 11: Adjustment of the Ontology
Figure 12: Box Plot of the investigated Variables
Figure 13: Funnel with the Success Rate of the Experiment
Figure 14: Correlation Matrix of the Variables
Figure 15: Clustering of the relevance of Numeric Charts in connection with Avg. Perceived Usefulness
Figure 16: Clustering of total steps completed in connection with Avg. Ease of Use

LIST OF ABBREVIATIONS

3NF 3rd Normal Form
A Anticipated Use
ATM Automated Teller Machine
BI Business Intelligence
BIS Business Intelligence System
DAX Data Analysis Expressions
DM Dimensional Modelling
DQM Data Quality Management
DSA Data Staging Area
DW Data Warehouse
E Ease of Use
ERD Entity Relationship Diagram
ERP Enterprise Resource Planning
ETL Extract, Transform, Load
GDPR General Data Protection Regulation
GIGO Garbage In, Garbage Out
HOLAP Hybrid OnLine Analytical Processing
ID Identifier
IM Information Management
IT Information Technology
J Enjoyment
MDDB Multidimensional Database
MIS Management Information Systems
MOLAP Multidimensional OnLine Analytical Processing
MS Microsoft
O Perceived Characteristics of the Output
OLAP OnLine Analytical Processing
OLTP OnLine Transactional Processing
RDBMS Relational Database Management System
ROLAP Relational OnLine Analytical Processing
SDK Software Development Kit
SDLC Software Development Life Cycle
SQL Structured Query Language
SSBI Self-Service Business Intelligence
SSIS SQL Server Integration Services
SSMS SQL Server Management Studio
SST Self-Service Technology
TAM Technology Acceptance Model
TDQM Total Data Quality Management
TIST Trust in Information Systems Technology
U Perceived Usefulness

INTRODUCTION

Not only do companies nowadays compete in the efficient and effective use of human capital and other assets, but also in the use of data. Data has become an integral part of every business that is competing in the market today. There are many ways the data can be used; however, this thesis will focus on the field of Business Intelligence (BI). Effective decision making is not only important in our everyday lives, but also in the working world. Each day, decisions made in a company determine the efficiency and direction of the organization. As in our personal life, good choices are the key to organizational performance (Larson, 2009). BI traditionally describes the company-wide process of capturing, making accessible and analysing operational data, which is then made accessible to the employees through automated reporting tools. The field of Business Intelligence underwent two major changes in recent years (Alpar & Schulz, 2016). Firstly, the new breadth of available data, for example from phones, oftentimes differs in its structure, volume, and rate of growth. Secondly, the scope of BI has been extended from strategic questions to operational questions. In response to these two developments, the approach of Self-Service Business Intelligence (SSBI) was proposed. Self-service itself describes aspects of a business process that earlier required the company’s employees to be involved and are now made available to the customers, for example through the company’s website or self-checkout points in supermarkets (Oliver, Romm-Livermore & Sudweeks, 2009). Hence, it is interesting to see whether these new SSBI solutions can similarly be included in the workflow of employees in the business context, in the same way as the use of Excel is an integral part of a large number of positions.

According to Alpar and Schulz (2016), the two main goals of SSBI are, on the one hand, to make actionable information derived from multifaceted data available to casual users without having to contact any IT specialists and, on the other hand, to enable power users to accomplish their tasks more easily and faster than before. Since SSBI is still considered a rather new phenomenon, a lot of research still needs to be undertaken. As explained before, the two shifts in BI that made SSBI necessary are the availability of multifaceted data and the extension of the scope of BI to supporting operational decisions, not only strategic problems, in order to stay competitive. As SSBI is a new tool at the disposal of business users, they often do not know how to use it properly, nor does the IT personnel always have a good understanding of how to set up such a system so that it works well with the needs of the casual users. There are a variety of problems that play into this. One of them is having a proper ontology for the terms of the company. In the shipping department, the term “customer” might have a totally different meaning than in the marketing department. For example, for the shipping department, the only relevant information is the address, whereas for the marketing department the customer as a whole is of interest in order to formulate effective marketing strategies. Hence, different information needs lead to different definitions across the company (Li, Thomas & Osei-Bryson, 2017). This problem is closely linked to data governance, as these definitions need to be set up in the system itself so that the business users can easily understand which data they are dealing with. Oftentimes the metadata is simply taken from the best practices set up by the vendor who provided the system in the first place. In the case of SSBI, however, the metadata needs to be more business specific, as it is not only accessed by the IT personnel, which has a much deeper understanding of how the data is structured across the business. Therefore, during the process of setting up an SSBI system, appropriate communication is needed between the IT and the other departments in the company. The same holds true for the data warehouse design: if the data warehouse does not support specific query requirements, SSBI becomes virtually impossible. This requires the BI team to simplify the hierarchy so that the BI software can automatically generate the needed SQL queries (Weber, 2013). This thesis will provide a deeper understanding of the state of casual users and how accessible and useful they find such a system in their everyday work, especially considering that they have never had contact with such a system before. To do so, a system will be set up and, in an experiment, casual users will be exposed to a business analysis problem after a short introduction. As a study with this focus has not been done before, I hope to be able to contribute to the field.

There are a variety of goals that should be achieved with this study. One of them is to identify the tasks of SSBI and its related benefits. To reach this goal, a literature review will be conducted. Secondly, it is important to find out how to set up a proper ontology-based single version of the truth, which requires identifying the data governance requirements needed to achieve this. Another goal will be to determine which success factors need to be in place in order to have an effective SSBI solution running. Lastly, and most importantly, the main goal is to analyse the gap between SSBI and user capabilities, more specifically to find out which skills are required to properly use an SSBI system.

The thesis will be structured in such a way that the reader will first understand some of the basic concepts linked to Business Intelligence, so that SSBI will not seem as foreign. Some of these basic concepts include explanations about the architecture and its related technologies, as well as about dimensional modelling. In the next section, Self-Service BI will be covered. Some of the topics are, for example, its definition, advantages and shortcomings, as well as some challenges that need to be tackled to implement a successful SSBI solution, such as the focus on stringent data governance. After the theoretical part follow the methodology and execution of the experiment. This section also includes some limitations of the experiment. The following section will cover the results. In the discussion section, these results will be compared to other literature to either support or challenge them. Lastly, the reader will receive a conclusion and an outlook on further research in this area.

The theoretical framework that will guide this piece of work is the Technology Acceptance Model (TAM). The model is based on information sciences and describes how the system’s design influences the user’s acceptance of a computer-based system. The model proposes that when a user is presented with a new technology, there are several factors that determine how and when they will use it. The two main factors to be mentioned here are perceived usefulness and perceived ease of use. The latter explains how much a new user perceives the system to be free from effort, whereas the former deals with the user’s perception of the enhancement the system poses to their job performance (Davis, 1985). In 2000, Venkatesh and Davis proposed an extension of the original TAM. This new model also takes social influence and cognitive instrumental processes into consideration. The higher number of variables would require a much larger sample size to produce relevant results. Nonetheless, the three additional variables pose interesting questions that make sense to ask in the context of our exploratory analysis, even with a small sample size. The new variables are enjoyment, perceived characteristics of the output, and anticipated use. Additionally, it will also be asked how relevant such a tool would be for the participant’s workspace. The Unified Theory of Acceptance and Use of Technology was formulated by Venkatesh, Morris, Davis, and Davis in 2003. Here the proposed model was composed of a total of eight other models that were popular at the time. The paper showed that the model outperforms each of the models it took its influences from; however, it again has many variables that make a reliable result harder to achieve with a small sample size.

The main goal of the experiment will be to assess how well non-IT personnel can handle an SSBI system. Along the way, we might also find out how relevant such a system is for them, or whether they believe that they will use it in the future. The experiment itself will be conducted as a quasi-experiment, as it cannot be carried out in a “clean” environment due to the human involvement, which is why this kind of experiment was opted for. The results of the experiment will also be compared to other research papers on the topic of SSBI, which will help to validate or disprove the findings in the discussion section. The internal variables that will determine the user motivation are: Perceived Usefulness (U), Perceived Ease of Use (E), Perceived Characteristics of the Output (O), Enjoyment (J) and Anticipated Use (A). Together they allow observing the actual system use. The first four variables make up the cognitive response, the fifth the affective response, and the actual system use is the behavioural response (Davis, 1985a).

1. ANALYSIS OF POTENTIAL BENEFITS OF SSBI

This chapter will explore the basic concepts that make BI so beneficial for companies to employ. After the basic concepts of BI have been explored, the focus of the thesis will shift towards Self-Service Business Intelligence (SSBI): what it is, for whom it is made, its strengths and weaknesses, as well as some practices that are necessary to achieve a successful SSBI implementation and sustainable operation.

1.1 Basic Concepts of Business Intelligence

As described before, the goal of Business Intelligence is to provide the necessary information to the relevant decision makers in time, in order to support effective decision making. According to Larson (2009), effective decision making is relevant at all organizational levels; therefore, timely foundation and feedback information is needed throughout the organization. Generally, BI is often counted as part of the field of data mining. With increasing amounts of data, it became necessary to come up with new mathematical and statistical methods to model data, account for error and handle issues such as missing values and different scales or measures. The tools used in data mining stem from statistics, machine learning, computer science and mathematics. There are four typical tasks that are undertaken in data mining: predictive modelling, segmentation (data clustering), summarization and visualization (De Veaux, 2013). In today’s business world, decision-making power is becoming more and more horizontally distributed. This development is driven by technological advances, which result in flatter organizational structures. With an increasing number of knowledge workers, we also need them to have more aggregated information at their disposal. This is why Business Intelligence is so important. In terms of its architecture, Business Intelligence has been described in a more traditional IT way, that is, in terms of software, hardware, middleware, application suites, data warehouses and business transactions. However, many describe it in the form of a BI pyramid, which divides the different parts by the user groups they are targeted at. This pyramid consists of three stages. The first, topmost stage is used for executive interaction with the BIS. The second, middle stage is OnLine Analytical Processing (OLAP), which is used by middle management for ad hoc queries. Lastly, the lowest level of the pyramid consists of preformatted report generators, which are useful for operational management (Shariat & Hightower, 2007). Generally, IT systems in companies can be divided into two major groups: the aforementioned OLAP, which deals with the analysis of data, and OLTP (OnLine Transactional Processing), which provides the input data for OLAP (Schaffner, Bog, Krüger & Zeier, 2008). Chaudhuri and Dayal (1997) explain that an OLTP system usually automates bookkeeping tasks such as order entry and banking transactions, which make up the “bread and butter” of day-to-day operations. These can be of varying size, and maximizing transaction throughput is one of the key performance metrics. Data warehouses, or OLAP systems, are targeted at providing effective decision support and therefore provide a more aggregated view. They also hold a larger amount of historical data to provide more precise information. The queries that are performed here are much more complex and can access potentially millions of data entries at once that need to be joined, scanned and aggregated (Chaudhuri & Dayal, 1997).
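To make the contrast tangible, the following minimal sketch juxtaposes a narrow OLTP-style lookup with a wide OLAP-style aggregation; the in-memory SQLite table and its columns are purely illustrative assumptions, not the systems discussed in this thesis.

```python
# Illustrative contrast of an OLTP lookup vs. an OLAP aggregate on a tiny,
# assumed "sales" table held in an in-memory SQLite database.
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("""CREATE TABLE sales (
    order_id INTEGER, territory TEXT, order_date TEXT, amount REAL)""")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?, ?, ?)",
    [(1, "Europe", "2013-05-02", 120.0),
     (2, "Europe", "2014-01-15", 80.0),
     (3, "North America", "2014-03-20", 200.0)])

# OLTP style: touch a single record, e.g. to process one order.
one_order = conn.execute(
    "SELECT * FROM sales WHERE order_id = ?", (2,)).fetchone()

# OLAP style: scan and aggregate many records at once for decision support,
# e.g. total sales per territory and year.
per_territory_year = conn.execute("""
    SELECT territory, substr(order_date, 1, 4) AS year, SUM(amount)
    FROM sales
    GROUP BY territory, year""").fetchall()

print(one_order)
print(per_territory_year)
```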

Humm and Wietek (2005) also provide a short summary of the history of the different IT systems that were used in companies for the purpose of decision support. From the 1960s to the 1970s, Management Information Systems (MIS) were used. They allowed for efficient data processing and integrated systems, and they gave the operators a vision of automated decision generation. In short, these systems represent the lowest level of decision support. From the 1970s to the 1980s, Decision Support Systems were introduced. Compared to the MIS, they allowed for much more complex analysis of the data, because the data stored in these systems was held in more rigid and complex structures, which allowed for statistical algorithms (i.e. what-if analyses). The most important change was the orientation towards a database. From the 1980s to the 1990s, Enterprise Information Systems (EIS) made their first appearance. This is the first time that the Enterprise Resource Planning (ERP) system was divided into an operational side and an analytical side; the EIS represented the latter. It was also the first time that multidimensional modelling was introduced; however, its use was limited to top management. In the following ten years, the first data warehouses started showing up. People started realizing the power of historical data, basically assuming that the data from yesterday can provide an appropriate representation of the future. The first data warehouses allowed for the integration of a variety of different data sources. The OLAP structure also made it possible to run interactive queries. In the early 2000s, BIS really took off. These at the time new systems make use of balanced scorecards and analytical applications and allow for data mining techniques. It needs to be mentioned that throughout the history of these systems the level of decision support was constantly increasing. All of the functionalities that were available in the aforementioned iterations of decision support systems are still available and are now commonly brought under the umbrella term that is Business Intelligence (Humm & Wietek, 2005).

1.2 Business Intelligence Architecture

As explained before, a BIS can be separated into three stages, each of which provides value to a certain user group. The core component of each Business Intelligence system, however, is the data warehouse. This section will shed light on how a data warehouse is structured.

When it comes to modelling, there are two established approaches: one proposed by Ralph Kimball, a long-term veteran in the development of data warehouses, and one proposed by Bill Inmon, who is widely considered the father of data warehousing. Each of the two provides his own definition of a data warehouse (DW). Inmon (2005) defines a DW as a subject-oriented, integrated, time-variant, and non-volatile collection of data in support of management’s decision-making process. It is subject oriented in the sense that the data is linked to the company’s business and organized by its functions. It is integrated because the data is obtained from several operational and external sources. Time-variant refers to the fact that in a data warehouse the data can be identified by specific periods; the reason for this is that it contains the history of all transactions. The data in a DW is also non-volatile, as only read functions are allowed and update operations are impossible. Kimball and Ross (2013), on the other hand, define a DW in simpler terms. For them, it is a copy of transactional data that is specifically structured for query and analysis, with the purpose of providing information to support the decision-making process in a company; this makes it a database for the sole use case of decision making and analysis.

When comparing the two definitions, it already becomes clear that the two pursue a different philosophy when it comes to the modelling of a data warehouse. Modelling in the context of a data warehouse refers to the process of setting it up (Yessad & Labiod, 2016).

Inmon's approach begins with the data model of the company. It requires the business analysts to identify the key subject areas and key entities the business uses. The entities can, for example, be customers, vendors, or profit centres and so forth. Once these have been identified, a logical model is created for each of these entities. For example, the vendor entity could be built into a model, and this entity could then have further entities under itself. This effectively means that all the data is stored in a normalized form, to avoid as much data redundancy as possible. This is precisely why the model does not have a problem when it comes to update anomalies. This is followed by building the physical model and the physical implementation of the data warehouse, which is also normalized. Here the single version of the truth comes together and is managed. When querying, this structure requires many joins across many tables; however, it makes loading the data less complex. Inmon then suggests creating data marts for each of the departments. These data marts act as small data warehouses that have more specific use case scenarios than the data warehouse itself, but get all their data from the data warehouse, which is then the only source of data for all departments. Here the data can be de-normalized to help with reporting (Inmon, 2005).

In our case, the Kimball model is of higher importance. The reason for this is that it is easier to implement, and the main advantages that the Inmon model offers, namely that update anomalies can be avoided and that it provides a true single version of the truth, are not relevant for the purpose of the experiment, because we only work with one database at a single point in time. Overall, the Kimball approach is simply the more pragmatic one of the two and makes more sense for the purpose of this work. Therefore, it will be explained in more detail in the following sections.

Figure 1: DW/BI Architecture

Source: Kimball & Ross (2013).

1.2.1 Operational Source System

The Operational Source System is oftentimes also referred to as the Online Transaction Processing system (OLTP). For these systems, we have two main priorities: performance and availability (Kimball & Ross, 2013). These OLTP systems hold the raw data that is necessary to calculate the measures embedded in a data warehouse. They cannot be queried in such broad and unexpected ways as, for example, a BI system allows (Schaffner, Bog, Krüger & Zeier, 2008). When talking about an OLTP system, there are a few problems that arise in the context of BI. By nature, an OLTP is optimized to run efficiently, allowing it to process more than one transaction at a time without them overlapping. Therefore, a relational database is the best solution for such a system. When wanting to calculate a measure, however, we are not aiming to look at one transaction at a time, but at the net value of multiple aggregated transactions over a specified period of time. Additionally, if we were to query for a complex aggregate, we might end up interfering with the business operations, as such a query takes a lot of resources in a relational database. Another worrisome topic in the context of OLTP is archiving. Since these systems’ predominant focus is to run the day-to-day operations, they oftentimes do not hold the historic information that is required in order to run a successful BI operation, where one might need to compare the performance of a measure over a span of several years. Another issue that arises with OLTPs is that they can pose significant problems for data integration, as in many cases they are divided along the different business functions (Larson, 2009).

1.2.2 Extract, Transform, Load

The extract, transform and load system includes everything between the operational system and the BI/DW environment. It is an essential piece of every BIS. As the name of the system suggests, it consists of three fundamental processes, the first one being the extract process (Kimball & Ross, 2013). From a distance, it might seem easy to migrate the data from the legacy systems to the data warehouse environment. However, once started, this is a complex task that needs to fulfill many requirements to be fully functional. In order to meet these requirements, the ETL system needs to come with a variety of different functionalities. One of the problems one might encounter when implementing a DW is that extracting the data from the operational environment requires a change in technology. This change often goes beyond switching the DBMS and also makes it compulsory to change the operating system, and even the hardware-based structure of the data (Inmon, 2005). El Akkaoui, Zimànyi, Mazón, and Trujillo (2011) describe these processes as the backbone component of the data warehouse because they provide it with the necessary integrated and reconciled data. The environment of the ETL process consists of two layers: the first is the layer where the ETL processes take place, and the second is the one where the data is stored and passes through. First, the data comes from the source stores, from where it is extracted and then moved on to the Data Staging Area (DSA). Here the data is transformed and cleaned. The cleansing of the data could be necessary for numerous reasons; there can, for example, be problems related to formats, misspellings or domain conflicts. This transformation and cleansing process adds value to the data by changing and enhancing it. Additionally, these activities can be architected in a way that they provide diagnostic metadata, which can then be used for reengineering the business process (Kimball & Ross, 2013). Once this process has concluded successfully, the data is moved on to the data warehouse, where it is made accessible to the presentation area (Vassiliadis, Simitsis & Skiadopoulos, 2002). These subsystems are critical, as the ETL system’s main goal is to hand over the data of the operational sources to the presentation area, where it can be analyzed.
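A deliberately minimal sketch of the three ETL steps is shown below; the source rows, the cleansing rules and the target table are assumptions for illustration only and have nothing to do with the SSIS packages or AdventureWorks data used later in the experiment.

```python
# Minimal, illustrative ETL sketch: extract raw rows from a source store,
# transform/clean them in a staging structure, and load them into a
# warehouse table held in an in-memory SQLite database.
import sqlite3

def extract():
    # Extract: pull raw rows from an operational source (hard-coded here).
    return [
        {"customer": " Alice ", "country": "Germnay", "amount": "120.50"},
        {"customer": "Bob",     "country": "Germany", "amount": "80.00"},
    ]

def transform(rows):
    # Transform: trim whitespace, fix a known misspelling (domain cleansing)
    # and cast the amount to a numeric type.
    fixes = {"Germnay": "Germany"}
    staged = []
    for r in rows:
        staged.append({
            "customer": r["customer"].strip(),
            "country": fixes.get(r["country"], r["country"]),
            "amount": float(r["amount"]),
        })
    return staged

def load(rows, conn):
    # Load: write the cleaned rows into the presentation-area table.
    conn.execute("CREATE TABLE IF NOT EXISTS fact_sales "
                 "(customer TEXT, country TEXT, amount REAL)")
    conn.executemany(
        "INSERT INTO fact_sales VALUES (:customer, :country, :amount)", rows)

dw = sqlite3.connect(":memory:")
load(transform(extract()), dw)
print(dw.execute(
    "SELECT country, SUM(amount) FROM fact_sales GROUP BY country").fetchall())
```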


1.2.3 Online Analytical Processing

The Online Analytical Processing system, or OLAP for short, is to the Business Intelligence system what the OLTP is to the ERP system. It represents the source of data that will then be analyzed through a variety of techniques and software. What makes OLAP so suitable for BI is that it can sustain a multidimensional data model, which is necessary to produce the desired measures in a data warehouse. OLAP supports operations such as aggregation, filtering, roll-up, drill-down and pivoting on the multidimensional view it provides (Chaudhuri, Dayal & Narasayya, 2011). Each of the dimensions is uniquely bound to a measure. A simple example of this would be the sales amount, whose dimensions can be the product category, the region, the time and so forth. Each of the dimensions can furthermore be split into more attributes. For example, the product dimension could be broken down into the category the product falls in, the average margin of the product or the year of introduction. These relationships between the attributes may in some cases also be structured in a hierarchical manner (Chaudhuri & Dayal, 1997). To put it in other words, a dimension allows applying a categorization to an aggregate measure. This measure, if spread out across dimensions, becomes a cube. For example, you might be looking for the highest selling products in 2003 by a specific region; doing this increases the dimensionality of the measure, which in this case is total sales. This example can be extended even further if we were to say that we want to see it by specific marketing campaigns, which would, in this case, give us a four-dimensional object (Larson, 2009). Another very popular way of employing measures is comparing the same dimensions over different time spans. Time is a very popular dimension and plays a particularly important role in decision support, for example in trend analysis (Chaudhuri & Dayal, 1997).
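As a small illustration of the cube idea, the following sketch (with assumed sample data) treats the sales amount as the measure and territory, year and product as dimensions; the pivot table is one two-dimensional slice of the cube, and the grouped view drills back down to the product level.

```python
# Illustrative sketch of measures and dimensions with pandas.
import pandas as pd

sales = pd.DataFrame({
    "territory": ["Europe", "Europe", "North America", "North America"],
    "year":      [2013, 2014, 2013, 2014],
    "product":   ["Bike", "Helmet", "Bike", "Bike"],
    "amount":    [120.0, 80.0, 200.0, 150.0],
})

# Roll-up: aggregate the measure over the product dimension,
# keeping territory and year as the remaining dimensions.
cube_slice = sales.pivot_table(index="territory", columns="year",
                               values="amount", aggfunc="sum")
print(cube_slice)

# Drill-down: bring the product dimension back in for more detail.
detail = sales.groupby(["territory", "year", "product"])["amount"].sum()
print(detail)
```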

From the conceptual view, multidimensional data is represented as a Cartesian product created out of the intersections of the different dimensions. Based on the concept of OLAP, there exists a variety of different implementation options. We will cover three in particular: Relational OLAP (ROLAP), Multidimensional OLAP (MOLAP) and Hybrid OLAP (HOLAP), each of which has its advantages and disadvantages for data analysis.

Multidimensional OLAP implements the multidimensional view directly and physically on the analysis data, by saving the facts, including their subtotals and derived key figures, directly to a multidimensional database (MDDB). Related facts are also referred to as cubes. MOLAP is recommended when one needs to run well-performing and complex data analysis queries on databases that are not too big. This approach requires storing more redundant data (Humm & Wietek, 2005).

Relational OLAP employs classical relational technology for saving the data. The multidimensionality is thereby created through special data modeling techniques (i.e. the star or snowflake schema). The OLAP server translates the commands into SQL queries, which are then forwarded to the relational database. After the queries have been processed, the OLAP server receives the results and prepares them for the front-end application of the user (Thurnheer, 2003). Apart from the metadata, no further data is physically saved in the OLAP server, which is one of the major differences when comparing MOLAP to ROLAP. This makes ROLAP very usable for data with a high degree of periodicity. ROLAP offers high performance, stability and operational security, also for bigger quantities of data. However, the number of operations that are enabled with a ROLAP solution is traditionally subpar to what a MOLAP solution allows for. The pre-aggregation that MOLAP allows for used to be a big advantage of MOLAP; now, however, it is becoming more and more common in ROLAP solutions as well. Generally, ROLAP is considered to be the easier to implement, more compact, and cheaper solution (Humm & Wietek, 2005).
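To make the ROLAP idea more concrete, the following sketch shows the kind of SQL a ROLAP server might generate for a request such as "sales amount by product category and year"; the small star schema and all its names are assumptions for illustration, not the schema used later in the experiment.

```python
# Sketch of a generated star-schema query run against an assumed,
# in-memory ROLAP-style relational store.
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE dim_product (product_key INTEGER PRIMARY KEY, category TEXT);
CREATE TABLE dim_time    (time_key INTEGER PRIMARY KEY, year INTEGER);
CREATE TABLE fact_sales  (product_key INTEGER, time_key INTEGER, amount REAL);

INSERT INTO dim_product VALUES (1, 'Bikes'), (2, 'Accessories');
INSERT INTO dim_time    VALUES (1, 2013), (2, 2014);
INSERT INTO fact_sales  VALUES (1, 1, 120.0), (1, 2, 150.0), (2, 2, 80.0);
""")

# The kind of SQL an OLAP server could generate from the user's request:
generated_sql = """
SELECT p.category, t.year, SUM(f.amount) AS sales_amount
FROM fact_sales AS f
JOIN dim_product AS p ON p.product_key = f.product_key
JOIN dim_time    AS t ON t.time_key    = f.time_key
GROUP BY p.category, t.year
"""
print(conn.execute(generated_sql).fetchall())
```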

The hybrid solution is a mixture of the ROLAP and MOLAP solutions. HOLAP saves derived data that takes up a limited amount of storage space but is used relatively often directly in the multidimensional database. When accessing less frequently used but larger detailed data, the system uses the relational database structure. However, combining the two does not remove all of their disadvantages; the inefficient access to detailed data and the high redundancy still persist. Nevertheless, HOLAP offers one big advantage in the implementation process: developers do not have to settle on one technology at the beginning of development but can decide throughout the process (Thurnheer, 2003).

1.2.4 Business Intelligence Applications

BI applications can come in many different forms and variations. There are many vendors in the market and each of them has a different vision and idea of what they want to contribute. According to Kimball’s and Ross’s (2013) definition, a BI application queries the data from the DW/BI presentation area. They also mention ad hoc query tools that are used by a small population of business users. Since accessing the desired data through these tools is not always easy, most business users are dependent on prefabricated reports. This is now changing with the prevailing popularity of Self-Service Business Intelligence, which we will further explore in the coming sections.

1.3 Dimensional Data Modelling

After having explored what an OLAP cube is, it now needs to be understood how an OLAP cube functions on the conceptual level. The reason why this section was included in the thesis is that dimensional models are much easier to understand for casual users. This fact will be explored extensively in later sections.

Dimensional modeling is widely accepted as the standard for presenting analytical data. Its importance can be explained by the fact that it simultaneously addresses two requirements: its simple design allows delivering data to users in a way that is understandable, while at the same time offering high query performance. Assuming you are a business that sells products to different markets and tracks its performance over time, it is important for dimensional designers to emphasize the products, the market and time. For a dimensional designer, each of these words describes a dimension, and once they are combined, the resulting cube can be sliced and diced. The goal is to make everything as simple as possible, while not making it too simple. If a data model is overly complicated, it will only get more complicated over time and reduce query performance and understandability (Kimball & Ross, 2013).

Oftentimes dimensional models are still stored and instantiated in relational databases, even though they do not follow the principle of normalization, which is commonly executed in the third normal form (3NF). The advantage of normalization is that it avoids redundancies, thus making it higher performing for queries in the operational environment, which are by nature much smaller than queries by data analysts. A transactional interaction with the database only touches it in one place, whereas a typical query in a BI environment needs to grab data from many touchpoints. A 3NF database divides the data into many discrete entities. When a query is run by a program, it needs to navigate through all these entities, which are stored as relational tables. Each of them is connected by primary keys and foreign keys. Primary keys need to be uniquely identifiable, while, when used as foreign keys in another table, they can appear more than once. This is referred to as referential integrity, and it also entails that the deletion of a table or a change of its primary key can only be performed when the table in question does not show any connection to other entities (Shapiro, Bieniusa, Zeller & Petri, 2018). These 3NF models can be represented in different forms, such as entity relationship diagrams (ERDs). The 3NF data model is simply too complex for providing high query performance for BI applications. The unpredictability of the queries that are generated by users would make the use of such fully normalized models in the BI environment impossible because of the disastrous query performance. Not only that, but the queries that need to be constructed are very likely to bring users to their limits because of the complex navigation required (Inmon, 2005). ER modelling aims to remove redundancy in the data by inspecting even the most microscopic relationships between data elements.

As mentioned before, such a modeling technique is highly beneficial for transactional processing, as it makes the transactions very simple and deterministic (Kimball, 1997). The reason why data that is modeled dimensionally is easier to understand for humans is that we only have a limited capacity to process information. If this capability, which is said to be around seven pieces of information, is exceeded, knowledge degrades quickly; hence we humans arrange information in chunks of convenient sizes. The process of chunking is actively pursued when creating a star schema, which will be explained later. Another reason it is easier to understand is hierarchical structuring, which is the architecture of complexity. The reason star schemas are called star schemas is the appearance of their structure. It allows managing complexity by reducing the number of items that are stored at each level of the hierarchy. Each dimension in a star schema consists of one or two hierarchies and provides a way of classifying business events stored in the fact table, which in turn reduces the complexity (Sehgal & Ranga, 2016). The fact table is the main component of any dimensional model. It holds performance measurements that result from events that take place in the organization. Low-level measurement data should be stored in a single dimensional table. Each row in the fact table is a measurement event, and each measurement event is recorded at a certain level of granularity. This can be, for example, the number of sold items per sales transaction. Here it is important to note that each measurement in the fact table should have the same level of granularity. This ensures that measurements are not wrongfully double counted. A measurement event in the real world has a one-to-one translation to a single row in its related fact table. This is a foundational principle that needs to be considered throughout dimensional modelling.

The most useful and most common facts are numeric and additive. It is crucial that fact rows are additive: a query against a fact table typically retrieves thousands or even millions of rows at once; therefore, what makes the most sense to do with them is to add them up. There are also semi-additive facts and non-additive ones. Semi-additive facts are, for example, account balances, which cannot be added over a specific period of time (or the specified time dimensions), as they simply represent the status quo. Non-additive facts, on the other hand, cannot be added at all. An example of this would be unit prices; they could be printed one at a time, counted or averaged. The minimum number of foreign keys in a fact table is two; however, this is rarely the case, as a star schema with only two dimensions does not offer much usability or flexibility. The primary key of the fact table is called a composite key because it consists of all the foreign keys of the dimensions that are connected to it. It is also important to mention that fact tables always express many-to-many relationships (Kimball & Ross, 2013).
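The difference between the three kinds of facts can be made concrete with a small, assumed example; the column names and values below are illustrative only.

```python
# Additive, semi-additive and non-additive facts on assumed sample data:
# sales quantity can be summed freely, an account balance can be summed
# across accounts at one point in time but not across time, and a unit
# price should be averaged or counted rather than summed.
import pandas as pd

lines = pd.DataFrame({
    "month":      ["Jan", "Jan", "Feb"],
    "quantity":   [10, 5, 8],           # additive fact
    "unit_price": [3.0, 4.0, 3.5],      # non-additive fact
})
balances = pd.DataFrame({
    "month":   ["Jan", "Feb", "Jan", "Feb"],
    "account": ["A", "A", "B", "B"],
    "balance": [100.0, 120.0, 50.0, 60.0],  # semi-additive fact
})

print(lines["quantity"].sum())                      # total units sold: meaningful
print(lines["unit_price"].mean())                   # average unit price: meaningful
print(balances.groupby("month")["balance"].sum())   # per point in time: meaningful
# Summing balances across months (e.g. 100 + 120 for account A) would be
# meaningless, which is what makes the balance only semi-additive.
```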

After having explored the fact table, it is necessary to understand its companions, the dimension tables. They provide the context of the business process measurement event, such as the “who, what, where, when, how and why”. Compared to fact tables, dimension tables have more columns and attributes; however, there are also some cases of dimension tables that only contain a few columns. Each of the dimension tables provides a primary key, which ensures referential integrity when the dimension tables are joined with the fact table. The dimension table attributes are crucial in a Business Intelligence system. They are the source of all constraints and report labels, which are needed to make a Business Intelligence system more intelligible and serviceable. This is a semantic problem; rather than using technical abbreviations, it makes sense to use real-world words that fit within the context of the business process. The goal is to make the labels more user-friendly. The general consensus is that a data warehouse’s analytical power is proportional to the quality and the depth of the dimensional attributes. For data warehouse queries, the SQL statements virtually always take the headers of the dimension tables as the source (Kimball, 1997). When looking at operational source data, it can sometimes be unclear whether a numeric data element is a measurement or a dimensional attribute. To find out, one needs to ask whether the data element takes on lots of values and is part of calculations (which would make it a fact value) or whether it is a discretely valued description that is reasonably constant and is part of constraints and row labels (which would make it a dimensional value). Let us consider the unit price of a product; it may change so often that it is best seen as a measurement fact (Kimball & Ross, 2013). When trying to understand the relationship between ER and DM (Dimensional Modelling), we need to understand that an ER model can be broken down into many DMs. By representing multiple processes in one single ER diagram, it consists of multiple data sets, which makes it impossible for them to coexist at the same point in time; therefore, the first step in transforming an ER model into a dimensional model is to identify the single processes within the ER diagram and model each of them separately (Sehgal & Ranga, 2016).

DM has a variety of strengths that make it superior to the ER modelling technique. One of them is that it is simply more accessible to users. Understanding the relationships between the tables is much easier compared to an extensive ER diagram that spans many different processes at once. The predictable framework it offers is also much more advantageous in terms of processing. Once the user’s database constraints are applied, the database engine can engage the fact table and start creating the Cartesian product. A second strength presented by Kimball (1997) is that each of the dimensions in the DM is equivalent, which means that each of them can be seen as an equivalent entry point into the fact table. This makes it more resistant to unexpected user behavior. Not only is the user interface symmetrical, but so are the query strategies and the SQL statements that are generated against the dimensional model. A third strength that dimensional data modelling possesses is that it can be extended. This can be done in several ways. For example, the data can be changed with an alter command, which avoids having to reload the data. Columns can be added easily to the fact table in the form of measures. It also means that no reporting or query tool needs to be reprogrammed to accommodate the changes. Lastly, it means that all applications linked to the dimensional model continue to run without problems. The fourth strength of dimensional modelling is that it offers standard procedures for common modelling situations in the business world (Kimball, 1997). The most viable example of these business situations is slowly changing dimensions. In their Data Warehouse Toolkit book, Kimball and Ross (2013) describe a total of seven techniques to deal with slowly changing dimensions. In case the values never change, they suggest simply keeping the value and grouping the values accordingly. A good example of this would be a product that belongs to a certain product category: it will always belong to this product category, and it therefore makes sense to group it by that value. The third technique that is proposed is to add a new attribute that allows creating an alternate reality. Similarly to the type one way of dealing with slowly changing dimensions, the new value overwrites the old one; however, unlike the type one approach, the old value is kept in the additional attribute, which still offers the functionality of ordering according to the old or the new value and thereby preserves the history of the change that occurred over time.
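As a hedged sketch of two of the techniques described above, the following snippet (with assumed column names, not the actual dimension columns used later in the experiment) contrasts simply overwriting a changed attribute with keeping the prior value in an additional "alternate reality" attribute.

```python
# Sketch of handling a slowly changing customer attribute (territory).
import pandas as pd

dim_customer = pd.DataFrame({
    "customer_key": [1, 2],
    "name":         ["Alice", "Bob"],
    "territory":    ["Europe", "Europe"],
})

# Type 1 style: overwrite the attribute in place; the old value is lost.
scd_type1 = dim_customer.copy()
scd_type1.loc[scd_type1["customer_key"] == 1, "territory"] = "North America"

# Type 3 style ("alternate reality"): keep the previous value in an extra
# attribute before writing the new one, so both orderings stay possible.
scd_type3 = dim_customer.copy()
scd_type3["prior_territory"] = scd_type3["territory"]
scd_type3.loc[scd_type3["customer_key"] == 1, "territory"] = "North America"

print(scd_type1)
print(scd_type3)
```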

Now the question is how do we get from a relational model to a dimensional one? To illustrate this, the AdventureWorks database was used. It is the same database that is later used in the experiment in the form of a star schema.

In their comprehensive paper on how to transition from a relational model to a dimensional one, Sehgal and Ranga (2016) identified four steps: (1) classifying entities, (2) designing a high-level star schema, (3) designing a detailed fact table and (4) designing comprehensive dimension tables. In order to classify the entities, we first need to understand the three different kinds of entities that exist. Transaction entities record business events such as orders, payments, reservations and so forth. They play a crucial role in most decision support applications as they help to identify certain trends, patterns and potential problems and opportunities. An exception to this are so-called snapshot entities. In contrast to regular transaction entities, they are already aggregated as they record the status quo of different states, such as account balances. Secondly, we have component entities that explain the “who”, “what”, “when”, “where” and “how”. They are always connected through one-to-many relationships. For example, we have the SalesOrderDetail table that contains all the specifics of each line of the sale that was made, and then we have the SalesOrderHeader that is linked to multiple records of the SalesOrderDetail and usually contains the total due amount of the order and so forth. A sales transaction is usually defined by a number of components: for example, who was the customer, which product was sold, where was it sold, and the period, which defines when the product was sold. Lastly, we have classification entities. Their purpose is to classify the component entities. Usually, they are linked by a chain of one-to-many relationships (Moody & Kortink, 2003). They are functionally dependent on a component entity (directly or transitively). To further illustrate how the classification works, the following figure shows the relational model of the AdventureWorks database with its classified entities.

Figure 2: Entity Classification of the AdventureWorks database

Source: Own Work.

The second step is about designing a high-level star schema. Since the goal of a data warehouse is to aggregate business events in a meaningful way, transaction entities are always a prime candidate for the fact table. Component entities will then almost always form the dimension tables. Not all measures included in a transaction entity are relevant; therefore, the person responsible for the modeling needs to decide which measures to keep and which ones to drop. When designing the star schema, it is crucial to decide the granularity of the measures that will be included in the fact table, or in other words the level of summarization the measures go through. This can be un-summarized at the transaction level, or it can be summarized by a subset of dimensions or dimensional attributes. The lower the granularity, in other words the higher the summarization, the less storage space we need and the faster we can access the aggregates. As can be expected, summarization also comes with a disadvantage, namely the loss of detail and a resulting restriction on analysis techniques (Sehgal & Ranga, 2016). In the Kimball approach, it is of utmost importance that the dimension tables, and the views that are put on these tables, are the same in all star schemas, in order to be able to drill across to other star schemas. This does not mean that a star schema needs to be connected to all dimension tables, but rather that the product dimension, for example, should be the same across all star schemas. Commonly these are called conformed dimensions (Kimball & Ross, 2013). When designing the star schema, the responsible person needs to identify the relevant dimensions. Depending on the chosen level of granularity, some dimensions might not be suitable; additionally, explicit dimensions need to be identified.

A typical candidate for an explicit dimension is the time dimension, which is necessary to perform historical analyses. In the operational system, the longevity of records is limited, which is why they usually only get a time stamp, which then needs to be broken down in the ETL process into its components such as time of day, day, month and so forth. Step three is about designing a detailed fact table. The first thing that needs to be done to have a working fact table is defining its relationships with the surrounding dimension tables. The key that uniquely identifies each row of the fact table is always a combination of the keys of the dimension tables. The resulting key is, unlike in normal relational databases, not minimal but rather long. The most important part of defining the fact table is obviously the inclusion of meaningful measures. These can be, as previously mentioned, additive, semi-additive or non-additive (Kimball, 1997). The facts are dependent on the information that is collected by the operational system. Moody and Kortink (2003) also mention that whenever possible additive facts should be used, as they are less prone to errors in queries. Fact tables which are based on transaction entities should be broken down to the item level (Kimball & Ross, 2013). Again, looking at our example, the SalesOrderDetail table is much more suitable for the fact table than the SalesOrderHeader. If we want to analyse the effect of discounts on different products, for example, it would not be suitable to use the discount from the SalesOrderHeader, as it would result in counting the discount of the total order in which the product was involved multiple times, rather than the actual discount that was applied to the single product at the line level.
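The double-counting risk can be illustrated with a small, assumed example; the column names below are deliberately simpler than the actual AdventureWorks tables.

```python
# Why the discount should be taken at the order-line grain: repeating a
# header-level discount on every line of the same order counts it several
# times, while the line-level discount sums correctly per product.
import pandas as pd

order_lines = pd.DataFrame({
    "order_id":      [1, 1, 2],
    "product":       ["Bike", "Helmet", "Bike"],
    "line_discount": [10.0, 2.0, 5.0],
})
order_headers = pd.DataFrame({
    "order_id":        [1, 2],
    "header_discount": [12.0, 5.0],
})

# Correct: aggregate the line-level fact by product.
print(order_lines.groupby("product")["line_discount"].sum())

# Incorrect: joining the header discount to every line repeats it, so
# order 1's 12.0 discount is counted twice when summing by product.
joined = order_lines.merge(order_headers, on="order_id")
print(joined.groupby("product")["header_discount"].sum())
```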

The fourth and last step required to transform an ER model into a dimensional one is the detailed dimension table design. The first action that needs to be undertaken is defining the dimensional key. In most cases, this is done by simply taking the already existing numeric key from the component entity the dimension table is based on. However, when doing so, one needs to ask whether the key gets reused over time in the operational system or not. If so, it is highly recommended to implement a surrogate key, which is not as limited as the natural key. The same might hold true for slowly changing dimensions. The second action is collapsing the hierarchies of the relational model into a single dimension table. In our case, for example, the resulting customer dimension is a collapsed hierarchy of the Client table. The five surrounding tables (PersonDetails, Store, PhoneNumber, Email and CompanyContact) were collapsed into the single customer dimension that we have now. The following figure illustrates the sales case around which our dimensional model was built.
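A simplified sketch of the collapsing step and of the surrogate key assignment might look as follows; only two mock source tables with assumed columns are used here, not the actual five tables mentioned above.

```python
# Collapse a small hierarchy into one flat customer dimension and assign a
# warehouse-owned surrogate key that is independent of the natural key.
import pandas as pd

client = pd.DataFrame({
    "client_id": [101, 102],          # natural key from the source system
    "person_id": [1, 2],
})
person_details = pd.DataFrame({
    "person_id":  [1, 2],
    "first_name": ["Alice", "Bob"],
    "city":       ["Berlin", "Ljubljana"],
})

# Collapse: join the related tables into one flat dimension table.
dim_customer = client.merge(person_details, on="person_id")

# Surrogate key: survives key reuse and slowly changing attributes.
dim_customer.insert(0, "customer_key", range(1, len(dim_customer) + 1))
dim_customer = dim_customer.rename(columns={"client_id": "client_natural_key"})
print(dim_customer[["customer_key", "client_natural_key", "first_name", "city"]])
```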


Figure 3: Hierarchies in the Sales case

Source: Own Work.

Hierarchies are an extremely important concept in dimensional modelling and form the primary basis for deriving dimensional models from Entity Relationship models. A hierarchy in an Entity Relationship model is any sequence of entities joined together by one-to-many relationships, all aligned in the same direction. A hierarchy is called maximal if it cannot be extended upwards or downwards by including another entity. An entity is called minimal if it is at the bottom of a maximal hierarchy and maximal if it is at the top of one. Minimal entities can be easily identified as they are entities with no one-to-many relationships (“leaf” entities in hierarchical terminology), while maximal entities are entities with no many-to-one relationships (“root” entities). The outcome of the transformation is represented in Figure 4 below.
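The rule for spotting minimal and maximal entities can be sketched in a few lines of code; the one-to-many relationships below are assumed examples loosely inspired by the AdventureWorks sales case, not the full model used in the experiment.

```python
# Identify minimal ("leaf") and maximal ("root") entities from a list of
# one-to-many relationships written as (one_side, many_side).
one_to_many = [
    ("ProductCategory", "Product"),
    ("Product", "SalesOrderDetail"),
    ("Customer", "SalesOrderHeader"),
    ("SalesOrderHeader", "SalesOrderDetail"),
]

entities = {e for pair in one_to_many for e in pair}
one_sides = {one for one, _ in one_to_many}
many_sides = {many for _, many in one_to_many}

minimal = entities - one_sides   # no outgoing one-to-many relationship
maximal = entities - many_sides  # no incoming one-to-many relationship

print("minimal:", minimal)   # e.g. {'SalesOrderDetail'}
print("maximal:", maximal)   # e.g. {'ProductCategory', 'Customer'}
```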


Figure 4: Database after the transformation

Source: Own Work.

The names of the dimension tables have already been changed to conform better to the requirements of an SSBI system.

In the end, we ended up with five dimensions:

• Customer (Dim_Customer)
• Time (Dim_Time)
• Address (Dim_Address)
• Product (Dim_Product)
• Sales Details (DIM_SalesDetails)

The fact table is represented as:

• Fact Sales (FactSales)
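As a hedged sketch, the star schema listed above could be expressed in DDL roughly as follows; the key columns mirror the tables named in the lists, while every other column and measure is merely an assumed placeholder, since the actual tables used in the experiment are not reproduced here.

```python
# Assumed DDL sketch of the resulting star schema, executed against an
# in-memory SQLite database to keep the example self-contained.
import sqlite3

ddl = """
CREATE TABLE Dim_Customer     (CustomerKey INTEGER PRIMARY KEY, Name TEXT);
CREATE TABLE Dim_Time         (TimeKey INTEGER PRIMARY KEY, Date TEXT, Year INTEGER);
CREATE TABLE Dim_Address      (AddressKey INTEGER PRIMARY KEY, City TEXT, Territory TEXT);
CREATE TABLE Dim_Product      (ProductKey INTEGER PRIMARY KEY, ProductName TEXT, Category TEXT);
CREATE TABLE DIM_SalesDetails (SalesDetailsKey INTEGER PRIMARY KEY, OrderNumber TEXT);

CREATE TABLE FactSales (
    CustomerKey     INTEGER REFERENCES Dim_Customer(CustomerKey),
    TimeKey         INTEGER REFERENCES Dim_Time(TimeKey),
    AddressKey      INTEGER REFERENCES Dim_Address(AddressKey),
    ProductKey      INTEGER REFERENCES Dim_Product(ProductKey),
    SalesDetailsKey INTEGER REFERENCES DIM_SalesDetails(SalesDetailsKey),
    Quantity        INTEGER,   -- additive measure (placeholder)
    SalesAmount     REAL,      -- additive measure (placeholder)
    -- composite primary key made up of the dimension foreign keys
    PRIMARY KEY (CustomerKey, TimeKey, AddressKey, ProductKey, SalesDetailsKey)
);
"""
sqlite3.connect(":memory:").executescript(ddl)
```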


1.4 Self-Service and Self-Service Business Intelligence

The concept of self-service is by no means a new development. Common examples in which self-service has revolutionized the way consumers interact with companies are ATMs or gas stations, where it is now common practice that customers pump the gas into the car themselves. These two examples are only possible due to the emergence of new technologies that allowed making the interaction with the customer more efficient and convenient. No longer do you have to go into the bank and talk with an employee to withdraw cash, nor do you have to wait for an attendant who pumps the gasoline for you at a gas station. It is hard to say who was the first person to see self-service as an overarching concept that can be applied to many sectors; however, its popularity nowadays is stronger than ever. One of the earliest examples that can be found in regard to self-service is the “Self-Serving Store”. In 1916, Clarence Saunders applied for a patent for his invention. Before that, it was common practice that the customer gave an employee of the store a list of goods he required, which would then be brought to him. Saunders thought of a new design of the shop, which would make it easy for shoppers to find their desired products and present them to a cashier who would then give them the bill (Saunders, 1917). Almost a hundred years later, the retail sector is once again undergoing a fundamental change through self-service, this time however with the help of technology. According to Castro, Atkinson, and Ezell (2010), in 2008 North America already had 74,000 self-checkouts and Western Europe 15,000. They also claim that in North America 5% of sales in the retail sector were generated through the use of self-checkouts. Vakulenko, Hellstrom, and Oghazi (2018) argue that specifically in the retail business the move towards self-service checkouts is due to the change in the market environment. This change is characterized by the rapid increases in consumption, delivered and returned goods, urbanization and service orientation.

Considering self-service in the context of technological change, the general consensus is that it increases firm productivity while reducing the cost of service delivery at the same time. At an airport, for example, the number of passengers that check in can be increased by up to 50 percent with self-check-in options (Scherer, Wünderlich & von Wangenheim, 2015). This also allows online businesses to act much more competitively than traditional retail ones.

When it comes to the use of Self-Service Technologies (SSTs), technological anxiety, technological trust, and behavioral intention play an important role. Technological anxiety is one of the relatively often studied affective reactions towards SSTs (Liu, 2012). Technological anxiety has a long history. Technology is one of the main drivers of economies and has on many occasions been met with apprehension by the public. Sigmund Freud describes Man as a prosthetic God that is truly magnificent once he puts on all his auxiliary organs (technologies), but these often give him many troubles because they have not grown onto him (Freud, 1962). Mokyr, Vickers, and Ziebarth (2015) claim that technological anxiety can come in three different forms, of which the first two can be characterized from an optimistic viewpoint that technology will accelerate and grow. The first one is that technology substitutes labor and will effectively result in more economic inequality. The second anxiety stems from the industrial revolution and its dehumanizing effects on labour, particularly the repetitive nature of work that workers had to endure during that time. The last form that technological anxiety can take is the view that technology has become stagnant and does not offer any new value. Technological trust in its rawest form has its origins in communication theory. Here the notion that is explored is not the trust that is placed in a technology, but in an intimate object, which can be a person, place or object. In our case, an SST is an object upon which trust is bestowed. In her study to identify the one-sided trust relationship between humans and machines, Muir (1994) used an interpersonal approach and was able to find three commonalities. The first one is trust as an expectation or confidence. Trust here is seen as an expectation, requiring technology to function in a way that offers predictable outcomes. Secondly, trust is always focused on a person, place or object; therefore, trust is a state, condition or perception that is directed towards an intimate object. Lastly, trust always occurs in the presence of several characteristics of the referents, which are referred to as reliability, honesty and motivation. Lippert and Swiercz (2005) correctly identified that trust in a technology is not comparable to interpersonal trust, as it is not bi-directional; a machine does not have the capability to return trust. Lippert (2002) also came up with the Trust in Information Systems Technology (TIST) model. Here she argues that trust structures in information technology are bound by three measures: predictability, reliability, and utility. Reliability refers to the technology being available when needed during the day. Predictability refers to the user’s ability to gauge, based on previous experience, to what extent the technology will work in the future. Utility, the last measure, determines to what degree the information that was produced as the outcome of the information technology meets the user’s needs. All of these measures are then influenced by a hidden layer, namely the predilection to trust technology, which is the personal expectancy of being able to rely on the technology. Behavioral intentions, as the last piece of the puzzle regarding the use of SSTs, are indicators that represent whether the customer, or in our case the user, has defected from or stayed with the organization. Favorable behavioral intentions are, for example, those in which the user recommends the service to others (Liu, 2012).

After setting the stage with the basics of self-service, its history, and influencing factors, it will now be discussed what exactly Self-Service Business Intelligence (SSBI) is, why it is beneficial for a company and what the potential pitfalls of the technology might be. The BARC BI Survey from 2017 demonstrates the importance of SSBI in today's BI industry landscape. Together with data discovery, SSBI occupies the top two spots among the most important trends of 2017. SSBI has been among the two top-scoring trends since 2013, when it was first introduced. In 2017 it was the first BI technology to achieve 60% adoption; the trends that follow only appear after a drop of around 20 percent in use (Janoschek et al., 2017). The yearly Magic Quadrant for Business Intelligence and Analytics Platforms report by Gartner from February 2018 supports that prognosis. This report lists the strengths and weaknesses according to different use cases and critical capabilities that Analytics and Business Intelligence platforms should bring to the table in today's market. The Magic Quadrant is Gartner's largest product and is considered by many to offer great insights into the market, as Gartner is not endorsed by any of the vendors. In their market description of 2018, Howson et al. (2018) argue that visual-based data discovery, of which SSBI is a part, was and still is a big wave of disruption that took hold in 2004. This wave made the market move from IT-centric systems of record to more agile, business-centric solutions with self-service. Nowadays analytic platforms need to be easy to use while at the same time offering all the typical analytic workflow capabilities. Gartner decided to split the market into two segments: the more business-centric and the traditional BI segment. They see a clear winner in the first one, as sales figures in the traditional segment have been declining since 2015. Older vendors are now trying to catch up, whereas new market participants focus directly on the more agile approach, while also extending capabilities related to publishing and sharing as well as making their solutions more scalable. The new BI systems with a bigger focus on visual data discovery are now the mainstream and set the benchmark for Gartner when comparing the vendors to each other.

After having explored why Self-Service Business Intelligence is so important in the market and here to stay, we now need to understand what exactly it is. SSBI enables business users to explore the data that is available company-wide in a manner that provides them with the necessary decision-aiding information they can effectively act upon, without having to rely on the IT department. The users can interact with the analyses graphically to find issues that need to be addressed. For example, a manager could drill down to find out which members of the salesforce are underperforming in which region. This allows them to figure out which employees they should keep and which they should no longer keep in the organization (Heller, 2017). Before understanding what SSBI offers to an organisation and why its deployment oftentimes falters, we need to understand the users who are eventually exposed to the SSBI solution.

There exist two main types of Self-Service BI, which correspond to the two profiles that exist for the users: power users and casual users. Power users are comfortable accessing, analyzing and publishing reports on a regular basis. Casual users, on the other hand, use SSBI only to the extent of what Eckerson calls "ad hoc report navigation". At a macro level, they can further be differentiated by the fact that casual users rarely produce information but rather consume it, whereas power users actively consume and produce information (Eckerson, 2014a). Unlike the power users, casual users do not create custom reports but are dependent on predefined navigation paths. These two user groups can be further broken down. In terms of casual users, we have data consumers and data explorers. The consumer simply wants to use the pre-produced reports and might occasionally interact with them by further drilling down, whereas the data explorers sometimes need to create their own reports from scratch without coding. This can be achieved when the data discovery software provides drag-and-drop functionalities. In terms of the power users, we can identify three different subclasses: data analysts, data scientists, and statisticians (Eckerson, 2016). Data analysts are proficient in the business language and have some SQL and statistics skills. They are usually positioned in an expert role in a certain department (sales, HR, marketing etc.) to solve data-centric problems. They might for example carry out root-cause analyses or create pricing plans. For them, it is important to have far-reaching access to the corporate data (Esbensen, Guyot, Westad, & Houmoller, 2002). Data scientists usually have a computer science background and are proficient in using a variety of different programming languages. In the best case, they also have capabilities related to data mining and can create predictive models. Similarly to data scientists, statisticians are concerned with creating machine learning and predictive models; however, they do not have a computer science background and oftentimes come from fields such as econometrics, social sciences or mathematics (Karttunen, 2012). To support the activities of the business users, which include the power users and casual users, we need developers. Typical roles that need to be filled to achieve a successful SSBI system are: systems analysts, data engineers, business developers, and application developers. Systems analysts manage the organization's operational systems and can also be database administrators. Their role is hugely important because any small mistake that sneaks into the operational system can have severe consequences for the downstream analytical capabilities, which is why they need to stay in constant contact with analytical professionals. Data engineers, on the other hand, are given the role of managing the information supply chain. Traditionally they were called ETL developers or data architects. Their tasks entail mapping data flows, identifying source data, modelling databases, defining and monitoring jobs, as well as working with database developers to improve the performance of, or create and manage, databases. Thirdly we have business developers, whose job it is to build dashboards for business consumption. They can be BI developers located in the department they are concerned with, but they can also be business analysts who have become accustomed to building reports for their colleagues in a drag-and-drop manner. The last subclass of the developers is the application developer, who builds custom analytic applications with the help of APIs, various programming languages or software development kits (SDKs) (Eckerson, 2016).

The MAD framework provides an optimal way to prepare an SSBI environment that is able to fulfill the requirements of casual users. MAD stands for Monitor, Analyze and Drill to Detail.

Monitor:

This first layer, which provides the monitoring functionality, is tailored towards executives and managers. It presents graphical KPIs to the stakeholders, which contain financial information of the organization. For example, this functionality can take the form of a traffic light that switches from yellow to red once certain thresholds are surpassed (Eckerson, 2009).

Analyze:

The second layer is designed for managers and analysts. Once the light switches from yellow to red, the managers or analysts need to identify the source of the problem. The user has the option to analyze the KPIs by viewing them in different dimensions or by reviewing relevant fact tables; therefore, the tools need to be able to employ different filters. One reason a KPI could show a negative value can be an outlier that affected the value very strongly. Once this is the case, it needs to be questioned whether the KPI really represents the actual situation (Eckerson, 2009).

Drill to Detail:

This last layer contains very detailed and sensitive data; here operational queries and reports are used. Once the source of a problem has been identified in the previous layer, the operational staff or, in some cases, analysts identify which entities have been affected by the problem and whom they need to reach out to (Eckerson, 2009).
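To make the three MAD layers more concrete, the following minimal Python sketch illustrates the idea under assumed data: a monitoring function maps an aggregated KPI to a traffic-light status, an analysis function breaks the KPI down along a dimension, and a drill function returns the operational rows behind a slice. The KPI, thresholds and records are hypothetical and only serve to illustrate the layering described by Eckerson (2009).

```python
# Minimal sketch of the MAD layering (Monitor -> Analyze -> Drill to Detail).
# All names, thresholds and records are hypothetical illustrations.

ORDERS = [  # fictitious operational records (the Drill-to-Detail layer works on these)
    {"region": "Europe", "rep": "Alice", "revenue": 120_000},
    {"region": "Europe", "rep": "Bob", "revenue": 40_000},
    {"region": "Australia", "rep": "Carol", "revenue": 30_000},
]

def monitor(kpi_value, green=200_000, yellow=150_000):
    """Monitor layer: turn an aggregated KPI into a traffic-light status."""
    if kpi_value >= green:
        return "green"
    return "yellow" if kpi_value >= yellow else "red"

def analyze(records, dimension):
    """Analyze layer: break the KPI down along a chosen dimension."""
    totals = {}
    for r in records:
        totals[r[dimension]] = totals.get(r[dimension], 0) + r["revenue"]
    return totals

def drill_to_detail(records, **filters):
    """Drill-to-Detail layer: return the operational rows behind a slice."""
    return [r for r in records if all(r[k] == v for k, v in filters.items())]

total_revenue = sum(r["revenue"] for r in ORDERS)
print(monitor(total_revenue))                    # 'yellow' for these numbers
print(analyze(ORDERS, "region"))                 # revenue per region
print(drill_to_detail(ORDERS, region="Europe"))  # affected operational rows
```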

1.5 Self-Service Business Intelligence: Benefits and Shortcomings

The intention behind SSBI is to give people across the company the tools to generate reports and analytical queries built upon parameters they came up with themselves, without being dependent on the IT department. This seemingly simple process, however, is very much dependent on the individual's knowledge of the information available and its place of storage, so that they can draw decisions from it. Previously, business users had two options when they had to make a decision based on data: they could ask the BI team to construct a report for them, or they could simply guess. SSBI fully eliminates this second and quite risky option (Lennerholt & Söderström, 2018a). Besides this, it is also highly dependent on the individual's ability to use a variety of different technologies, tools, and parameters (Burke, Simpson, & Staples, 2016). The SSBI approach is not only beneficial for the end users, but also for the IT staff, who can now focus more on providing high-quality data (Kabakchieva, Stefanova, & Yordanova, 2013). It also gives the IT department more time for other value-adding activities such as developing new applications, incorporating new technologies to improve performance, or expanding the data from existing and new sources. The IT department also becomes a partner to the business users, rather than the roadblock it is oftentimes seen as. The IT can better support business needs, whereas the business users are put in a role where they are responsible for the BI capabilities; thus it also helps to cut resources (Imhoff & White, 2011). For knowledge workers in companies, gathering actionable information often takes up a lot of time. Once they have been given access to the corporate data with tools other than SSBI, however, this can introduce errors, rework and knowledge gaps. This has a lot to do with the system of names used in the database environment; for example, the column names can be inconsistent across tables, or the names might not carry any meaning for the user. According to Eckerson (2009b), the wide availability of SSBI within an organization enables companies to act proactively rather than reactively. SSBI makes it easy to access data, including fresh data, so as soon as the data arrives users can act upon it. This proactive approach can not only help to cut costs but also increase market share and support expansion in many ways (Lennerholt & Söderström, 2018).

When it comes to the implementation and actual use of SSBI, there are not only technical challenges to consider but also organizational ones. In their comprehensive literature review, Lennerholt and Söderström (2018) found two main categories of SSBI implementation challenges: access and use of data, as well as self-reliant users. It is critical to have an information-centric culture in place. In today's business environment this is basically a given, since everybody wants to be able to build upon their competitive advantage. However, adopting a new technology is not always met only by excitement but also by fear, which can be seen in the resistance of the users. This could be because they are already used to a certain BI tool, or are totally new to BI, which is now introduced in the form of SSBI. It is always hard to adapt to a new technology or a changed process. Hence effective change management is crucial, and the preparedness of the business users needs to be assessed carefully by the IT decision makers (Z. Davis, 2012).

Achieving this information-centric culture is also challenged by the fact that many users tend to use the BI tools to support their own opinion, instead of looking for errors in their own reasoning. To achieve the required level of analytical proficiency, the users need to attend trainings and educate themselves. What this boils down to is that the users should not only know how to use a BI tool, but also internalize the notion of analytics and know how to validate their analyses and compare results. It is not helpful if they can navigate the software with ease but do not know how to select and interpret the data depending on the analysis needed (Lennerholt & Söderström, 2018a). Due to differing functionalities, many enterprises employ more than one SSBI system (Kosambia, 2008). This makes it harder to maintain the desired single version of the truth. Here it is also important to carefully analyse the users' requirements. SSBI tools, especially for casual users, can offer many features and functions that can be overwhelming. Power users can embrace these features with ease; for the casual users, however, the features need to be simple, intuitive and easy to use. Many SSBI applications now take a context-specific approach: if a user is viewing a chart, the software automatically displays the editing options for charts (Eckerson, 2014a). Generally, it can be said that with more offered flexibility comes more complexity the user has to deal with, so finding the balance between complexity and flexibility for the individual user groups is crucial (Lennerholt & Söderström, 2018a).

There are two requirements for easy BI: the tools need to be easier to use for less experienced information producers when developing BI applications, and the BI tools must be easier to use for information consumers when consuming BI results. The latter differs from simply offering an easier-to-comprehend user interface, because easy BI focuses on making BI tools easier to use for all types of information workers, whereas a clean and easy interface only conveys BI results better to information consumers (Imhoff & White, 2011). Traditionally, power users would serve the casual users' needs by fulfilling their requests for certain reports; however, with the increasing volume and frequency of such requests, the casual users' needs cannot be met without SSBI in today's business environment. The goal is to make the casual users self-reliant; however, to get there, organizations need to have deep insight into what skills users require to properly operate SSBI software and its underlying data structures. This requires more reflection from the business users so that everything functions smoothly and effectively (Lennerholt & Söderström, 2018).

After having covered the challenges related to self-reliant users, we will now move on to the topic of access and use of data. In many cases the knowledge workers do not get to analyse because they spend a huge amount of time gathering the data. This can also result in costs, as these workers might feel that they are not using their skills effectively, which results in a higher turnover rate among them. Many companies who are struggling with SSBI ultimately decide to give all employees all the data. This is the wrong approach, because if you give your employees everything, they will tackle their problems in different ways, which results in multiple versions of the truth. This might be sustainable for some time, but will not work favourably for the company in the long run (Weber, 2013). The second challenge is that the SSBI needs to be developed with a good understanding of the business, which requires the BI specialists in the company to work closely with the business users during the development process. Only in the case of simple analytical requirements and business rules is it possible for the developers to produce meaningful outcomes on their own; however, this is the exception, as the organizational environment is usually more complex (Z. Davis, 2012). To circumvent this issue, it is necessary to offer high-quality data. But how exactly do you select high-quality data?

Abelló et al. (2013) propose three criteria: relevance to keywords, integrability, and owner and user ratings. The first criterion concerns data retrieval, for which the data should have tags and metadata that describe the source data. Secondly, the quality of external data is higher if we have identifiers that relate it to internal data. The last criterion proposes that data should be rated according to its completeness, correctness and freshness. Another set of challenges that impact the use and access of data are makeshift practices, which result in companies noticing that they cannot sustain or scale the SSBI system. For example, the data warehouse design needs to be impeccable. The hierarchies of the relationally stored data need to be broken up in a manner that allows all queries to be run automatically by the SSBI software. Even if a proposed star schema were to support 95% of all automatically generated queries, the other 5% might be crucial in order to create meaningful reports. It cannot be expected of the casual users that they know how to correctly navigate the relational database and write complex queries (Lennerholt & Söderström, 2018). In their "State of Self Service BI Report" from 2015, Logi Analytics found that spreadsheets are the primarily used BI tool in companies. The problem that arises through this, however, is that the use of spreadsheets creates a lack of control regarding integrity, security and distribution of the data. As more external data enters the system, the importance of integrity and security increases exponentially. This is why it is important to determine who can add and use which data, how long the data should be stored and what the minimal data quality requirements are (Alpar & Schulz, 2016). Besides this problem it is also important to avoid metadata development shortcuts. The metadata layer should be built closely following best practices; the assumption that it can be fixed in the reports where the queries are defined is wrong and leads to unsustainable SSBI. There are also certain practices that can affect the Software Development Lifecycle (SDLC); either they are too rigid or too lax. Approaches such as Scrum help the developers stay agile and quickly react to changes in requirements; however, if the goals are not properly defined, the developers might end up working against each other, as each one of them has a different viewpoint. In contrast, some very rigid SDLC or IT standards do not embrace the idea of users developing their own queries and reports in the live environment. This means that a nuanced SDLC approach is required that empowers users within their defined boundaries and provides feedback mechanisms for developers (Davis, 2012). It is important that access to data sources is accelerated and simplified. The users need to be able to integrate external and internal data sources, as well as structured and unstructured ones. In many cases users are not allowed to use data freely in a self-service way, which again makes them reliant on the reports traditionally provided by the IT personnel. There should be an organizational process that facilitates the modification of standard reports ('State of Self Service BI Report', 2015).

Being able to quickly access, manage and deploy a data warehouse does not only increase power users' productivity, but also allows them to focus on BI solutions that can be used in a self-service manner, which ultimately increases the value of BI (Lennerholt & Söderström, 2018a). To achieve effective SSBI, clear roles need to be distributed among the BI staff. Accountability plays a big part in this: if everybody is responsible for everything, it is impossible to assign responsibilities. Besides this, there are also substantial security implications related to this challenge. The access to the data should be role-specific, as the highly insightful reports that are produced have a clearly defined target audience and should not fall into the hands of a third party. Even though this process is time-consuming, for many companies that are only now realizing the need for effective data management it offers significant synergy potential to simultaneously review the existing data structures (Davis, 2012). The last challenge that needs to be tackled effectively is governance. This also includes attention to scalability and growth. Modularity plays an important role when selecting a tool, which means that strategic goals need to be defined in advance (Davis, 2012). Governance requires close teamwork between technical and management staff, as the data needs to be organized in a manner that resonates with the business. Defining the actual data terms and definitions is hard for two reasons: you need cross-organizational agreement on the terms, as well as people who are willing to review the proposed terms. Engagement is critical here (Schlesinger & Rahman, 2015). The strategic goals need to be communicated to the technical staff so that they can set the analytical requirements and goals (Weber, 2013). This also entails effective data governance policies that restrict access depending on the user and define when data is suitable for analysis. Even a slight mistake in the selection of the data and its quality criteria can lead to unforeseen problems for decision makers; therefore a solid foundation for SSBI data use is needed, which can be achieved through well-defined policies for data management and governance (Lennerholt & Söderström, 2018).

1.5.1 Levels of Self-Service Business Intelligence

SSBI can be broken down into a total of three levels, which are determined according to the tasks that can be fulfilled. Not all SSBI software supports all of the tasks represented in these levels; however, in many cases missing functionality can be added through add-ons. The levels in question are represented in Figure 5.


Figure 5: Levels of SSBI

Source: Alpar & Schulz (2016).

The first and lowest level is the usage of information. Here, users receive access to information that has been created a priori. They merely get the option to adjust the reports with a few parameters before accessing them. Clearly, this solution is tailored towards casual users who do not have any particular analytical or tool skills. This is enough to derive basic insights, but it makes it impossible for the users to find deeper, more individual insights. The previously explained MAD framework would fall under this level of SSBI; it is important to mention that it allows for "drill anywhere" functionality. Usually the users here move from a more aggregated view towards the operational view of the data (Eckerson, 2009). When introducing data that is not already part of the DW, from the users' perspective this is either intuitive (e.g. an Excel file) or not noticed at all, as the guidance of the analysis is fully prepared by BI specialists (Alpar & Schulz, 2016).

The middle level concerns the creation of information. Here users typically have access to the least aggregated data and can create information from it. The use of SQL has turned out to be too challenging for most business users, so SSBI tools allow creating a new view of the data on the fly; ironically, this means SSBI requires a high degree of standardization (Alpar & Schulz, 2016). Depending on the tool, the casual users can choose whether they want to use flat, multidimensional or relational files (Sallam et al., 2017). This allows them to be independent of the BI specialists in the data selection process. However, depending on the way the data governance is set up, ontology problems might arise, and the users might choose wrong excerpts or aggregates due to their lack of knowledge of the technological background. Some SSBI tools also give access to analytical functions that permit, for example, predictive analytics or text mining. Here again, we have the risk that the users cannot properly state their analysis requirements, as in many cases they lack the statistical background (Alpar & Schulz, 2016).


At the third and highest level, data preparation can be automated, which goes beyond the functionalities of traditional BI systems. Users can integrate new data sources with corporate data to create visualizations. Here it is important to mention that the data sources are only integrated temporarily for the desired purpose. This data can be part of an online resource or be stored locally on the user's computer (Alpar & Schulz, 2016). Issues that can arise here are the use of poor-quality data, as well as users sharing information with other users who should not have access to this data (Stodder, 2012). Additionally, the creation of presentation mashups is enabled by SSBI systems at this level. Here the user combines reusable components and data to personalize reports, dashboards and other BI views (Kobielus, 2009). The efforts that need to be undertaken to make this possible, due to the complexity of data governance, integration and modeling issues, are not visible to the end user. To circumvent these issues, Abelló et al. (2013) suggest a totally new BI architecture. This architecture builds on fusion cubes, which make it possible to temporarily combine multidimensional data from the Internet or other unstructured data sources with the corporate data cubes. The integration of such data, however, is by nature prone to quality problems.

1.5.2 Data Governance and SSBI

As previously inferred in many statements, data governance plays an important role in the effective and efficient usage of SSBI systems. Once a soda fountain is set up in a fast food restaurant, it needs to be cleaned, maintained, checked on a regular basis and refilled. This means that even though it is a self-service kiosk, it is not self-sufficient. Similarly, once an SSBI system is set up, the employees of the IT department need to monitor the data before serious issues arise that might lead business users to lose faith in the numbers. Before going deeper into the subject, however, we need to define what data governance is and why it needs to be considered whenever a new system is introduced into the enterprise.

In a white paper written by an employee at Microsoft, data governance is described as an overarching strategy that encompasses policies, processes, and people to protect data (Microsoft, 2017). It is true that due to the General Data Protection Regulation (GDPR) data governance has recently come to the foreground, especially in combination with compliance regarding the pseudonymization of personal information (Vojvodic, 2017). However, it is much more than that and covers a much wider breadth of topics. From a business administration perspective, governance describes the process of making sure that strategies are developed, checked and executed. Corporate governance, on the other hand, sets the institutional framework (OECD, 2015). From this, you can derive specific requirements and guidelines that are relevant for different departments within the organization, for example the financial department or the IT department. In this sense, data governance translates to data quality management (DQM). Data governance is understood as a framework that defines the duties and responsibilities that help to achieve good DQM (Otto & Weber, 2011). The concept of DQM can further be extended to Total Data Quality Management (TDQM). Similar to many other quality management areas, DQM here has a lifecycle that aims for continuous improvement by introducing a set of best-practice metrics (Eppler, 2006). It should also be mentioned that data management and data governance, even though they are oftentimes used interchangeably, are of a different nature. In simple terms, data management consists of policies, procedures, practices, and tools that are designed to enhance the use of data assets; data governance, conversely, is the enforcement and application of these (Meyers, 2014). Data governance basically sets the framework for how DQM can be executed according to its goals and is to be separated from the operational execution of the activities related to the DQM. Design elements of data governance are roles, duties, and responsibilities. If we were to draw a Venn diagram of these three, the responsibilities would lie entirely in the intersection of roles and duties; the responsibilities are therefore created out of the two other design elements (Otto & Weber, 2011). In many cases it is assumed that these tasks and roles are always the same in any organization, and precisely here lies the error that can arise when using a reference model. Usually these template roles do not fit the requirements of the company exactly and need to be adjusted adequately. The same holds true for the duties. But what exactly are duties and roles, and how do they create the responsibilities?

1.5.3 Data Quality and its Dimensions.

The expression "garbage in, garbage out", or GIGO for short, is very popular among IT professionals and mathematicians. The term refers to the concept that the quality of the output of an information system is only as good as the quality of the input (Sheposh, 2017). This simple reason is precisely why good data quality needs to be achieved; nobody wants to act on faulty information. Data quality issues even occur in our everyday lives. For example, the late delivery of a parcel is oftentimes blamed on the bad quality of the postal service; however, when taking a closer look, it is typically related to data issues. The error can lie with the address, originating from the address database. Data quality issues are thus moving into the public eye more than ever before. In 2003 a European directive was passed that aimed to make the large data assets owned by public bodies more reusable; therefore a large data cleaning campaign was started (Directive 2003/98/EC of the European Parliament and of the Council of 17 November 2003 on the re-use of public sector information, 2003). Similarly, whenever there is a data breach scandal in a large company, it is usually related not only to security but also to data quality, as the intrusion detection software deployed in a company relies on complete and accurate observational data about the company's IT environment. In the light of recent scandals (e.g. Facebook and Cambridge Analytica), Hoeren (2018) argues that the GDPR was motivated by them. It does not offer a complete framework of reference, which means that we are still not close to standardization and harmonization, especially across continents. So, the first reason why assuring good data quality is important is compliance: all organizations want to avoid being criminally sanctioned for inadequate data quality.

A second reason why data quality is important relates to data integration. Data that is stored in different places can be plagued by inconsistencies. This issue is especially important when it comes to integrating data for a data warehouse. In his literature review, Mouroutis (2015) found that poor data quality can have severe financial implications. It might simply translate into more work to consolidate data, hence higher costs and a loss of customers because of bad service quality. It can not only negatively impact the BI process, for example analytics and reporting, but can also cause confusion and chaos in the organization's environment. There is a total of four dimensions that make up the aspects of the umbrella term data quality and the related issues that can come up. Besides the accuracy dimension, which is the one people usually think of first and regard as the most important, there are three more: completeness, time-related dimensions and consistency.

1.5.3.1 Accuracy

In the simplest terms, accuracy answers the question of whether the data correctly reflects the real-world facts it is supposed to represent (Askham et al., 2013). In their book about data quality concepts and methodologies, Batini and Scannapieco (2006a) give a simple example to illustrate accuracy. Assume we have the values v and v'. The value v equals "John" and is correct, whereas the value v' equals "Jhn" and is thus incorrect. Another example would be the false association of v' = "John" with a position within the company that is actually held by v1 = "Bob". The first was an example of bad syntactic accuracy, whereas the second was an example of incorrect semantic accuracy. The second example is syntactically correct, as it holds a true value out of the domain of person names; however, it is semantically incorrect because it does not reflect the real value. A value can therefore be semantically wrong while still being syntactically correct. This simple example shows the difference between the two types of accuracy. It is obvious that a semantic error requires more human involvement, as it can in many cases not be detected automatically. This is generally referred to as the identification problem or record matching. This problem encompasses two main aspects: identification and decision. In terms of identification, records in different sources usually have differing identifiers. The solution here is to map identification codes (when available) or introduce matching keys in order to link the two records. After the two records have been successfully linked, a decision needs to be made as to whether they both represent the same real-world entity (Scannapieco, Missier, & Batini, 2005a).
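The difference between syntactic and semantic accuracy can be illustrated with a short, hedged Python sketch. The reference list of valid names and the position table are hypothetical; the point is only that a syntactic check can be automated against the value domain, while a semantic check needs reference data about the real-world state.

```python
import difflib

VALID_NAMES = {"John", "Bob", "Alice"}          # assumed domain of person names

def syntactically_accurate(value):
    """A value is syntactically accurate if it belongs to the value domain."""
    return value in VALID_NAMES

def closest_valid(value):
    """Suggest the nearest domain value for a syntactic error such as 'Jhn'."""
    matches = difflib.get_close_matches(value, VALID_NAMES, n=1)
    return matches[0] if matches else None

# Semantic accuracy needs a reference describing the real world:
REAL_POSITIONS = {"Sales Manager": "Bob"}       # hypothetical ground truth
stored_positions = {"Sales Manager": "John"}    # stored value is syntactically valid

print(syntactically_accurate("Jhn"), closest_valid("Jhn"))   # False, 'John'
for position, stored in stored_positions.items():
    print(position, "semantically accurate:", stored == REAL_POSITIONS[position])
```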

1.5.3.2 Completeness

Completeness is defined as "the extent to which data are of sufficient breadth, depth and scope for the task at hand" (Wang & Madnick, 1989). There exist three kinds of completeness. Schema completeness is the first one and describes the degree to which entities and attributes are missing from the schema. Population completeness, in contrast, evaluates missing values with respect to a reference population. The last one is column completeness, which is a function of the missing values in a column of a table (Batini & Scannapieco, 2006). Depending on the data model, completeness can be characterized in more detail.
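As a hedged illustration of how the column and population completeness described above could be quantified, the following Python sketch computes both for a small, made-up customer table; the reference population size is an assumption.

```python
# Hypothetical customer table with missing (None) values.
customers = [
    {"id": 1, "name": "Alice", "email": "alice@example.com"},
    {"id": 2, "name": "Bob",   "email": None},
    {"id": 3, "name": None,    "email": "carol@example.com"},
]

def column_completeness(rows, column):
    """Share of non-missing values in one column (column completeness)."""
    return sum(r[column] is not None for r in rows) / len(rows)

def population_completeness(rows, reference_population_size):
    """Share of the reference population actually represented in the table."""
    return len(rows) / reference_population_size

print(column_completeness(customers, "email"))   # approx. 0.67
print(population_completeness(customers, 5))     # assuming 5 real customers -> 0.6
```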

1.5.3.3 Time-related Dimensions

The relation between data and time is very strong, as can be seen in the literature. In terms of the time dimension, we have stable, long-term-changing and frequently-changing data (Eder & Koncilia, 1998). Stable data is, for example, the date of birth; long-term-changing data would be the address; and frequently-changing data could be, for example, the stock levels of a company. All three temporal types of data can be affected by different data quality aspects. There are also three principal time-related dimensions in terms of data quality: currency, volatility, and timeliness. Currency measures how promptly data is updated. It is usually measured by the "last updated" column that is available in many tables. Volatility measures the frequency with which data varies over time; this means that stable data would have a volatility of 0. Data types are inherently characterized by this dimension, hence there is no need for a measurement. Lastly, timeliness measures how up to date the data is and assesses whether the data is made available at the right time. For example, if you had a schedule for your lessons but only received it after the lessons had already started, it would not have been made available in a timely fashion. Measuring this dimension requires more complex measurements and will not be further explained here (Batini & Scannapieco, 2006).
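The currency and timeliness measurements mentioned above can be sketched in a few lines of Python. The "last updated" timestamps, the reference time and the deadline are invented purely for illustration; real measurements would of course be taken against the actual tables.

```python
from datetime import datetime

now = datetime(2018, 10, 1, 12, 0)                      # assumed reference time

def currency(last_updated, reference=now):
    """Currency: how long ago the value was last updated."""
    return reference - last_updated

def is_timely(available_at, needed_by):
    """Timeliness (simplified): data is timely if it arrives before it is needed."""
    return available_at <= needed_by

stock_level_updated = datetime(2018, 9, 30, 8, 0)       # frequently-changing data
print(currency(stock_level_updated))                    # 1 day, 4:00:00

schedule_published = datetime(2018, 10, 2, 9, 0)        # published after ...
lessons_start      = datetime(2018, 10, 1, 8, 0)        # ... the lessons started
print(is_timely(schedule_published, lessons_start))     # False -> not timely
```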

1.5.3.4 Consistency

Consistency answers the question of whether the data can be matched across the different data stores (Askham et al., 2013). It captures whether there is a semantic violation within a set of data. Here integrity constraints play an important role. These constraints stem from relational theory and can be divided into two types: intra- and inter-relational constraints. An example of an intra-relational constraint would be that the date of enrolment of a student must be after his or her date of birth. An example of an inter-relational constraint is that the date of enrolment stored in the school register must be the same as in the school database.
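The two kinds of integrity constraints can be illustrated with a short Python sketch; the student record and the two data stores are hypothetical and mirror the enrolment example above.

```python
# Hypothetical student record and two data stores holding the same fact.
student = {"name": "John", "birth_date": "2000-05-01", "enrolment_date": "1999-09-01"}
school_register = {"John": "2018-09-01"}   # enrolment date in the register
school_database = {"John": "2018-10-01"}   # enrolment date in the database

def intra_relational_ok(record):
    """Intra-relational constraint: enrolment must be after the date of birth."""
    return record["enrolment_date"] > record["birth_date"]   # ISO dates compare as strings

def inter_relational_ok(name):
    """Inter-relational constraint: both stores must agree on the enrolment date."""
    return school_register[name] == school_database[name]

print(intra_relational_ok(student))   # False -> violation within a single record
print(inter_relational_ok("John"))    # False -> inconsistency across data stores
```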

1.5.3.5 Other Considerations

In practice, data quality dimensions are not independent of each other but correlated. Considering one dimension more important than the others when making decisions about data quality can therefore have negative consequences (Batini & Scannapieco, 2006). However, in certain scenarios it is beneficial to give a certain dimension more priority than others. For example, when a university publishes new schedules, it is more important that the schedule is posted in time than that it is a hundred percent correct in regard to consistency, accuracy or completeness. In contrast, considering the use case of a banking application, the other three dimensions carry more importance than timeliness. Another interesting trade-off that can be observed between the dimensions is the one between consistency and completeness. The question here is: "Is it better to have a large amount of inconsistent data, or is it better to have less, but more consistent, data?" This is again, similarly to our earlier trade-off, very use-case specific. Assuming we are working with statistical analysis, it is more important to have complete data than perfectly consistent data; these inconsistencies can be tolerated or in many cases be accounted for by statistical techniques. In contrast, when we want to calculate the salaries of a company's employees, it is more important to have a consistent list than a complete one (Scannapieco, Missier & Batini, 2005). Askham et al. (2013) claim that there are also other considerations. They claim that in order to have good data quality, the data should be flexible, meaning that it is well-structured enough to be repurposed. Another consideration is whether the data offers good value in terms of cost/benefit. Lastly, it is also important to assess whether the data is usable (i.e. is it understandable, simple, relevant, maintainable and at the right level of precision?). Accessibility is also considered by many to be a data quality dimension. It measures how accessible the data is to the users and plays an especially important role in web applications where the users are geographically dispersed and need to access a network before being able to view the desired data (Batini & Scannapieco, 2006). Besides the aforementioned data quality dimensions, there also exist others that are specific to particular domains, such as the archival, statistical, and geographical or geospatial domains.

1.6 Self-Service Business Intelligence and the importance of correct Ontology

Ontology as such is a field of study in philosophy that deals with existence; to be more precise, it is concerned with which things actually do exist. In a broader sense, the study of ontology also examines how and why things exist. The etymological roots of the term lie in the Greek language and come from onto (being) and logia (the study of) (Sheposh, 2016). In his article "The Study of Ontology", Fine (1991) explains that an ontology is made up of all "those items that are, in an appropriate sense, accepted". The idea of ontology was first introduced by Greek philosophers such as Aristotle and was then not discussed for many years until it resurfaced at the end of the Renaissance, at the beginning of the 17th century (Sheposh, 2016). In philosophy, an item is in an ontology because it should be there, not because it was put there (Fine, 1991); hence it is the ontology that endorses the item, not the person. This is where the differences begin with ontology in the context of information technology.

Regarding SSBI, ontology plays an especially important role because the data stored in an enterprise system is not easily analyzed in an ad-hoc manner by knowledge workers who do not have the technical background. For them, it is not only hard to execute the process of analyzing, but in many cases they struggle to understand the data because of the technical way it is rendered within the system. Data stored in a database is always highly domain-specific, and for a decision maker it is almost impossible to have the knowledge of each of the domains required to retrieve actionable information. So, if the desired information cannot be fetched, or the information cannot be well understood, the integrity of an SSBI system is undermined. These two problems are the reason that SSBI systems were only popularized in recent years and that casual users were dependent on predefined reports provided by the IT personnel (Li, Thomas & Osei-Bryson, 2017). Decisions made on such poorly understood or irrelevant information can lead to irreparable damage and hurt the reputation of an organization. So, when a meaningful ontology is used, it provides the background to the domains and makes it easier to gain access to relevant information, which in turn improves effective decision making. The usual enterprise systems landscape consists of many different applications, each of which uses a different data model. ERP systems, for example, use very large and complex data models. The data stored in a Relational Database Management System (RDBMS) follows certain rules, such as normalization. In the case of aggregated data, the data may be stored in star or snowflake schemas. These rules were implemented with the design idea that the storage systems work in a high-performance manner, not with the main purpose of making these systems accessible to business users. Thus, Spahn, Kleb, Grimm, and Scheidl (2008) propose to implement a business-ontology-based abstraction layer that equips users with the business-relevant vocabulary, which is what the users are accustomed to and find more intuitive to use and easier to understand. Considerations that need to be addressed in this layer are technical and semantic integration. More importantly, however, it is imperative to reduce the potentially huge data model to its basic entities and relations. An ontology is a specification of a conceptualization. Concepts can be objects, events, relations and things that are necessary in order to exchange information (Girase, Patnaik, & Patil, 2016). Ontologies have always played an important role in Information Management because they offer information a common representation and semantics. Enabling a common understanding of a domain that can seamlessly be communicated between systems and people is key for effective decision making (Mikroyannidis & Theodoulidis, 2010). Knowledge needs to be formally represented to allow for a well-defined interpretation, and explicitly stated in order to make it processable by machines. It is presented in the form of concepts and their relations. Another aspect is that it is always restricted to a certain domain, or in other words a field of interest (Spahn, Kleb, Grimm & Scheidl, 2008). Let us now consider a simple example to understand ontology in action. The concept of "Address" stores all the relevant knowledge related to addresses. It can be further instantiated if we are looking for the address of a specific customer, which in this case is a specifically identifiable entity. The concept of "Address" holds many attributes, such as "Address.PostalCode". Furthermore, the concept of "Address" is related to "Customer" by a relation that has the property "Has". From this we can derive that an ontology is always constructed of concepts, attributes and properties, which altogether explain the objects in question, their relation to each other and the underlying axioms. Having a semantic relationship between the concepts makes it possible to develop machine systems that can automatically interpret and understand the meaning of the concepts used in ontologies (Martin & Maladhy, 2011). In the aforementioned abstraction layer, for example, it would be unnecessary to implement foreign-key relationships, which build the relations between tables in relational databases; instead, the relations between the entities can be explicitly stated.
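A hedged sketch of how such an ontology fragment might be represented as explicit concept–relation statements (triples) in Python is given below. The concepts "Customer" and "Address", the "Has" relation and the attribute names simply mirror the illustrative example above; the instance identifiers are hypothetical and the representation is not tied to any particular ontology language.

```python
# Tiny ontology fragment as subject-predicate-object triples.
# Concept and relation names mirror the illustrative example in the text.
triples = [
    ("Customer", "Has", "Address"),                     # explicit relation, no foreign keys
    ("Address",  "hasAttribute", "Address.PostalCode"),
    ("Address",  "hasAttribute", "Address.City"),
    ("Customer_4711", "instanceOf", "Customer"),        # hypothetical instance
    ("Customer_4711", "Has", "Address_0815"),
    ("Address_0815",  "instanceOf", "Address"),
]

def related(subject, predicate):
    """Return all objects linked to a subject by a given relation."""
    return [o for s, p, o in triples if s == subject and p == predicate]

print(related("Customer", "Has"))           # ['Address']
print(related("Customer_4711", "Has"))      # ['Address_0815']
```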

In her paper about ontology-based information extraction, Wimalasuriya (2009) explains the rationale why it can be beneficial to use multiple ontologies rather than only one. She clarifies that the conceptualization of a domain can be done differently if there is sufficient reason. There exist two categories of multiple ontologies. The first one consists of multiple ontologies that provide a different perspective. For example, we can have the two classes "Husband" and "Wife", whereas others would simply define the object property "isSpouseof". The second category is ontologies that specialize in certain subdomains. An example can be made in the domain of universities: in some countries there exist universities of applied sciences and regular universities. The two are very different from each other and are better served with more specialized ontologies that are more assertive in identifying the unique characteristics of the respective subdomains. Associated with multiple ontologies are some challenges and opportunities. One is that the use of multiple ontologies can improve recall. Together with precision, recall is a performance metric used in data mining. It describes the number of correctly identified items in comparison to the whole set in question, in the form of a percentage (Simovici & Djeraba, 2008). To prove this, Wimalasuriya (2009) further extends her "Husband and Wife" example. If you are trying to identify gay marriages, the classification into "Husband" and "Wife" is prone to have a low recall, whereas the object property "isSpouseof" would fare much better in that regard, though not when the task at hand is to identify heterosexual marriages. Additionally, as mentioned earlier, the use of multiple ontologies facilitates different perspectives on a domain; therefore, a query on the SSBI system can provide outputs from different perspectives. For example, we will be able to answer questions such as "Is person X a husband?" or "Who is person X's spouse?". To make this possible it can be necessary to translate an instance from one ontology to another. This is achieved by mapping concepts between ontologies. This process of ontology alignment needs to be undertaken when the sources should be consistent with each other but are still required to be kept separate. The process of ontology mapping is defined as transforming entities from a source ontology to the desired target ontology by using semantic relations. Mapping plays an important role when two ontologies need to be merged, or in data integration (Choi, Song, & Han, 2006). By closely observing the use of the ontology abstraction layer, special techniques could be applied to discover mappings between different ontologies (Wimalasuriya, 2009).
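Since recall is used here as the argument for multiple ontologies, a minimal Python sketch of the metric may help. The marriage records and the two extraction results are entirely made up; they only show how a coarser property such as "isSpouseof" can recover pairs that the "Husband"/"Wife" classification misses.

```python
# Hypothetical gold standard: all actual spouse pairs in a document collection.
actual_pairs = {("Ann", "Eve"), ("Tom", "Sue"), ("Max", "Jim")}

# Pairs found by an extractor based on the 'Husband'/'Wife' classes:
found_husband_wife = {("Tom", "Sue")}
# Pairs found by an extractor based on the generic 'isSpouseof' property:
found_is_spouse_of = {("Tom", "Sue"), ("Ann", "Eve"), ("Max", "Jim")}

def recall(found, actual):
    """Recall: share of the actually relevant items that were identified."""
    return len(found & actual) / len(actual)

print(recall(found_husband_wife, actual_pairs))   # approx. 0.33 -> same-sex pairs missed
print(recall(found_is_spouse_of, actual_pairs))   # 1.0
```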

In several papers, Mikroyannidis and Theodoulidis (2012) propose the Heraclitus II ontological framework. Heraclitus was an ancient Greek philosopher whose theory differed from that of his more mechanistic contemporaries, as he argued that the soul pervades all parts of the universe (Badessa, 2013). The framework sees ontologies as a semantically rich knowledge base for Information Management and proposes a methodology for the evolution and management of this knowledge base. In their survey, the authors examined existing approaches in Information Management and found some shortcomings. They found that ontology integration suffers significantly and that layering is often lacking. Additionally, it turned out that other approaches do not offer enough temporal semantics, which makes tracking the evolution of an ontology a hard task. Regarding ontology evolution, many key issues such as consistency preservation and change propagation seem not to be addressed adequately. Heraclitus II aims to eliminate these shortcomings.

Time is modeled bitemporally in this framework, meaning that it is divided into valid time and transaction time. Valid time describes when a fact is true in the modeled reality, which can lie in the past, present or future, and is usually specified by the ontology author. In contrast, the transaction time describes whether the fact is current in the knowledge base of the information system and can currently be retrieved; it is provided by the IM system and cannot be changed (Mikroyannidis & Theodoulidis, 2012). The framework can be represented in the form of a pyramid that consists of four layers, which are all connected by intra- and inter-layer ontology mappings. The lower layers describe more generic ontologies, while the higher ones are used for specific uses in the IM system. The layers are: Lexical Ontology, Domain Ontology, Data Source Ontology, and lastly Application Ontology. The Lexical Ontology layer contains domain-independent ontologies of a lexicographical nature. The ontologies included in this layer can be used to model all domains; lexicographical issues such as multilingualism are dealt with here. Ontologies for a specific domain are handled in the Domain Ontology layer; it is crucial that the data belongs to a certain domain. The collected data can come from structured and unstructured sources, such as the corporate database, APIs, or news publications. The Data Source Ontology layer specifically deals with the organization of such information (Mikroyannidis & Theodoulidis, 2006). Lastly, on top of the ontology pyramid, we have the Application Ontology layer, which basically represents the software organization of an IM system. Here it is possible to connect the software structures with ontological data, which facilitates the ontological software development process. The bitemporal modelling of time is important because it allows for a better evolution of the ontology: it allows pro-active as well as retroactive changes to be captured in the knowledge base. A retroactive change happens when the time at which the fact became true lies before the transaction time of the fact, while a pro-active change happens when the valid time is greater than the transaction time. These temporal semantics, which are used for the ontology evolution, have been part of the Temporal Information Management framework (TAU) (Mikroyannidis & Theodoulidis, 2010). Consistency preservation is another goal of ontology evolution. This is done in two ways, semantically and structurally. These techniques help to deal with issues that arise when the structure or semantics of an ontology become obsolete due to change. A change is then carried over to the other layers through the intra- and inter-ontology mappings (Mikroyannidis & Theodoulidis, 2012).
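A hedged sketch of the bitemporal bookkeeping described above: each fact carries a valid time and a transaction time, and comparing the two tells us whether a change was recorded retroactively or proactively. The fact itself and the dates are invented for illustration and are not taken from the Heraclitus II papers.

```python
from datetime import date

# A fact with bitemporal annotations (valid time vs. transaction time).
fact = {
    "statement": "Department X is renamed to Department Y",   # hypothetical fact
    "valid_time": date(2018, 1, 1),        # when it is true in the modeled reality
    "transaction_time": date(2018, 3, 15), # when it was recorded in the knowledge base
}

def change_type(f):
    """Retroactive if the fact became true before it was recorded, proactive otherwise."""
    if f["valid_time"] < f["transaction_time"]:
        return "retroactive change"
    if f["valid_time"] > f["transaction_time"]:
        return "proactive change"
    return "recorded at the time it became valid"

print(change_type(fact))   # 'retroactive change'
```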

2 METHODOLOGY, LIMITATIONS, AND EXECUTION OF THE EXPERIMENT

The theoretical framework that will guide this piece of work is the Technology Acceptance Model (TAM). The model is based on information science and describes how a system's design influences the user's acceptance of a computer-based system. The model proposes that when a user is presented with a new technology, there are several factors that determine how and when they will use it. The two main factors to be mentioned here are perceived usefulness and perceived ease of use. The latter explains how much a new user perceives the system to be free from effort, whereas the former deals with the user's perception of the enhancement the system poses to their job performance (Davis, 1985a). In 2000 Venkatesh and Davis proposed an extension of the old TAM. This new model also takes social influence and cognitive instrumental processes into consideration. The higher number of variables would require a much larger sample size to produce relevant results. Nonetheless, three additional variables pose interesting questions that make sense to ask in the context of our exploratory analysis, even with a small sample size. These variables are enjoyment, perceived characteristics of the output, and anticipated use. Additionally, it will also be asked how relevant such a tool would be for the participant's workplace. The Unified Theory of Acceptance and Use of Technology was formulated by Venkatesh, Morris, Davis, and Davis in 2003. Here the proposed model was composed of a total of eight other models that were popular at the time. The paper showed that the model outperforms each of the models it took its influences from; however, it again has many variables, which makes a reliable result harder to achieve with a small sample size.

The main goal of the experiment will be to assess how non-IT personnel can handle an SSBI system. Along the way, we might also find out how relevant such a system is for them, or whether they believe that they will use it in the future. The experiment itself will be conducted as a quasi-experiment, as it cannot be done in a "clean" environment. The reason for this is the human involvement, which is why this kind of experiment was chosen. The results of the experiment will also be compared to research papers on the topic of SSBI, which will help to validate or disprove the findings in the discussion section. The internal variables that will determine the user motivation are: Perceived Usefulness (U), Perceived Ease of Use (E), Perceived Characteristics of the Output (O), Enjoyment (J) and Anticipated Use (A). Together they allow observing the actual system use. The first four variables make up the cognitive response, the fifth the affective response, and the actual system use is the behavioral response (Davis, 1985a).
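To make the handling of these variables concrete, the sketch below shows one simple way the questionnaire responses could be aggregated per construct. The item codes, the 7-point scale and the example answers are assumptions for illustration, not the actual instrument used in the experiment.

```python
# Hypothetical 7-point Likert answers of one participant, grouped by TAM construct:
# U = usefulness, E = ease of use, O = output characteristics, J = enjoyment, A = anticipated use.
responses = {
    "U": [6, 5, 6],
    "E": [4, 5, 4],
    "O": [5, 5],
    "J": [6, 6],
    "A": [5],
}

def construct_scores(answers):
    """Average the item answers per construct to obtain one score each."""
    return {construct: sum(items) / len(items) for construct, items in answers.items()}

print(construct_scores(responses))
# e.g. {'U': 5.67, 'E': 4.33, 'O': 5.0, 'J': 6.0, 'A': 5.0}
```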

The experiment will be conducted in the form of a business problem in Microsoft Power BI. MS Power BI was chosen because most users are already familiar with the design language of MS products. The database that will be used for the experiment is the AdventureWorks sample database from Microsoft, as it provides a realistic level of detail in terms of a company ('AdventureWorks for SQL Server 2016 CTP', 2018). The tables it contains cover Sales, Purchasing, Person, Production, and HR. In order to integrate the provided data, I will use SQL Server Integration Services to connect a working prototype of the now transformed relational database to Microsoft Power BI on the MS Azure platform. Since the focus of this paper lies on the front end of the BI architecture, the data warehouse was prepared in advance as part of another project and in this case simply serves as an exemplary data source. For the data analysis problem, the users will need to follow a few steps that cover the basic functionalities of MS Power BI. To assure consistency, the participants will have 20 minutes for the experiment; this should also help to increase the response rate. Before going into the peculiarities of the experiment, it needs to be explained why MS Power BI was the solution selected for its execution.

The decision is based on Gartner's Magic Quadrant for Analytics and Business Intelligence Platforms report from 2018. Gartner evaluated several vendors with regard to self-service and found Power BI to be the best solution in that respect. This does not hold true for the whole range of Business Intelligence solutions that Microsoft offers; however, that is not important in the context of this master's thesis.


Figure 6: Gartner Magic Quadrant

Source: Howson et al. (2018).

Figure 6 shows the Magic Quadrant. It can be seen that Microsoft is positioned as the leading vendor, which has a lot to do with the Power BI solution it offers. The method used incorporates 15 product capabilities across five different use cases and is therefore rather comprehensive; explaining all of them would go beyond the scope of this thesis. Power BI offers data discovery, data preparation, augmented analytics and interactive dashboards in a single product. It can run in the cloud (which is the case here) or, since 2017, on-premises on the Power BI Report Server. Power BI is particularly interesting for many companies because it is comparatively cheap. A Pro license for a single user costs 9,99 € per month. There is also a Premium option that offers greater cloud storage and faster data refresh frequencies; it is more expensive (4.995 € per month) but does not require licenses for individual users.

In terms of customer experience, Microsoft scored the highest together with Sisense. This was achieved in particular by making information and usable insights available to a very large number of users. Another big factor that helped Microsoft become the leader is the ease of use and visual appeal that Power BI offers; for 14 % of the customers, this was the main buying criterion. Ease of use is achieved through a variety of features implemented in the solution, particularly the cloud implementation. One of its most important advantages, however, is its visual appeal, which is achieved by a design language that should be familiar to most users. The ease of use and the visual appeal made it especially suitable for this experiment. Power BI not only offers the necessary analytical capabilities to be examined, but is also easy to deploy via the Azure service and easy for first-time users to get into (Howson et al., 2018). Since the experiment should not result in hours of effort and confusion for the participants, Power BI seemed like a good solution to cut down the experiment time.

For the experiment it was necessary that the participants work with a data structure containing realistic data; hence the AdventureWorks database was used. The database is freely available and exists in different versions; the version used in the experiment is CTP3 (‘AdventureWorks for SQL Server 2016 CTP’, 2018). Out of the database, a company called AddBike was created, which serves as the example company for the experiment. It is a manufacturer of metal and bicycle components. The products are sold through two main sales channels: the online store and a variety of distributors. The customers are located across three continents: Australia, Europe, and North America. Over the period from 2011 to 2014, North America brought in 72% of overall turnover, followed by Europe with 18% and Australia with 10%, making North America by far the strongest market. To give the reader a quick overview of the contents of the database, the next paragraphs present some basic data from it.

Figure 7: Sales by Territory (2011–2014)

Source: Own Work.

When looking at the average sales amount per customer in the different territories, it also becomes apparent that customers in North America are willing to spend more than those situated in the other territories.


Figure 8: Average Sales Amount per Customer by Territory (2011–2014)

Source: Own Work.

AddBike's goal is to better target its best customers to expand its market share across the different territories, extend product availability by offering its products on an external website, and reduce the cost of sales by lowering production costs.

Figure 9: Sales by Quarters (2011–2014)

Source: Own Work.

Figure 9 not only presents the sales by quarter from 2011 to 2014 but also explains why the overall sales for 2011 and 2014 in Figure 7 are significantly lower than those for 2012 and 2013: the provided database is missing the first quarter of 2011 and the last two quarters of 2014. AddBike has two kinds of clients, individuals and stores (such as wholesalers). Together they make up 19.185 clients in total, with 18.484 and 701 clients respectively.


Figure 10: Direct Sales and Sales through Resellers (2011–2014)

Source: Own Work.

The column chart presented above (Figure 10) compares the two kinds of customers: individuals who buy directly from AddBike (direct sales) and stores that buy from AddBike. On average over the span of the 13 quarters, the sales to resellers were 15,42% stronger than the direct sales. This is a good indication that AddBike needs to revise its B2B marketing strategy.

An integral part of the execution of the experiment was getting the data warehouse to run. The first step was translating the relational database into a data warehouse by modeling it and running the ETL process. The modeling part and its related steps were already explained in the chapter on data modeling. The ETL process was done with the help of MS SQL Server Integration Services (SSIS). The whole experiment was conducted online; hence the data warehouse of AddBike also needed to be online, which was achieved via the MS Azure platform. Once the data warehouse was set up, it only needed to be uploaded to the service. This was done with MS SQL Server Management Studio (SSMS); the Azure integration in the software is quite intuitive and did not pose a challenge. I also created a new user, who was defined in the Power BI dataset as the user creating the report. To do so, I first had to create a new login and define a role with read-only authorization, which was linked to the newly created user. The last step needed to make the data warehouse presentable was adjusting the technical names so that they make ontological sense to the participants. If this had been done in SSMS via SQL statements, it would have led to conflicts with the relationships between the entities. MS Power BI offered an easy solution to this problem.
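The following is a minimal sketch of how such a read-only reporting account can be set up against the Azure-hosted warehouse with pyodbc; it is not the exact script used for the thesis, and the server, database, login names and passwords are hypothetical placeholders.

```python
# Sketch only: server, database, login and passwords are hypothetical placeholders.
import pyodbc

def connect(database: str) -> pyodbc.Connection:
    return pyodbc.connect(
        "DRIVER={ODBC Driver 17 for SQL Server};"
        "SERVER=addbike-dwh.database.windows.net;"   # hypothetical server name
        f"DATABASE={database};"
        "UID=dwh_admin;PWD=<admin-password>",        # hypothetical admin account
        autocommit=True,                             # run the DDL outside a transaction
    )

# On Azure SQL, the server-level login is created in the master database ...
with connect("master") as conn:
    conn.cursor().execute(
        "CREATE LOGIN powerbi_reader WITH PASSWORD = '<strong-password>'"
    )

# ... while the database user and its read-only authorization live in the warehouse.
with connect("AddBikeDW") as conn:                   # hypothetical warehouse name
    cur = conn.cursor()
    cur.execute("CREATE USER powerbi_reader FOR LOGIN powerbi_reader")
    # Membership in db_datareader grants SELECT on all tables, i.e. read-only access.
    cur.execute("ALTER ROLE db_datareader ADD MEMBER powerbi_reader")
```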


Figure 11: Adjustment of the Ontology

Source: Own Work.

Power BI offers a function in which synonyms can be set for the dimension and fact tables, as well as for the fields inside them. Additionally, fields can be excluded so that they are not visible to the end users. In this way, for example, all the IDs and Rowguid values, which are irrelevant for the end users, were excluded. This can only be done in the desktop version. In Figure 11 it can be seen that the dimension “Dim_Customer” has been renamed to “Customer” and that the field “isStore” is now the “Sales Channel” field. Many fields of a technical nature were also excluded, for example “Name_Style” or the surrogate key “SK_CustomerID”.
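Purely as an illustration of this mapping (the thesis did the renaming and hiding in the Power BI Desktop model view, not in code), the sketch below mirrors the same idea for the customer dimension in pandas; the column values are made up.

```python
# Illustration only: Power BI's model view was used in the thesis; this pandas
# sketch just mirrors the mapping from technical names to business terms.
import pandas as pd

dim_customer = pd.DataFrame(
    {
        "SK_CustomerID": [1, 2],                        # surrogate key (technical)
        "Name_Style": [0, 0],                           # technical flag
        "isStore": [False, True],                       # exposed as "Sales Channel"
        "CustomerName": ["Ann Smith", "Bike World Ltd."],
    }
)

# Hide purely technical fields from the end user ...
presentable = dim_customer.drop(columns=["SK_CustomerID", "Name_Style"])
# ... and translate the remaining technical names into business terms.
# (In Power BI the table itself was also renamed from Dim_Customer to Customer.)
presentable = presentable.rename(columns={"isStore": "Sales Channel"})
print(presentable)
```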

At the beginning of the PDF file in which the experiment and the tutorial were included, it was briefly explained to the users what the purpose of the experiment was and how it would be conducted. Before the users could get into the experiment, it was necessary that they first understood what Power BI is and how it works. To this end I created an example dashboard, which included all the functions (and more) the users would need during the experiment. The dashboard incorporated several visualizations in Power BI and allowed drilling through to individual report pages for country, sales channel, best customers, and colors used for products (to show a bit of variety). The example dashboard can be found in Appendix III. A few hints on how to navigate the example report were also included. During the experiment, the users were confronted with the data mart of the sales department and had to use it to create the outputs formulated in the steps. The steps were not too specific in their instructions, as that would have defeated the purpose of the experiment by turning it into a simple “click-exercise”. The experiment itself was given a time limit of 20 minutes, which was set after some initial testing with friends and colleagues; the experiment should not take too long, in order to increase the response rate of the potential participants by avoiding scaring them off.

Before the users were exposed to the actual experiment, I explained to them six basic functions that they would need in order to get through it. While doing so, I also familiarized them with some of the basic concepts, such as drill-down, and acquainted them with the vocabulary used in MS Power BI. The six functions ranged from simple to somewhat more complicated: how to add data, how to filter, how to visualize, how to add pages, how to drill down, and how to drill through.

As the online version of Power BI was used, not all functions were available; for example, users in the Power BI service cannot create hierarchies. This presented a small hurdle, because without a hierarchy a drill-down is not possible; therefore, the hierarchies had to be created in advance and pointed out during the experiment so that the users would be aware of them. Additionally, there is no option to use Power Query, which is an extraction and transformation engine that allows users to do mashups. Another thing that would have made sense in the experiment, but did not fit the format of an experiment in which the user comes into first contact with the software, was the use of custom measures. This functionality is very useful in Power BI because it lets you create new facts. However, it also has its limits; for example, it is not possible to normalize data in the online version via a formula, because the data needs to be fully imported and direct query is not supported. To create custom measures, the user needs some knowledge of DAX (Data Analysis Expressions), a library of commands and operators that can be combined to create expressions and formulas. Another thing that could not be tested within the format of the experiment was data preparation, because the online version of Power BI does not allow the user to access the dataset. It lets him or her create reports from a dataset, but does not allow changing the relationships between the entities or seeing the tabular view of the data. This also made data modeling impossible. All these functions are available in the desktop version.

Once again, it should not be necessary for the participants to download extra software onto their computers, as this would most likely bring down the response rate. By far the largest limitation was the number of participants. Since the experiment is quite time-intensive and caters to a specific target group, finding suitable participants presented a big challenge. In the end, there were 14 participants. In further research, this should definitely be done on a larger scale and with more available time. The experiment was followed up by a survey that used the TAM2 model as a framework for the questions. The whole survey can be found in Appendix II.


3 RESULTS

The results of the experiment are mixed. It seems that many people do not want to work with MS Power BI or cannot fully grasp the concept of SSBI.

Figure 12: Box Plot of the investigated Variables

Source: Own Work.

Figure 12 represents a box plot of the factors that were analyzed in the survey. The black line in the middle of each box is the median and separates the 2nd and 3rd quartiles; the dot in the boxes is the average value. The scale used in the survey was a Likert scale from 1 to 7, in which 1 always represented “Strongly Disagree” and 7 always represented “Strongly Agree”. During the survey, it was also asked how confident the participants were about their answers. The confidence ranged from 1 to 7 and was used to adjust the answer by a parameter ranging from -15 to +15 percentage points. The rationale behind this was to account for how confident the participants were about their answers, especially considering that they only had brief contact with the software as well as with the concept of BI. When looking at the box plots, it becomes clear that in many cases the median is far off the mean: where the median lies below the mean, there are outliers on the higher end of the scale, and where it lies above the mean, there are outliers on the lower end. This becomes especially clear with Perceived Ease of Use and Perceived Characteristics of the Output, and indicates that some of the participants found it particularly complicated to handle the software, while some found the outputs that Power BI is able to produce especially good.
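A minimal sketch of the confidence adjustment described above is shown below, assuming a linear mapping between the two anchor points given in the text (confidence 1 lowers the answer by 15 %, confidence 7 raises it by 15 %); the exact mapping used for the thesis data may differ.

```python
# Sketch of the confidence-based adjustment of Likert answers.
# Assumption: linear interpolation from confidence 1 (-15 %) to confidence 7 (+15 %).

def adjust_answer(answer: float, confidence: int) -> float:
    """Scale a 1-7 Likert answer by a confidence-dependent percentage."""
    adjustment = -0.15 + (confidence - 1) * (0.30 / 6)   # in the range -0.15 .. +0.15
    adjusted = answer * (1 + adjustment)
    return max(1.0, min(7.0, adjusted))                  # stay inside the 1-7 scale

if __name__ == "__main__":
    print(round(adjust_answer(5, 2), 2))   # low confidence:  5 * 0.90 = 4.5
    print(round(adjust_answer(5, 7), 2))   # high confidence: 5 * 1.15 = 5.75
```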


Figure 13: Funnel with the Success Rate of the Experiment

Source: Own Work.

Figure 13 represents the actual results of the experiment. The upper numbers are the individual steps the participants had to go through. It must be mentioned that steps three and six were the easiest, because the participants simply had to create a new page in the report, hence the high success rate. For the individual questions, please consult Appendix I. Curiously, the “Drill Down and Filter” part of the experiment has a higher success rate than the “Basics” part. All the participants managed to create a simple table listing all the products, but almost half of them failed to count the items in the table. The reason for this might be that the count function is quite hidden in the software; it requires the users to understand that the data can only be manipulated once the field of the table is in the value section of the visualization.

The second section dealt with understanding hierarchies and using filters, in this case to compare the net profit of 2013 and 2014; additionally, the Matrix visualization had to be used. Step four, in which the participants had to use the matrix with the Products hierarchy, was seemingly easy for most. Using the filter, however, was considerably harder. Here it is necessary to understand the three levels of filters Power BI offers: the visual-level filter, the page-level filter, and the report-level filter. If two different page-level filters are used for the two tables, one overwrites the other; thus the participants had to use the visual-level filter for one of the years in each of the tables.

The last section was the hardest one and contained the most points; it is also the part with the lowest success rate of 46%. The major reason for this can be assumed to be that the participants ran out of time. Step seven was basically the same as step four, but asked the participants to use net profit with the countries instead of the product categories; only half of the participants managed to do so. Step eight, which has a success rate of 50%, required the users to use a different visualization, called “Slicer”, to filter by a year. After this, the participants were asked to drill through to page two (which is part two of the experiment) for Germany. To successfully do so, the drill through needs to be activated for the visualization on page two and the “Country Name” field added as the value for the drill through. Out of the 14, only two managed to do so; the reason that so few managed this step is probably that you need to go back to page two. The last step was particularly hard. Similarly to step two, in which the participants had to use the count function, they now had to select the top five cities in Germany. To do so, the filter type needs to be changed to “Top N”, which only works for nominal values. However, just like the count function used in step two, it is hard to find.

Table 1: Overall results averaged over the Demographic Data

Age             No. of Participants    U      O      J      E      A
18-24           3                      3,48   4,42   4,52   4,50   5,24
25-34           7                      4,13   5,05   6,02   4,79   4,48
35-44           2                      3,00   1,85   1,85   2,53   3,95
55-64           2                      3,92   6,05   6,42   5,55   5,93
Total/Average   14                     3,80   4,60   5,16   4,52   4,77

Field of Work   No. of Participants    U      O      J      E      A
Accountancy     7                      4,26   4,07   5,22   4,21   4,42
Distribution    2                      2,92   4,65   5,21   3,94   4,93
Other           5                      3,51   5,33   5,05   5,18   5,21
Total/Average   14                     3,80   4,60   5,16   4,52   4,77

Source: Own Work.

The table above shows the averages of the investigated factors for the different kinds of information available about the participants. The abbreviations are as follows: Perceived Usefulness (U), Perceived Ease of Use (E), Perceived Characteristics of the Output (O), Enjoyment (J) and Anticipated Use (A). The participants were put into categories regarding their field of work and age. Half of the participants fall within the age bracket of 25-34, whereas the rest is fairly evenly split across the other age brackets. Regarding the field of work, half of the participants fall within the Accountancy category, 14 % into the Distribution category, and 36 % into the Other category. The two persons included in the Distribution category work in sales; the Other category contains people ranging from pharmaceutical consultants to human resources personnel. The overall number of participants was 14. The total averages of the different variables can also be seen here. The only one below 4 is Perceived Usefulness, which means that, in general, the participants did not believe that a SSBI solution would help them improve their performance at the workplace. The variable with the highest value is Enjoyment with 5,16.
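The group averages in Table 1 can be reproduced with a simple group-by aggregation; the sketch below shows the idea with pandas on a few hypothetical rows standing in for the 14 adjusted survey responses.

```python
# Sketch of how the averages in Table 1 are computed; the rows are hypothetical.
import pandas as pd

survey = pd.DataFrame(
    {
        "Age":   ["18-24", "25-34", "25-34", "35-44"],
        "Field": ["Accountancy", "Accountancy", "Other", "Distribution"],
        "U": [3.5, 4.1, 4.3, 3.0],
        "O": [4.4, 5.0, 5.1, 1.9],
        "J": [4.5, 6.0, 6.1, 1.8],
        "E": [4.5, 4.8, 4.8, 2.5],
        "A": [5.2, 4.5, 4.5, 4.0],
    }
)

variables = ["U", "O", "J", "E", "A"]
by_age = survey.groupby("Age")[variables].mean().round(2)      # upper half of Table 1
by_field = survey.groupby("Field")[variables].mean().round(2)  # lower half of Table 1
overall = survey[variables].mean().round(2)                    # Total/Average row

print(by_age, by_field, overall, sep="\n\n")
```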


Figure 14: Correlation Matrix of the Variables

Source: Own Work.

In this correlation matrix, it can be seen how the variables are correlated with each other. All of them are positively correlated or, in the case of a 0, not correlated at all. The latter case can, for example, be observed between Perceived Usefulness and Anticipated Use. This point in particular is hard to explain, unlike the strong positive correlation of 0,8 between Ease of Use and Enjoyment. Anticipated Use has its strongest positive correlation with Perceived Characteristics of the Output. Between Perceived Usefulness and Ease of Use there is also only a very weak positive correlation, from which it can be concluded that there is no linear relationship between the two variables. Together with Ease of Use, Enjoyment shows the highest values in terms of positive correlation. The fact that these two are also the factors with the highest overall scores and the widest spread of values supports this observation. Perceived Characteristics of the Output also shows a large spread of values and, similarly to the two previously mentioned variables, a strong positive correlation with all variables except Perceived Usefulness. This can probably be explained by the small number of participants in the sample; if the number were higher, there would be a larger variance within the values of the variables.
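The matrix in Figure 14 can be reproduced with a standard pairwise Pearson correlation; the sketch below shows this with pandas, where the column names follow the five survey variables and the few example rows are hypothetical placeholders for the 14 responses.

```python
# Sketch of the correlation matrix computation; example values are hypothetical.
import pandas as pd

responses = pd.DataFrame(
    {
        "Perceived Usefulness":   [3.5, 4.2, 2.9, 5.1, 3.8],
        "Perceived Ease of Use":  [4.0, 5.5, 3.1, 4.8, 4.3],
        "Output Characteristics": [4.4, 5.0, 2.0, 5.9, 4.6],
        "Enjoyment":              [4.6, 6.0, 1.9, 6.3, 5.0],
        "Anticipated Use":        [5.2, 4.5, 4.0, 5.8, 4.7],
    }
)

# Pairwise Pearson correlations between the five TAM2-based variables.
correlation_matrix = responses.corr(method="pearson").round(2)
print(correlation_matrix)
```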

Figure 15: Clustering of the relevance of Numeric Charts in connection with Avg. Perceived Usefulness

Source: Own Work.

One of the big advantages of Microsoft Power BI is that it makes the visualization of data more intuitive. It is therefore interesting to see whether any clusters can be found regarding the relevance of numeric charts at the workplace. On the x-axis of Figure 15 is the relevance of numeric charts for the participants in their jobs, and on the y-axis their average score from the perceived usefulness section of the survey. The 14 participants were put into three different categories: Accountancy (7), Distribution (2) and Other (5). To cluster the participants, a Power BI visualization using the k-means algorithm based on Euclidean distance was used; the algorithm minimizes the distance within each cluster and maximizes the distance between the clusters. Due to the low number of participants, the outcome of the clustering serves more as a scatter plot, as the clusters can easily be seen with the naked eye. For the participants in “Cluster 1”, the perceived usefulness is low, as is the relevance they assigned to numeric charts in their work environment; this cluster contains one participant from each field of work. The participants in the third cluster generally assign a lot of relevance to numeric charts at work. However, their values for perceived usefulness are lower than four, which means that they do not believe that SSBI would help them in their work. This is also the largest cluster, with a total of 7 participants, which is half of the sample. Cluster two describes participants for whom numeric charts are relevant at the workplace and who also believe that SSBI would help them to excel in their work; most of the participants in “Cluster 2” are in the Accountancy field.
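The clustering behind Figure 15 was done with a Power BI clustering visual; the sketch below reproduces the same idea with scikit-learn's k-means (Euclidean distance, three clusters), where the feature values are hypothetical placeholders for the 14 participants.

```python
# Sketch of the k-means clustering (Euclidean distance) behind Figure 15,
# here with scikit-learn instead of the Power BI visual; data is hypothetical.
import numpy as np
from sklearn.cluster import KMeans

# Column 0: relevance of numeric charts at work (1-7),
# Column 1: average Perceived Usefulness score from the survey (1-7).
participants = np.array(
    [
        [6.0, 4.8], [6.5, 5.1], [5.5, 4.2], [6.0, 4.5],
        [2.0, 2.1], [1.5, 2.8], [3.0, 2.5],
        [6.0, 3.2], [5.5, 3.0], [6.5, 3.6], [6.0, 3.8],
        [5.0, 3.1], [4.5, 2.9], [5.5, 3.3],
    ]
)

# k-means minimizes the within-cluster sum of squared Euclidean distances,
# which at the same time separates the cluster centres from one another.
kmeans = KMeans(n_clusters=3, n_init=10, random_state=42)
labels = kmeans.fit_predict(participants)

print("Cluster assignments:", labels)
print("Cluster centres:\n", kmeans.cluster_centers_.round(2))
```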

Figure 16: Clustering of total steps completed in connection with Avg. Ease of Use

Source: Own Work.

One very important aspect was to find out whether the participants were able to use the software. Ease of Use represents how easy working with the software was for them, whereas the total number of steps completed describes their actual system use. The age was also included here as a label for the data points; however, it does not offer any insight into why some participants rated Ease of Use lower than others. It also has to be mentioned that some of the participants went over the time limit of 20 minutes. This might explain why, especially in “Cluster 4”, some participants completed many steps of the experiment but felt that the software was not easy to work with. With five out of 14 participants, “Cluster 4” is the largest, followed by clusters one and three, which contain four participants each. All the participants in cluster four completed more than seven steps but rated the Ease of Use comparatively low. There is also one outlier, represented by “Cluster 2”. The participants in “Cluster 1” rated the Ease of Use low, which is also reflected in the steps they completed; most of them only managed to complete three steps. The next section helps to understand the findings by putting them into a theoretical context and discussing their meaning.


4 DISCUSSION

The survey was structured after the Technology Acceptance Model in its second version, as proposed by Venkatesh and Davis (2000). The experiment itself was designed in a way that used all the capabilities of the online version of MS Power BI. These functions are sufficient for casual users to build their reports in a business setting; the experiment was thus not geared towards power users, the second main user category (Eckerson, 2014). Before the participants started, it was made sure that the data quality was acceptable; the accuracy, completeness, time-related dimensions and consistency were checked (Scannapieco, Missier, & Batini, 2005). The ontology of the data fields was also transformed into natural language. It was important to make the report available to a larger crowd of people, which is why doing it online was the easiest solution to the problem; however, this did not allow the participants to try out the data preparation or mashup capabilities of MS Power BI. Generally, the capabilities of the Power BI online version are on the second level of SSBI. Power BI allows the use of parameterized reports that can be accessed by users, but also lets them use the data source in question to build their own visualizations with the data, and thus create new information. While they cannot use the DAX language, they are able to import visualizations from the Marketplace. Therefore, analytical requirements can also be met for most self-service users. However, what is missing for the online version of Power BI to reach the third level of SSBI is the ability to do mashups between locally saved data and the database (Alpar & Schulz, 2016).

During the experiment, the participants were exposed to the front-end of the BI architecture, consisting of the star schema and the BI application that came in the form of MS Power BI (Kimball & Ross, 2013). The variables inspected in the survey were Perceived Usefulness (U), Perceived Ease of Use (E), Perceived Characteristics of the Output (O), Enjoyment (J) and Anticipated Use (A). Davis (1985a) defines the benefit of the technology in question as the product it produces; in our case, the product produced by the technology would be better decision making. In this context, it is important to evaluate what Power BI brings to the table. The Gartner Magic Quadrant states that it makes visual data exploration easy to set up and intuitive (Howson et al., 2018). This also includes the process of working with the software, which is best captured by ease of use. Nevertheless, to explore data, the output produced by the software needs to be of high quality, so that it is understandable for the user and the audience it might be presented to; this is why the variable describing the characteristics of the output (O) is so important. Ease of Use should also be linked to Enjoyment: a system that is easier to use is generally more enjoyable to use, as it cuts down frustration and loss of time. When assessing anticipated use, not only perceived usefulness, an extrinsic factor, plays a role, but also enjoyment. Enjoyment describes intrinsic motivation; unlike extrinsic motivation, it does not offer any apparent reward except the activity itself. In other words, such activities are not means to an end but ends in themselves (Davis, 1985a). In Figure 14, which presents the correlation matrix, all the links just described can be observed, except those involving Perceived Usefulness. This variable should have a strong correlation with Ease of Use and the Perceived Characteristics of the Output; with the latter, the correlation is non-existent, and with Ease of Use the correlation is also basically not there, because the value is 0,10. What can be observed is that Ease of Use is strongly correlated with Enjoyment, with a value of 0,8, which is in harmony with the theoretical background. Anticipated Use also poses some problems that should not be there according to the theory: its correlation with Perceived Usefulness does not exist, as the value between the two variables is 0, and Enjoyment should be strongly correlated with it, but with a value of 0,2 this is not the case. From this, it can be concluded that there is a problem with either the questions in the survey or the small number of people included in the sample; the latter seems more probable. With such a small number of participants, the spread of the variable values is larger; additionally, the variance within the variables is smaller. This can lead to the effect seen here, in which almost everything is strongly positively correlated with everything else. The case of Perceived Usefulness is curious, as it does not offer results that make sense in the context of TAM2. For Ease of Use and the Characteristics of the Output, the box plot shows a lot of disagreement among the participants, because the median is far off the mean. In the case of Perceived Characteristics of the Output, there were a lot of answers on the higher end, whereas with Ease of Use there were a few outliers on the lower end. Again, this problem should be resolved with a larger sample size. The overall averages of the variables are all close to 4, ranging from 3,82 in the case of Perceived Usefulness to 5,12 for Enjoyment. These values already include the adjustment based on the participants' self-assessed confidence in their answers: if they answered with a 1 here, the result of the question was lowered by 15%, and if they answered with a 7, it was raised by 15%. This was done in anticipation of the participants not being sure about their answers and to avoid all the answers ending up somewhere in the middle. Even though this precaution was taken, the results still lead to the conclusion that the participants were not sure about their answers, because the overall averages, although almost all of them are above 4, are still in the area of 4, which represents indifference.

The participants were also asked about their age and the work they do. In the bracket from 35-44, especially low values can be observed: the two participants falling within this bracket gave 1,85 to the Characteristics of the Output and to Enjoyment. The highest value they have is for Anticipated Use with 3,95, which is still under 4. Overall, age did not seem to have a large effect on the results. The participants in the bracket from 55-64 assigned the highest values to the variables, but they were also only two people; with only two people it is hard to justify a conclusion here.

Since everybody has a different position, the work was grouped under umbrella terms: Accountancy, Distribution and Other. With 7 people, Accountancy was the largest group, making up half of the total sample. Their answers are all around 4, except Enjoyment, which with 5,22 has the highest value of all the groups. Probably they enjoyed playing around with the software. The same cannot be said for the Perceived Characteristics of the Output, where 4,07 is the lowest value. Especially for this group, higher values could have been expected, because some of their tasks include creating reports for the board of directors or for the controlling department. The group with the highest values overall is Other. It is hard to explain why this is so, as the work they do is usually not as reliant on reporting. It was also checked how relevant numeric charts (graphical representations of data) were for them at the workplace. Almost half of the participants assigned a high value of 6 to this. Therefore, one would think that Power BI would be a useful tool that makes the preparation of numeric charts easier and more intuitive. However, this is not the case, as can clearly be seen from the low values assigned to Perceived Usefulness. From the feedback, it emerged that many of the participants do not really require such a tool, because they do not need access to the whole database. For their everyday work, it seemed that they could easily work with Excel files being passed around among their peers. However, none of them worked for larger companies; this might also be a contributing factor that should be explored in further research. Also, as Lennerholt and Söderström (2018) describe in their paper about the implementation challenges of SSBI, users need a good understanding of the notion of analytics to use the tool; otherwise they will end up using it just to confirm their own views.

Since the participants saved their reports, it was possible to take a closer look at them and see which steps they were able to complete and which ones they failed. The questions were structured so that they went from easier to harder. What stuck out is that in step two only 7 people managed to count the products. From the feedback, it could be derived that the participants were struggling to find the small arrow in the values section. Looking at the rest of the results, it can be seen that the participants generally understood the structure of the visualizations pane, in which they can manipulate the values they add, change the filters and their form, and activate the drill-through function. When it came to drilling through, only one of the participants managed to do so. The main reason for this was probably that, in the short period of time they had, they did not get this far, or that they did not understand that the drill through needs to be activated on the visualization to which they want to drill through. Overall, they also used the right fields from the dimensions provided in the data warehouse. Hence, changing the names from technical terms to a more natural language, in order to adapt the ontology to its new context, was a necessary step to ensure the accessibility of the data. The two main problems that make this step necessary are that the information is always highly domain-specific and that the technical rendering of terms used in the data warehouse makes it hard to understand for unskilled knowledge workers (Li, Thomas & Osei-Bryson, 2017). It also proved that a dimensional model, such as a star schema, is more accessible than a relational model (Passlick, Lebek, & Breitner, 2017). In Figure 16 it was then explored whether there was a correlation between Ease of Use and the steps completed; in the best-case scenario, a linear relationship would have been visible here. Most of the participants in “Cluster 4” in particular finished more steps than the average but gave low ratings to Ease of Use; they tended to give values below 4, meaning that they were struggling with the software. It can be assumed that most of them took longer than 20 minutes and were keen on figuring out solutions to the steps of the experiment.

It is hard to compare this research to other work, as no one so far has investigated SSBI in this way. Usually the tools are assessed by experts, which is, for example, the case with the Gartner Magic Quadrant (Howson et al., 2018); in this research, however, the experts were left out and the users were directly confronted with a data analysis problem and a software none of them had used before. Also, the concept of Business Intelligence was new to most: out of the 14 participants, only two marked that they had had contact with Business Intelligence solutions before. For further research, I would suggest finding a more suitable timeframe for the experiment and conducting it in a corporate environment, perhaps within a single department. The last point is especially important considering that information is always highly domain-specific (Li, Thomas & Osei-Bryson, 2017). In the case of this research, it was impossible to find a sample with representative characteristics; hence the participants all had different backgrounds and were of varying ages (which might not be relevant). The biggest problem was the small sample size. In their paper about the forced use of self-service technologies, Liu (2012) used a total of 290 participants in their survey. Since this experiment took quite some time, the response rate was low: of the 110 people usually required for such studies, this research had only 14. To increase the response rate, some sort of incentive should be given to the participants, so that they are willing to spend their time on it.

Regarding the research question, based on the previously discussed findings, it can be said that the participants did not find a SSBI solution useful for their work, but also did not find it too hard to work with, which is a good sign for the accessibility of SSBI. A good understanding of SSBI and of analytics was not demonstrated by the participants of the experiment; hence, when introducing such a system, extensive training in analytics as well as in the SSBI tool should be undertaken.

CONCLUSION AND OUTLOOK

Self-Service Business Intelligence is supposed to make data analysis more accessible to personnel who previously did not have the opportunity to work with it, by combining it with a drag-and-drop approach. To do so, a number of things need to be ensured before making it available to the users. Business Intelligence is an umbrella term for applications, infrastructure, and tools that enable users to access their desired information and make it usable for data analysis. First of all, the users need to understand how it will improve their decision making. In today's business world, reaction time is a huge success factor. If users keep spreading decision-critical data across Excel files, their workload only increases and timely data cannot be ensured. The big advantage of SSBI is that it is directly linked to the data sources and is thus always representative of the reality the company is currently in. Additionally, SSBI allows for mashups of data. To make these data sources accessible to the users in a comprehensible way, the data source needs to be structured adequately. SSBI promises users independence from IT personnel when they need a new report. This is not entirely true, as an SSBI solution first needs to be implemented correctly. If the data source is modeled relationally, it would be hard for users to find the data they require; therefore, the data model needs to be simplified. A star schema, for example, is much easier to understand than a relational database in 3NF, which is designed for the sole purpose of transactional efficiency. Besides this, a relational database does not offer the performance required for data analysis problems, as the queries take too long to execute and, due to their length, are too complex. Even for BI power users, designing these queries is a hard and time-consuming task.

Two major approaches in data modeling are the Inmon and the Kimball approach, each with its advantages and disadvantages. In this thesis, the Kimball approach was chosen, as it is more department-centric. This also makes data governance easier, as it can be geared towards the different departments in a more effective manner. Assuming each department is a user group, the ontology of the terms can also be set in a more individual manner. Ontology in general is quite important in SSBI, even more so than in traditional BI applications. The same words derived from organizational naming conventions can have different meanings in different departments; the context is what counts. Looking at an address, for example: for the marketing department the city might be interesting for targeting advertisements in that area, whereas for the transportation/logistics department it only represents the destination or origin of the products. Ontology is also important because it takes the technicalities out of the terms; Sales.NetProfit might not be as easily understood as simply Net Profit. In terms of data governance, the data also needs to fulfill the different data quality requirements, such as completeness, consistency and so forth.

The goal of this thesis was to find out whether inexperienced casual users would be able to handle such a system and whether they would consider it meaningful, in the context of their work, as a complementary tool to Excel. To do so, the participants took part in an experiment in which they worked with MS Power BI. The experiment was done online and included a short explanation of the different functions of Power BI as well as an example dashboard with which they could play around to get some insight into the bigger picture by seeing what is possible with the tool. The experiment was especially catered towards people without an IT background. The data used for this experiment was a transformed version of the AdventureWorks sample database by Microsoft, which contained a total of 31.465 orders from customers spread across the globe. The final data warehouse was built from the perspective of the sales department. The survey that followed the experiment was designed with the Technology Acceptance Model in its second version in mind. This work is more qualitative; therefore, the model served mainly as a framework for the appropriate selection of questions. The five variables that were investigated are: Perceived Usefulness, Perceived Ease of Use, Perceived Characteristics of the Output, Enjoyment and Anticipated Use.

Before discussing the findings, it is important to mention some of the limitations of the experiment. By far the largest limitation was the small sample size of 14 participants. Each of them went through the process of doing the experiment, which took them approximately 30 minutes or more. It can be assumed that the intensity of the experiment scared many off, as it was clear from the beginning that it is quite time-consuming. As explained in the discussion chapter, the results of the survey that followed the experiment are mixed. Because of the small sample size, it was hard to derive demographic characteristics that might influence the participants' attitude towards SSBI; the same holds true for the area of work the participants are in. The only characteristic shared by all participants was that they had an academic background. Half of the sample was working as accountants, 14% in distribution and 36% in other business areas. The accountants scored indifferent values around 4 for the variables that were investigated; only the Enjoyment variable, which represents the fun they had working with the software, was higher than the others. Ease of Use seemed to be a highly debated topic, as some people thought that the software was hard to use. This also holds true for participants who managed to complete many of the steps in the experiment. From this, it can be concluded that even though some participants were able to solve the main part of the data analysis problem, it took them longer than the prescribed 20 minutes. The variable that scored the lowest overall was Perceived Usefulness, which describes the degree to which participants believe that SSBI would improve their work performance.

In other research, SSBI has been shown to improve job performance, and in the Gartner Magic Quadrant report from 2018 the authors even went as far as segmenting the market of BI solutions into a traditional segment and one that is more focused on visual exploratory data analysis (which includes SSBI). The most likely reason that this was not observed here is that the participants did not have enough time to fully understand the concept of SSBI. Out of the 14, only two had had access to a BI solution before, so the general concepts were totally new to them. Much of the feedback received indicated that the participants do not need to work with such large data structures and are fine with sticking to Excel sheets. One person from the HR department, for example, said that these sorts of problems are something she never has to work with. So at least it can be derived that SSBI has some specific use cases and should not be made accessible to all positions in an organization, because this would only be a waste of resources. On the other hand, it is interesting to see that a pharmaceutical consultant, who is not directly involved with the business, continuously works with BI solutions and considered Power BI to be really helpful. So, if only as a matter of saving resources in implementing and maintaining such a solution, it would be interesting to conduct a study in which the different use cases are further investigated and analyzed.

During the process of analyzing the results, it was also possible to look at the different steps and their success rates. In general, it can be said that the participants were able to navigate the software and understood the fields of the dimension tables and the fact table necessary to solve the data analysis problem. They also understood how to work with hierarchies, so that they could perform a drill-down. One area the participants struggled with was manipulating the data in the input fields for the visualizations; this includes, for example, changing the value from a sum to a mean or to a count of the items included in the field. The same held true for filtering actions. The last section of the experiment dealt with drilling through from one page of the report to another. Whereas with a drill-down/up you move vertically, a drill-through lets you move horizontally between two items via an established link; in other words, you move from one data view to another that offers more insight into the KPI in question. Here the success rate was the lowest. Two reasons come to mind: many of the participants ran out of time, or they did not grasp the concept of a drill-through. When implementing a SSBI solution, extensive training must be done in advance, not only regarding navigating the software through simple click-and-follow tutorials; the concepts related to BI and data analysis also need to be explained in a way that helps the users make better decisions. Otherwise, they have the software but cannot use it to its full capabilities, which will result in it becoming obsolete. This goes hand in hand with effective change management, which helps to circumvent resistance to adoption among the employees.

For further research, it should be made sure that the sample is adequate, so that a proper statistical analysis can be performed. Also, the sample should be placed in a specific scenario, such as a single department. Since such an experiment is quite time-consuming for the participants, it is suggested to offer them an incentive in order to increase the response rate. To better assess what went wrong in the experiment, it would also be interesting to conduct a few interviews with participants who produced interesting results in connection with Ease of Use, in order to better understand why they were not able to complete some steps.

REFERENCES

1. Abelló, A., Darmont, J., Etcheverry, L., Golfarelli, M., López, M., Norberto, J., … Vossen, G. (2013). Fusion cubes: towards self-service business intelligence. https://doi.org/10.4018/jdwm.2013040104
2. AdventureWorks for SQL Server 2016 CTP. (2018, April 13). Retrieved 13 April 2018, from https://www.microsoft.com/en-us/download/details.aspx?id=49502
3. Alpar, P., & Schulz, M. (2016). Self-Service Business Intelligence. Business & Information Systems Engineering, 58(2), 151–155. https://doi.org/10.1007/s12599-016-0424-6
4. Askham, N., Cook, D., Doyle, M., Fereday, H., Gibson, M., Landbeck, U., … Schwarzenbach, J. (2013). The Six Primary Dimensions for Data Quality Assessment. DAMA UK, 17.
5. Badessa, R. (2013). Heraclitus of Ephesus. Salem Press Biographical Encyclopedia.
6. Batini, C., & Scannapieca, M. (2006). Data quality: concepts, methodologies and techniques. Berlin; New York: Springer.
7. Burke, M., Simpson, W. & Staples, S. (2016). The Cure for Ailing Self-Service Business Intelligence. Business Intelligence Journal, 21(3). Retrieved from https://www.osti.gov/pages/biblio/1367536-cure-ailing-self-service-business-intelligence
8. Castro, D., Atkinson, R. D., & Ezell, S. J. (2010). Embracing the Self-Service Economy. SSRN Electronic Journal. https://doi.org/10.2139/ssrn.1590982
9. Chaudhuri, S. & Dayal, U. (1997). An Overview of Data Warehousing and OLAP Technology. SIGMOD Rec., 26(1), 65–74. https://doi.org/10.1145/248603.248616
10. Chaudhuri, S., Dayal, U. & Narasayya, V. (2011). An Overview of Business Intelligence Technology. Communications of the ACM, 54(8), 88–98. https://doi.org/10.1145/1978542.1978562
11. Choi, N., Song, I.-Y. & Han, H. (2006). A Survey on Ontology Mapping. SIGMOD Rec., 35(3), 34–41. https://doi.org/10.1145/1168092.1168097
12. Davis, F. D. (1985a). A technology acceptance model for empirically testing new end-user information systems: theory and results (Thesis). Massachusetts Institute of Technology. Retrieved from http://dspace.mit.edu/handle/1721.1/15192
13. Davis, F. D. (1985b). A technology acceptance model for empirically testing new end-user information systems: theory and results (Thesis). Massachusetts Institute of Technology. Retrieved from http://dspace.mit.edu/handle/1721.1/15192
14. Davis, Z. (2012). Top Five Considerations for Self-Service BI Dashboards. Ziff Davis. Retrieved from http://www.prostrategy.ie/wp-content/uploads/2015/09/Top-Five-Considerations-for-Self-Services-BI-Dashboards.pdf
15. De Veaux, R. (2013). Data mining. Salem Press Encyclopedia of Science.
16. Directive 2003/98/EC of the European Parliament and of the Council of 17 November 2003 on the re-use of public sector information, Pub. L. No. 32003L0098, OJ L 345 (2003). Retrieved from http://data.europa.eu/eli/dir/2003/98/oj/eng

17. Eckerson, W. W. (2009a). Beyond Reporting: Delivering Insights with Next-Generation Analytics, 32.
18. Eckerson, W. W. (2009b). TDWI Checklist Report: Self-Service BI, 8.
19. Eckerson, W. W. (2014a). Five Steps to delivering Self-Service BI to Everyone. TechTarget CustomMedia.
20. Eckerson, W. W. (2014b). Five Steps to delivering Self-Service BI to Everyone (p. 6). Newton, MA 02466: TechTarget CustomMedia.
21. Eckerson, W. W. (2016). A Reference Architecture for Self-Service Analytics. Retrieved from http://go.timextender.com/hubfs/EckersonGroup__ReferenceArchitectureforSelf-Service_Final_083116.pdf?_ga=2.170039904.1531123024.1509345044-1402613734.1507804061&t=1525989643382
22. Eder, J., & Koncilia, C. (1998). Evolution of dimension data in temporal data warehouses.
23. El Akkaoui, Z., Zimànyi, E., Mazón, J.-N. & Trujillo, J. (2011). A Model-driven Framework for ETL Process Development. In Proceedings of the ACM 14th International Workshop on Data Warehousing and OLAP (pp. 45–52). New York, NY, USA: ACM. https://doi.org/10.1145/2064676.2064685
24. Eppler, M. J. (2006). Managing Information Quality: Increasing the Value of Information in Knowledge-intensive Products and Processes (2nd ed.). Berlin Heidelberg: Springer-Verlag.
25. Esbensen, K. H., Guyot, D., Westad, F. & Houmoller, L. P. (2002). Multivariate Data Analysis: In Practice: an Introduction to Multivariate Data Analysis and Experimental Design. Multivariate Data Analysis.
26. Fine, K. (1991). The Study of Ontology. Noûs, 25(3), 263–294. https://doi.org/10.2307/2215504
27. Freud, S., & Sigmund Freud Collection (Library of Congress). (1962). Civilization and its discontents. New York: W.W. Norton.
28. Girase, A. V., Patnaik, G. K. & Patil, S. S. (2016). Developing knowledge driven ontology for decision making. In 2016 International Conference on Signal Processing, Communication, Power and Embedded System (SCOPES) (pp. 99–105). https://doi.org/10.1109/SCOPES.2016.7955610
29. Heller, M. (2017). 10 hot data analytics trends and 5 going cold: Big data, machine learning, data science - the data analytics revolution is evolving rapidly. CIO, 9–15.
30. Hoeren, T. (2018). Big Data and Data Quality. In T. Hoeren & B. Kolany-Raiser (Eds.), Big Data in Context (pp. 1–12). Cham: Springer International Publishing. https://doi.org/10.1007/978-3-319-62461-7_1
31. Howson, C., Sallam, R., Richardson, J. L., Tapadinhas, J., Idoin, C. J. & Woodward, A. (2018). Magic Quadrant for Analytics and Business Intelligence Platforms. Kentucky, USA: Gartner.
32. Humm, B., & Wietek, F. (2005). Architektur von Data Warehouses und Business Intelligence Systemen. Informatik-Spektrum, 28(1), 3–14. https://doi.org/10.1007/s00287-004-0450-5
33. Imhoff, C., & White, C. (2011). Self-Service Business Intelligence: Empowering Users to Generate Insights. TDWI, 40.
34. Inmon, W. H. (2005). Building the Data Warehouse (4th ed.). Wiley India Pvt. Limited.
35. Janoschek, N., Bange, C., Fuchs, C., Seidler, L., Tischler, R., Lartigue, E., … von Simson, C. (2017). The BI Survey 17 – The Results. BARC, 68.

36. Lippert, S. K. (2002). Technology Trust: An Inventory of Trust Infrastructures for Government and Commercial Information Systems In Support of Electronic Commerce. https://doi.org/10.28945/2527
37. Kabakchieva, D., Stefanova, K. & Yordanova, S. (2013). Latest Trends in Business Intelligence System Development. Presented at the 3rd International Conference on Application of Information and Communication Technology and Statistics in Economy and Education, Sofia.
38. Karttunen, S. (2012). Cultural policy indicators: reflections on the role of official statisticians in the politics of data collection. Cultural Trends, 21(2), 133–147. https://doi.org/10.1080/09548963.2012.674753
39. Kimball, R. (1997, August 2). A Dimensional Modeling Manifesto. Retrieved 3 July 2018, from https://www.kimballgroup.com/1997/08/a-dimensional-modeling-manifesto/
40. Kimball, R. & Ross, M. (2013). The Data Warehouse Toolkit: The Definitive Guide to Dimensional Modeling. John Wiley & Sons.
41. Kobielus, J. (2009). Mighty Mashups: Do-It-Yourself Business Intelligence For The New Economy. Forrester Research, Inc., 20.
42. Kosambia, S. (2008). Business Intelligence The Self-Service Way. DM Review, 18(7), 20–22.
43. Larson, B. (2009). Delivering business intelligence with Microsoft SQL server 2008. McGraw-Hill: New York.
44. Lennerholt, C., & Söderström, E. (2018a). Implementation Challenges of Self Service Business Intelligence: A Literature Review. Proceedings of the 51st Hawaii International Conference on System Sciences, 9.
45. Lennerholt, C. & Söderström, E. (2018b). Implementation Challenges of Self Service Business Intelligence: A Literature Review. In Proceedings of the 51st Hawaii International Conference on System Sciences (pp. 5055–5063). Hawaii.
46. Li, Y., Thomas, M. A. & Osei-Bryson, K.-M. (2017). Ontology-based data mining model management for self-service knowledge discovery. Information Systems Frontiers, 19(4), 925–943. https://doi.org/10.1007/s10796-016-9637-y
47. Lippert, S. K., & Michael Swiercz, P. (2005). Human resource information systems (HRIS) and technology trust. Journal of Information Science, 31(5), 340–353. https://doi.org/10.1177/0165551505055399
48. Liu, S. (2012). The impact of forced use on customer adoption of self-service technologies. Computers in Human Behavior, 28(4), 1194–1201. https://doi.org/10.1016/j.chb.2012.02.002
49. Martin, A. & Maladhy, D. (2011). A Framework for Business Intelligence Application Using Ontological Classification. International Journal of Engineering Science and Technology, 3(2), 9.
50. Meyers, C. (2014). How Data Management and Governance Can Enable Successful Self-Service BI. Business Intelligence Journal, 19(4), 23–27.
51. Microsoft. (2017, February). Whitepaper Data Governance for GDPR Compliance: Principles, Processes, and Practices. Retrieved 29 May 2018, from https://info.microsoft.com/DataGovernanceforGDPRCompliancePrinciplesProcessesandPractices-Registration.html


52. Mikroyannidis, A. & Theodoulidis, B. (2006). Heraclitus II: A Framework for Ontology Management and Evolution. In 2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2006 Main Conference Proceedings) (WI’06) (pp. 514–521). https://doi.org/10.1109/WI.2006.90
53. Mikroyannidis, A. & Theodoulidis, B. (2010). Ontology management and evolution for business intelligence. International Journal of Information Management, 30(6), 559–566. https://doi.org/10.1016/j.ijinfomgt.2009.10.002
54. Mikroyannidis, A. & Theodoulidis, B. (2012). A framework for ontology-based temporal modelling of business intelligence. Knowledge Management Research & Practice, 10(2), 188–199. https://doi.org/10.1057/kmrp.2012.2
55. Mokyr, J., Vickers, C. & Ziebarth, N. L. (2015). The History of Technological Anxiety and the Future of Economic Growth: Is This Time Different? The Journal of Economic Perspectives, 29(3), 31–50.
56. Moody, D. L. & Kortink, M. A. R. (2003). From ER Models to Dimensional Models: Bridging the Gap, 18.
57. Mouroutis, S. (2015). Data quality: does poor data quality significantly impact the effectiveness of data analysis and other business intelligence activities?
58. Muir, B. M. (1994). Trust in automation: Part I. Theoretical issues in the study of trust and human intervention in automated systems. Ergonomics, 37(11), 1905–1922. https://doi.org/10.1080/00140139408964957
59. OECD. (2015). G20/OECD Principles of Corporate Governance. https://doi.org/10.1787/9789264236882-en
60. Oliver, D., Romm-Livermore, C. & Sudweeks, F. (Eds.). (2009). Self-service in the Internet age: expectations and experiences. London: Springer.
61. Otto, B. & Weber, K. (2011). Data Governance. In Daten- und Informationsqualität (pp. 277–295). Vieweg+Teubner. https://doi.org/10.1007/978-3-8348-9953-8_16
62. Passlick, J., Lebek, B. & Breitner, M. H. (2017). A Self-Service Supporting Business Intelligence and Big Data Analytics Architecture (pp. 12–15). Presented at the 13th International Conference of Business Informatics, St. Gallen, Switzerland.
63. Sallam, R., Howson, C., Idoine, C. J., Oestreich, T., Richardson, J. L. & Tapadinhas, J. (2017). Magic Quadrant for Business Intelligence and Analytics Platforms. Retrieved from www.gartner.com/home
64. Saunders, C. (1917, October 9). Self-serving store. Retrieved from https://patents.google.com/patent/US1242872A/en
65. Scannapieco, M., Missier, P. & Batini, C. (2005a). Data Quality at a Glance. Datenbank-Spektrum, 14, 6–14.
66. Schaffner, J., Bog, A., Krüger, J. & Zeier, A. (2008). A Hybrid Row-Column OLTP Database Architecture for Operational Reporting. In Business Intelligence for the Real-Time Enterprise (pp. 61–74). Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03422-0_5
67. Scherer, A., Wünderlich, N. V. & von Wangenheim, F. (2015). The Value of Self-Service: Long-Term Effects of Technology-Based Self-Service Usage on Customer Retention. MIS Quarterly, 39(1), 177–200.


68. Schlesinger, P. A. & Rahman, N. (2015). Self-Service Business Intelligence Resulting in Disruptive Technology. Journal of Computer Information Systems, 56(1), 11–21.
69. Sehgal, S. & Ranga, K. K. (2016). Translation of Entity Relational Model to Dimensional Model. International Journal of Computer Science and Mobile Computing, 5(5), 439–447.
70. Shapiro, M., Bieniusa, A., Zeller, P. & Petri, G. (2018). Ensuring referential integrity under causal consistency.
71. Shariat, M. & Hightower, R. (2007). Conceptualizing Business Intelligence Architecture, 7.
72. Sheposh, R. (2016). Ontology. Salem Press Encyclopedia of Health.
73. Sheposh, R. (2017). Garbage in, garbage out (GIGO). Salem Press Encyclopedia of Science.
74. Simovici, D. A. & Djeraba, C. (2008). Topologies and Measures on Metric Spaces. In Mathematical Tools for Data Mining (pp. 423–458). Springer, London. https://doi.org/10.1007/978-1-84800-201-2_11
75. Spahn, M., Kleb, J., Grimm, S. & Scheidl, S. (2008). Supporting Business Intelligence by Providing Ontology-based End-user Information Self-service. In Proceedings of the First International Workshop on Ontology-supported Business Intelligence (pp. 10:1–10:12). New York, NY, USA: ACM. https://doi.org/10.1145/1452567.1452577
76. State of Self Service BI Report. (2015). Logi Analytics.
77. Stodder, D. (2012). TDWI Checklist Report | Seven Steps to Actionable Personal Analytics and Discovery. Transforming Data with Intelligence. Retrieved from https://tdwi.org/research/2012/06/tdwi-checklist-report-seven-steps-to-actionable-personal-analytics-and-discovery.aspx
78. Thurnheer, A. (2003, April 10). Temporale Auswertungsformen in OLAP [Temporal evaluation forms in OLAP]. Wirtschaftswissenschaftliche Fakultät der Universität Basel, Basel.
79. Vakulenko, Y., Hellstrom, D. & Oghazi, P. (2018). Customer value in self-service kiosks: a systematic literature review. International Journal of Retail & Distribution Management. https://doi.org/10.1108/IJRDM-04-2017-0084
80. Vassiliadis, P., Simitsis, A. & Skiadopoulos, S. (2002). Conceptual Modeling for ETL Processes. In Proceedings of the 5th ACM International Workshop on Data Warehousing and OLAP (pp. 14–21). New York, NY, USA: ACM. https://doi.org/10.1145/583890.583893
81. Venkatesh, V. & Davis, F. D. (2000). A Theoretical Extension of the Technology Acceptance Model: Four Longitudinal Field Studies. Management Science, 46(2), 186–204.
82. Venkatesh, V., Morris, M. G., Davis, G. B. & Davis, F. D. (2003). User Acceptance of Information Technology: Toward a Unified View. MIS Quarterly, 27, 425–478. https://doi.org/10.2307/30036540
83. Vojvodic, M. (2017). Addressing GDPR Compliance Using Oracle Data Integration and Data Governance Solutions. Oracle, 13.
84. Wang, Y. R. & Madnick, S. E. (1989). The Inter-Database Instance Identification Problem in Integrating Autonomous Systems. In Proceedings of the Fifth International Conference on Data Engineering (pp. 46–55). Washington, DC, USA: IEEE Computer Society.
85. Weber, M. (2013). Keys to Sustainable Self-Service Business Intelligence. Business Intelligence Journal, 18(1), 18–24.
86. Wimalasuriya, D. C. (2009). Ontology-Based Information Extraction. University of Oregon, 37.

87. Yessad, L. & Labiod, A. (2016). Comparative study of data warehouses modeling approaches: Inmon, Kimball and Data Vault. In 2016 International Conference on System Reliability and Science (ICSRS) (pp. 95–99). https://doi.org/10.1109/ICSRS.2016.7815845


APPENDICES

APPENDIX I

APPENDIX II

APPENDIX III