13.7 Star Schema Facts Dimensions Attributes

13.7 Star Schema Facts Dimensions • Data modeling technique • Numeric measurements • Qualifying that represent specific characteristics provide – Maps multidimensional decision support data into business aspect or activity relational database additional perspectives – Normally stored in fact to a given fact • Creates near equivalent of multidimensional table that is center of star • Decision support data database schema from relational data schema almost always viewed in • Easily implemented model for multidimensional • Fact table contains facts relation to other data data analysis linked through their • Study facts via – Preserves relational structures on which operational dimensions dimensions database is built • Metrics are facts • Dimensions stored in • Four components: facts, dimensions, attributes, computed at run time and attribute hierarchies dimension tables Database Systems, 8th Edition 1 Database Systems, 8th Edition 2 Attributes 13.13 • Each dimension table contains attributes • Use to search, filter, and classify facts • Dimensions provide descriptive characteristics of facts through their attributes • No mathematical limit to the number of dimensions • The data warehouse designer must define common business attributes, to be used by data analyst to narrow a search, group information, or describe dimension Database Systems, 8th Edition 3 Database Systems, 8th Edition 4 13.11 Slice and dice: focus on slices of the data cube for more detailed analysis 5 6 13.17 Attribute Hierarchies 13.16 • Attributes within dimensions can be ordered in a well-defined attribute hierarchy • Provide top-down data organization • Two purposes: – Aggregation – Drill-down/Roll-up data analysis • Determine how the data are extracted and represented • Stored in the DBMS’s data dictionary • Used by OLAP tool to access warehouse properly Database Systems, 8th Edition 7 8 Star Schema Representation 13.18 • Facts and dimensions represented in physical tables in data warehouse database • Many fact rows related to each dimension row – Primary key of fact table is a composite primary key – Fact table primary key formed by combining foreign keys pointing to dimension tables • Dimension tables smaller than fact tables • Each dimension record related to thousands of fact records Database Systems, 8th Edition 9 10 13.19 13.9 Data Mining • Data-mining tools do the following: – Analyze data – Uncover problems or opportunities hidden in data relationships – Form computer models based on their findings – Use models to predict business behavior • Requires minimal end-user intervention Skip 13.7.6, 13.8 11 Database Systems, 8th Edition 12 13.23 13.24 13 14 13.10 SQL Extensions for OLAP 13.25 • Proliferation of OLAP tools fostered development of SQL extensions • Many innovations have become part of standard SQL • All SQL commands will work in data warehouse as expected • Most queries include many data groupings and aggregations over multiple columns Database Systems, 8th Edition 15 16 The ROLLUP Extension • Used with GROUP BY clause to generate aggregates by different dimensions • GROUP BY generates only one aggregate for each new value combination of attributes • ROLLUP extension enables subtotal for each column listed except for the last one – Last column gets grand total • Order of column list important Database Systems, 8th Edition 17 18 The CUBE Extension • CUBE extension used with GROUP BY clause to generate aggregates by listed columns – Includes the last column • Enables subtotal for each column in addition to grand total for last column • Useful when you want to compute all possible subtotals within groupings • Cross-tabulations good application of CUBE extension Database Systems, 8th Edition 19 20 Materialized Views • A dynamic table that contains SQL query command to generate rows – Also contains the actual rows • Created the first time query is run and summary rows are stored in table • Automatically updated when base tables are updated Database Systems, 8th Edition 21 22 Summary • Business intelligence generates information used to support decision making • BI covers a range of technologies, applications, and functionalities • Decision support systems were the precursor of current generation BI systems • Operational data not suited for decision support 23 Database Systems, 8th Edition 24 Summary (continued) Summary (continued) • Four categories of requirements for decision • OLAP systems have four main characteristics: support DBMS: – Use of multidimensional data analysis – Database schema – Advanced database support – Data extraction and loading – Easy-to-use end-user interfaces – End-user analytical interface – Client/server architecture – Database size requirements • Data warehouse provides support for decision • ROLAP provides OLAP functionality with relational making databases – Usually read-only • MOLAP provides OLAP functionality with MDBMSs – Optimized for data analysis, query processing Database Systems, 8th Edition 25 Database Systems, 8th Edition 26 Summary (continued) • Star schema is a data-modeling technique – Maps multidimensional decision support data into a relational database • Star schema has four components: –Facts – Dimensions – Attributes – Attribute hierarchies • Data mining automates analysis of operational data • SQL extensions support OLAP-type processing and data generation Database Systems, 8th Edition 27.

13.7 Star Schema Facts Dimensions Attributes

Star Schema Modeling with Pentaho Data Integration

Star Vs Snowflake Schema in Data Warehouse

Beyond the Data Model: Designing the Data Warehouse

Olap Queries

A Data Warehouse Implementation Using the Star Schema

Introduction to UCDW Star Schemas and Data Marts

The-Unified-Star-Schema.Pdf

Providing High Availability and Elasticity for an In-Memory Database System with Ramcloud

Unit 11 Online Analytical Processing (OLAP) Basic Concepts

Student Absence Attendance Fine Using Data Warehouse System

Column-Oriented Database Systems with Daniel Abadi (Yale) Stavros Harizopuolos (HP Labs)

USING a DIMENSIONAL DATA WAREHOUSE to STANDARDIZE SURVEY and CENSUS METADATA Mickey Yost and Jack Nealon National Agricultural Statistics Service, U.S