SAS 9.1 OLAP Server: Administrator’S Guide, Please Send Them to Us on a Photocopy of This Page, Or Send Us Electronic Mail

Total Page:16

File Type:pdf, Size:1020Kb

SAS 9.1 OLAP Server: Administrator’S Guide, Please Send Them to Us on a Photocopy of This Page, Or Send Us Electronic Mail SAS® 9.1 OLAP Server Administrator’s Guide The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2004. SAS ® 9.1 OLAP Server: Administrator’s Guide. Cary, NC: SAS Institute Inc. SAS® 9.1 OLAP Server: Administrator’s Guide Copyright © 2004, SAS Institute Inc., Cary, NC, USA All rights reserved. Produced in the United States of America. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, electronic, mechanical, photocopying, or otherwise, without the prior written permission of the publisher, SAS Institute Inc. U.S. Government Restricted Rights Notice. Use, duplication, or disclosure of this software and related documentation by the U.S. government is subject to the Agreement with SAS Institute and the restrictions set forth in FAR 52.227–19 Commercial Computer Software-Restricted Rights (June 1987). SAS Institute Inc., SAS Campus Drive, Cary, North Carolina 27513. 1st printing, January 2004 SAS Publishing provides a complete selection of books and electronic products to help customers use SAS software to its fullest potential. For more information about our e-books, e-learning products, CDs, and hard-copy books, visit the SAS Publishing Web site at support.sas.com/pubs or call 1-800-727-3228. SAS® and all other SAS Institute Inc. product or service names are registered trademarks or trademarks of SAS Institute Inc. in the USA and other countries. ® indicates USA registration. Other brand and product names are registered trademarks or trademarks of their respective companies. Contents What’s New v Overview v Details v Chapter 1 R OLAP Introduction and Overview 1 What Is OLAP? 1 What Is a Cube? 2 Understanding the Cube Structure 3 What Is SAS OLAP Server? 3 About Cube Metadata Storage 4 Why You Should Use Cubes 4 Analyzing Your Data 5 Chapter 2 R Installing and Administering SAS OLAP Server 7 Installing and Configuring SAS OLAP Server 8 Monitoring OLAP Server Performance 27 Changing an OLAP Server Configuration 28 Optimizing OLAP Server 31 Monitoring and Administering Sessions—SAS OLAP Server Monitor Plug-In 34 Securing Cubes 35 Cubes and the Metadata Server 38 Understanding Change Management in SAS OLAP Cube Studio 39 Accessing OLAP Cubes from SAS: SQL Pass-Through Facility for OLAP 40 Chapter 3 R Building and Updating Cubes 45 Background 45 Building a Cube from a Detail Table 48 Building a Cube from a Summary Table 55 Building a Cube from a Star Schema 61 Updating a Cube 67 Refreshing Cube Metadata 67 Defining Member Properties 68 Defining Multiple Hierarchies for a Dimension 69 Defining Ragged Hierarchies for a Dimension 70 Manually Tuning Cube Aggregates 73 Multiple Language Support and Dimension Table Translations 74 Adding SAS System Options to a Cube 75 Specifying Tuning and Performance Options in Cube Aggregations 76 Chapter 4 R Using SAS OLAP Cubes 79 Using a Cube with ADO MD 79 Using a Cube with OLE DB for OLAP 79 iv Using a Cube with Additional SAS Software 80 Using a Cube with Third-Party Clients 80 Chapter 5 R Transitioning from SAS OLAP Server Release 8.2 to SAS 9.1 85 Conversion and Migration Issues from Release 8.2 to SAS 9.1 85 Comparing OLAP Functionality in SAS 8 and SAS 9.1 86 Comparing PROC MDDB Code and PROC OLAP Code 89 Appendix 1 R The OLAP Procedure 91 The OLAP Procedure 92 Syntax: OLAP Procedure 92 PROC OLAP Statement 92 METASVR Statement 97 DIMENSION Statement 99 LEVEL Statement 102 PROPERTY Statement 103 HIERARCHY Statement 105 MEASURE Statement 107 AGGREGATION Statement 110 DROP_AGGREGATION Statement 112 DEFINE Statement 113 USER_DEFINED_TRANSLATIONS Statement 115 Tables Used to Define Cubes 119 Naming Guidelines for SAS OLAP Server 120 Loading Cubes 121 Maintaining Cubes 125 Specialized Syntax Options for PROC OLAP 127 Appendix 2 R Recommended Reading 129 Recommended Reading 129 Glossary 131 Index 139 v What’s New Overview The SAS OLAP Server enables users to develop and deploy scalable Online Analytical Processing (OLAP) applications. In addition, automated data loading and cube building is available through the use of a new administration interface called the SAS OLAP Cube Studio, which was developed using Java technology. OLAP queries are performed using the Multidimensional Expressions (MDX) query language in client applications that are connected to the OLAP Server by using the SQL Pass-Through Facility for OLAP, which is designed to process MDX queries within the PROC SQL environment. open access technologies such as OLE DB for OLAP, ADO MD, and Java. Note: This section describes the features of the SAS OLAP Server that are new or enhanced since SAS 8.2. R Details There are two new tools for data loading and cube building: The OLAP procedure, in addition to cube building, includes options for handling ragged hierarchies, defining global calculated members and named sets, assigning properties to levels, and optimizing cube creation and query performance. It also supports multiple hierarchies and drill-through tables. The SAS OLAP Cube Studio is an alternative Java interface to the OLAP procedure. This interface is also integrated with SAS ETL Studio. Server performance is recorded and analyzed by using the Application Response Measurement (ARM) system. The new multi-threaded data storage and server functionality provide faster cube performance. The data can be stored in a multidimensional form (MOLAP) or in a form that includes existing aggregations from presummarized data sources. The metadata structure is improved, and metadata is stored with the cube. vi What’s New Caching and logging can be enabled or disabled. Support for ad hoc calculations and time dimensions is improved. An SQL Pass-Through Facility for OLAP is available in SAS for use in querying cubes. Aggregations can be added to or deleted from existing cubes. Note: Version 8 of the SAS OLAP Server can be used with SAS 9. For help, see “V8 SAS OLAP Server” in SAS System Help and Documentation. R 1 CHAPTER 1 OLAP Introduction and Overview What Is OLAP? 1 Data Storage and Access 1 Benefits of OLAP 2 What Is a Cube? 2 Understanding the Cube Structure 3 What Is SAS OLAP Server? 3 About Cube Metadata Storage 4 Why You Should Use Cubes 4 Cube Usage and Storage Space Reduction 4 Multi-Threading Capabilities 5 Easy Setup and Maintenance 5 Data Management: Choosing Your Own Tool 5 Analyzing Your Data 5 Data Preparation and Dimension Design 5 Aggregation Design 6 What Is OLAP? Online Analytical Processnding (OLAP) is a technology that is used to create decision support software. OLAP enables application users to quickly analyze information that has been summarized into multidimensional views and hierarchies. By summarizing predicted queries into multidimensional views prior to run time, OLAP tools provide the benefit of increased performance over traditional database access tools. Most of the resource-intensive calculation that is required to summarize the data is done before a query is submitted. Data Storage and Access Decision makers are asked to make timely and accurate decisions that are based on the past performance and behavior of an organization as well as on future trends and directives. To make effective business decisions, business analysts must have access to the data that their company generates and responds to. This access must include timely queries, summaries, and reviews of numerous levels and combinations of large, recurrent amounts of data. The information that business analysts review determines the quality of their decisions. Organizations usually have databases and data stores that maintain repeated and frequent business transaction data. This provides simple yet detailed storage and retrieval of specific data events. However, these data storage systems are not well suited for analytical summaries and queries that are typically generated by decision 2 Benefits of OLAP R Chapter 1 makers. For decision makers to reveal hidden trends, inconsistencies, and risks in a business, they must be able to maintain a certain degree of momentum when querying the data. An answer to one question usually leads to additional questions and review of the data. Simple data stores do not successfully support this type of querying. A second type of storage, the data warehouse, is better suited for this. Data is maintained and organized so that complicated queries and summaries can be run. OLAP further organizes and summarizes specific categories and subsets of data from the data warehouse. This results in a robust and detailed level of data storage with efficient and fast query returns. SAS OLAP cubes can be built from either partially or completely denormalized data warehouse tables. Stored, precalculated summarizations called aggregations, can be added to the cube to improve cube access performance. Aggregations can either be pre-built relational tables, or you can let the cube create its own optimized aggregates. Benefits of OLAP The ability to have coherent and relevant information is the reason OLAP has gained in popularity. OLAP systems help reveal evasive inconsistencies and trends in data that might not have been seen before. OLAP users can intuitively search data that has been consolidated and summarized within the OLAP structure. In addition, OLAP tools allow for tasks such as sales forecasting, asset analysis, resource planning, budgeting, and risk assessment. OLAP systems also provide the following benefits: fast access, calculations, and summaries of an organization’s data support for multiple user access and multiple queries the ability to handle multiple hierarchies and levels of data the ability to pre-summarize and consolidate data for faster query and reporting functions the ability to expand the number of dimensions and levels of data as a business grows. To fully understand the benefits of OLAP and the details of its effective implementation, it helps to examine the technology from two perspectives—first, from that of the users and second, from that of the information technology (IT) administrators who are responsible for OLAP implementation.
Recommended publications
  • Cubes Documentation Release 1.0.1
    Cubes Documentation Release 1.0.1 Stefan Urbanek April 07, 2015 Contents 1 Getting Started 3 1.1 Introduction.............................................3 1.2 Installation..............................................5 1.3 Tutorial................................................6 1.4 Credits................................................9 2 Data Modeling 11 2.1 Logical Model and Metadata..................................... 11 2.2 Schemas and Models......................................... 25 2.3 Localization............................................. 38 3 Aggregation, Slicing and Dicing 41 3.1 Slicing and Dicing.......................................... 41 3.2 Data Formatters........................................... 45 4 Analytical Workspace 47 4.1 Analytical Workspace........................................ 47 4.2 Authorization and Authentication.................................. 49 4.3 Configuration............................................. 50 5 Slicer Server and Tool 57 5.1 OLAP Server............................................. 57 5.2 Server Deployment.......................................... 70 5.3 slicer - Command Line Tool..................................... 71 6 Backends 77 6.1 SQL Backend............................................. 77 6.2 MongoDB Backend......................................... 89 6.3 Google Analytics Backend...................................... 90 6.4 Mixpanel Backend.......................................... 92 6.5 Slicer Server............................................. 94 7 Recipes 97 7.1 Recipes...............................................
    [Show full text]
  • Beyond Relational Databases
    EXPERT ANALYSIS BY MARCOS ALBE, SUPPORT ENGINEER, PERCONA Beyond Relational Databases: A Focus on Redis, MongoDB, and ClickHouse Many of us use and love relational databases… until we try and use them for purposes which aren’t their strong point. Queues, caches, catalogs, unstructured data, counters, and many other use cases, can be solved with relational databases, but are better served by alternative options. In this expert analysis, we examine the goals, pros and cons, and the good and bad use cases of the most popular alternatives on the market, and look into some modern open source implementations. Beyond Relational Databases Developers frequently choose the backend store for the applications they produce. Amidst dozens of options, buzzwords, industry preferences, and vendor offers, it’s not always easy to make the right choice… Even with a map! !# O# d# "# a# `# @R*7-# @94FA6)6 =F(*I-76#A4+)74/*2(:# ( JA$:+49>)# &-)6+16F-# (M#@E61>-#W6e6# &6EH#;)7-6<+# &6EH# J(7)(:X(78+# !"#$%&'( S-76I6)6#'4+)-:-7# A((E-N# ##@E61>-#;E678# ;)762(# .01.%2%+'.('.$%,3( @E61>-#;(F7# D((9F-#=F(*I## =(:c*-:)U@E61>-#W6e6# @F2+16F-# G*/(F-# @Q;# $%&## @R*7-## A6)6S(77-:)U@E61>-#@E-N# K4E-F4:-A%# A6)6E7(1# %49$:+49>)+# @E61>-#'*1-:-# @E61>-#;6<R6# L&H# A6)6#'68-# $%&#@:6F521+#M(7#@E61>-#;E678# .761F-#;)7-6<#LNEF(7-7# S-76I6)6#=F(*I# A6)6/7418+# @ !"#$%&'( ;H=JO# ;(\X67-#@D# M(7#J6I((E# .761F-#%49#A6)6#=F(*I# @ )*&+',"-.%/( S$%=.#;)7-6<%6+-# =F(*I-76# LF6+21+-671># ;G';)7-6<# LF6+21#[(*:I# @E61>-#;"# @E61>-#;)(7<# H618+E61-# *&'+,"#$%&'$#( .761F-#%49#A6)6#@EEF46:1-#
    [Show full text]
  • An Online Analytical Processing Multi-Dimensional Data Warehouse for Malaria Data S
    Database, 2017, 1–20 doi: 10.1093/database/bax073 Original article Original article An online analytical processing multi-dimensional data warehouse for malaria data S. M. Niaz Arifin1,*, Gregory R. Madey1, Alexander Vyushkov2, Benoit Raybaud3, Thomas R. Burkot4 and Frank H. Collins1,4,5 1Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, Indiana, USA, 2Center for Research Computing, University of Notre Dame, Notre Dame, Indiana, USA, 3Institute for Disease Modeling, Bellevue, Washington, USA, 4Australian Institute of Tropical Health and Medicine, James Cook University, Cairns, Queensland, Australia 5Department of Biological Sciences, University of Notre Dame, Notre Dame, Indiana, USA *Corresponding author: Tel: þ1 574 387 9404; Fax: 1 574 631 9260; Email: sarifi[email protected] Citation details: Arifin,S.M.N., Madey,G.R., Vyushkov,A. et al. An online analytical processing multi-dimensional data warehouse for malaria data. Database (2017) Vol. 2017: article ID bax073; doi:10.1093/database/bax073 Received 15 July 2016; Revised 21 August 2017; Accepted 22 August 2017 Abstract Malaria is a vector-borne disease that contributes substantially to the global burden of morbidity and mortality. The management of malaria-related data from heterogeneous, autonomous, and distributed data sources poses unique challenges and requirements. Although online data storage systems exist that address specific malaria-related issues, a globally integrated online resource to address different aspects of the disease does not exist. In this article, we describe the design, implementation, and applications of a multi- dimensional, online analytical processing data warehouse, named the VecNet Data Warehouse (VecNet-DW). It is the first online, globally-integrated platform that provides efficient search, retrieval and visualization of historical, predictive, and static malaria- related data, organized in data marts.
    [Show full text]
  • Marvin: a Toolkit for Streamlined Access and Visualization of the Sdss-Iv Manga Data Set
    Draft version December 11, 2018 Preprint typeset using LATEX style AASTeX6 v. 1.0 MARVIN: A TOOLKIT FOR STREAMLINED ACCESS AND VISUALIZATION OF THE SDSS-IV MANGA DATA SET Brian Cherinka1, Brett H. Andrews2, Jose´ Sanchez-Gallego´ 3, Joel Brownstein4, Mar´ıa Argudo-Fernandez´ 5,6, Michael Blanton7, Kevin Bundy8, Amy Jones13, Karen Masters10,11, David R. Law1, Kate Rowlands9, Anne-Marie Weijmans12, Kyle Westfall8, Renbin Yan14 1Space Telescope Science Institute, 3700 San Martin Drive, Baltimore, MD 21218, USA 2Department of Physics and Astronomy and PITT PACC, University of Pittsburgh, 3941 OHara Street, Pittsburgh, PA 15260, USA 3Department of Astronomy, Box 351580, University of Washington, Seattle, WA 98195, USA 4Department of Physics and Astronomy, University of Utah, 115 S 1400 E, Salt Lake City, UT 84112, USA 5Centro de Astronom´ıa(CITEVA), Universidad de Antofagasta, Avenida Angamos 601 Antofagasta, Chile 6Chinese Academy of Sciences South America Center for Astronomy, China-Chile Joint Center for Astronomy, Camino El Observatorio, 1515, Las Condes, Santiago, Chile 7Department of Physics, New York University, 726 Broadway, New York, NY 10003, USA 8University of California Observatories, University of California Santa Cruz, 1156 High Street, Santa Cruz, CA 95064, USA 9Department of Physics and Astronomy, Johns Hopkins University, 3400 N. Charles St., Baltimore, MD 21218, USA 10Department of Physics and Astronomy, Haverford College, 370 Lancaster Avenue, Haverford, Pennsylvania 19041, USA 11Institute of Cosmology & Gravitation, University
    [Show full text]
  • Benchmarking Distributed Data Warehouse Solutions for Storing Genomic Variant Information
    Research Collection Journal Article Benchmarking distributed data warehouse solutions for storing genomic variant information Author(s): Wiewiórka, Marek S.; Wysakowicz, David P.; Okoniewski, Michał J.; Gambin, Tomasz Publication Date: 2017-07-11 Permanent Link: https://doi.org/10.3929/ethz-b-000237893 Originally published in: Database 2017, http://doi.org/10.1093/database/bax049 Rights / License: Creative Commons Attribution 4.0 International This page was generated automatically upon download from the ETH Zurich Research Collection. For more information please consult the Terms of use. ETH Library Database, 2017, 1–16 doi: 10.1093/database/bax049 Original article Original article Benchmarking distributed data warehouse solutions for storing genomic variant information Marek S. Wiewiorka 1, Dawid P. Wysakowicz1, Michał J. Okoniewski2 and Tomasz Gambin1,3,* 1Institute of Computer Science, Warsaw University of Technology, Nowowiejska 15/19, Warsaw 00-665, Poland, 2Scientific IT Services, ETH Zurich, Weinbergstrasse 11, Zurich 8092, Switzerland and 3Department of Medical Genetics, Institute of Mother and Child, Kasprzaka 17a, Warsaw 01-211, Poland *Corresponding author: Tel.: þ48693175804; Fax: þ48222346091; Email: [email protected] Citation details: Wiewiorka,M.S., Wysakowicz,D.P., Okoniewski,M.J. et al. Benchmarking distributed data warehouse so- lutions for storing genomic variant information. Database (2017) Vol. 2017: article ID bax049; doi:10.1093/database/bax049 Received 15 September 2016; Revised 4 April 2017; Accepted 29 May 2017 Abstract Genomic-based personalized medicine encompasses storing, analysing and interpreting genomic variants as its central issues. At a time when thousands of patientss sequenced exomes and genomes are becoming available, there is a growing need for efficient data- base storage and querying.
    [Show full text]
  • Building an Effective Data Warehousing for Financial Sector
    Automatic Control and Information Sciences, 2017, Vol. 3, No. 1, 16-25 Available online at http://pubs.sciepub.com/acis/3/1/4 ©Science and Education Publishing DOI:10.12691/acis-3-1-4 Building an Effective Data Warehousing for Financial Sector José Ferreira1, Fernando Almeida2, José Monteiro1,* 1Higher Polytechnic Institute of Gaya, V.N.Gaia, Portugal 2Faculty of Engineering of Oporto University, INESC TEC, Porto, Portugal *Corresponding author: [email protected] Abstract This article presents the implementation process of a Data Warehouse and a multidimensional analysis of business data for a holding company in the financial sector. The goal is to create a business intelligence system that, in a simple, quick but also versatile way, allows the access to updated, aggregated, real and/or projected information, regarding bank account balances. The established system extracts and processes the operational database information which supports cash management information by using Integration Services and Analysis Services tools from Microsoft SQL Server. The end-user interface is a pivot table, properly arranged to explore the information available by the produced cube. The results have shown that the adoption of online analytical processing cubes offers better performance and provides a more automated and robust process to analyze current and provisional aggregated financial data balances compared to the current process based on static reports built from transactional databases. Keywords: data warehouse, OLAP cube, data analysis, information system, business intelligence, pivot tables Cite This Article: José Ferreira, Fernando Almeida, and José Monteiro, “Building an Effective Data Warehousing for Financial Sector.” Automatic Control and Information Sciences, vol.
    [Show full text]
  • What Is OLAP (Online Analytical Processing): Cube, Operations & Types What Is Online Analytical Processing?
    What is OLAP (Online Analytical Processing): Cube, Operations & Types What is Online Analytical Processing? OLAP is a category of software that allows users to analyze information from multiple database systems at the same time. It is a technology that enables analysts to extract and view business data from different points of view. OLAP stands for Online Analytical Processing. Analysts frequently need to group, aggregate and join data. These operations in relational databases are resource intensive. With OLAP data can be pre-calculated and pre-aggregated, making analysis faster. OLAP databases are divided into one or more cubes. The cubes are designed in such a way that creating and viewing reports become easy. OLAP cube: Ahmed Yasir Khan Page 1 of 12 At the core of the OLAP, concept is an OLAP Cube. The OLAP cube is a data structure optimized for very quick data analysis. The OLAP Cube consists of numeric facts called measures which are categorized by dimensions. OLAP Cube is also called the hypercube. Usually, data operations and analysis are performed using the simple spreadsheet, where data values are arranged in row and column format. This is ideal for two- dimensional data. However, OLAP contains multidimensional data, with data usually obtained from a different and unrelated source. Using a spreadsheet is not an optimal option. The cube can store and analyze multidimensional data in a logical and orderly manner. How does it work? A Data warehouse would extract information from multiple data sources and formats like text files, excel sheet, multimedia files, etc. The extracted data is cleaned and transformed.
    [Show full text]
  • Improving Traveling Habits Using an OLAP Cube
    Improving traveling habits using an OLAP cube Development of a business intelligence system Marcus Hellman Marcus Hellman Spring 2016 Master Thesis in Computing Science, 30 hp Supervisor: Anders Broberg Extern Supervisor: Kim Nilsson Examiner: Henrik Bjorklund¨ Umea˚ University, Department of Computing Science Abstract The aim of this thesis is to improve the traveling habits of clients using the SpaceTime system when arranging their travels. The improvement of traveling habits refers to lowering costs and emissions generated by the travels. To do this, a business intelligence system, including an OLAP cube, were created to provide the clients with feedback on how they travel. This to make it possible to see if they are improving and how much they have saved, both in money and emissions. Since these kind of systems often are quite complex, studies on best practices and how to keep such systems agile were performed to be able to provide a system of high quality. During this project, it was found that the pre-study and design phase were just as challenging as the creation of the designed components. The result of this project was a business intelligence system, including ETL, a Data warehouse, and an OLAP cube that will be used in the SpaceTime system as well as mock-ups presenting how data from the OLAP cube could be presented in the SpaceTime web-application in the future. Acknowledgements First, I would like to thank SpaceTime and Dohi for giving me the opportunity to do this the- sis. Also, I would like to thank my supervisors Anders Broberg and Kim Nilsson for all the help during this thesis project, providing valuable feedback on my thesis report throughout the project as well as guiding me through the introduction and development of the system.
    [Show full text]
  • Constructing OLAP Cubes Based on Queries
    Constructing OLAP Cubes Based on Queries Tapio Niemi Jyrki Nummenmaa Peter Thanisch Department of Computer and Department of Computer and Department of Computer Science, Information Sciences Information Sciences University of Edinburgh FIN-33014 University of Tampere FIN-33014 University of Tampere Edinburgh, EH9 3JZ Finland Finland Scotland +358 32156595 +358 405277999 +44 7968401525 [email protected] [email protected] [email protected] ABSTRACT The design of a cube is based on knowledge of the application area and the types of queries the users are expected to pose. An On-Line Analytical Processing (OLAP) user often follows a Since constructing an OLAP cube can be a difficult task for the train of thought, posing a sequence of related queries against the end user, it is often seen as the duty of the data warehouse data warehouse. Although their details are not known in advance, administrator. This has led to researchers and vendors regarding the general form of those queries is apparent beforehand. Thus, the OLAP cube as a static storage structure for data warehouse the user can outline the relevant portion of the data posing data. This is problematic since the user often wants to make new generalised queries against a cube representing the data kinds of queries, which may also need new OLAP cubes. The warehouse. same cube is not always practical for different analysis tasks, Since existing OLAP design methods are not suitable for non- since the structure of the cube has a notable effect on efficiency professionals, we present a technique that automates cube design and ease of posing queries.
    [Show full text]
  • Understanding an OLAP Solution from Oracle
    Understanding an OLAP Solution from Oracle An Oracle White Paper April 2008 Understanding an OLAP Solution from Oracle NOTE: The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions. The development, release, and timing of any features or functionality described for Oracle’s products remains at the sole discretion of Oracle. Understanding an OLAP Solution from Oracle Page 2 Table of Contents Introduction ....................................................................................................... 4 Similarities between the Oracle Database OLAP Option and Oracle’s Hyperion Essbase ......................................................................................... 5 Commitment to Product Development .................................................... 5 Overview............................................................................................................. 5 Architectural Heritage .................................................................................. 5 Oracle’s Hyperion Essbase: Middle Tier OLAP...................................... 5 Oracle Database OLAP Option: Database-Centric OLAP.................... 6 Architecting the Appropriate Oracle OLAP Solution................................. 6 The Purpose..................................................................................................
    [Show full text]
  • Data Intensive Computing Systems
    CompSci 516 Data Intensive Computing Systems Lecture 18 NoSQL and Column Store Instructor: Sudeepa Roy CompSci 516: Data Intensive Computing Duke CS, Fall 2016 1 Systems Announcements • HW3 (last HW) has been posted on Sakai • Same problems as in HW1 but in MongoDB (NOSQL) • Due in two weeks after today’s lecture (~11/16) Duke CS, Fall 2016 CompSci 516: Data Intensive Computing Systems 2 Reading Material NOSQL: • “Scalable SQL and NoSQL Data Stores” Rick Cattell, SIGMOD Record, December 2010 (Vol. 39, No. 4) • see webpage http://cattell.net/datastores/ for updates and more pointers Column Store: • D. Abadi, P. Boncz, S. Harizopoulos, S. Idreos and S. Madden. The Design and Implementation of Modern Column-Oriented Database Systems. Foundations and Trends in Databases, vol. 5, no. 3, pp. 197–280, 2012. • See VLDB 2009 tutorial: http://nms.csail.mit.edu/~stavros/pubs/tutorial2009- column_stores.pdf Optional: • “Dynamo: Amazon’s Highly Available Key-value Store” By Giuseppe DeCandia et. al. SOSP 2007 • “Bigtable: A Distributed Storage System for Structured Data” Fay Chang et. al. OSDI 2006 Duke CS, Fall 2016 CompSci 516: Data Intensive Computing Systems 3 NoSQL Duke CS, Fall 2016 CompSci 516: Data Intensive Computing Systems 4 Duke CS, Fall 2016 CompSci 516: Data Intensive Computing Systems 5 So far -- RDBMS • Relational Data Model • Relational Database Systems (RDBMS) • RDBMSs have – a complete pre-defined fixed schema – a SQL interface – and ACID transactions Duke CS, Fall 2016 CompSci 516: Data Intensive Computing Systems 6 Today • NoSQL: ”new” database systems – not typically RDBMS – relax on some requirements, gain efficiency and scalability • New systems choose to use/not use several concepts we learnt so far – e.g.
    [Show full text]
  • Query Optimizing for On-Line Analytical Processing Adventures in the Land of Heuristics
    Aalto University School of Science Master's Programme in Computer, Communication and Information Sciences Jonas Berg Query Optimizing for On-line Analytical Processing Adventures in the land of heuristics Master's Thesis Espoo, May 22, 2017 Supervisor: Professor Eljas Soisalon-Soininen Advisors: Jarkko Miettinen M.Sc. (Tech.) Marko Nikula M.Sc. (Tech) Aalto University School of Science Master's Programme in Computer, Communication and In- ABSTRACT OF formation Sciences MASTER'S THESIS Author: Jonas Berg Title: Query Optimizing for On-line Analytical Processing { Adventures in the land of heuristics Date: May 22, 2017 Pages: vii + 71 Major: Computer Science Code: SCI3042 Supervisor: Professor Eljas Soisalon-Soininen Advisors: Jarkko Miettinen M.Sc. (Tech.) Marko Nikula M.Sc. (Tech) Newer database technologies, such as in-memory databases, have largely forgone query optimization. In this thesis, we presented a use case for query optimization for an in-memory column-store database management system used for both on- line analytical processing and on-line transaction processing. To date, the system in question has used a na¨ıve query optimizer for deciding on join order. We went through related literature on the history and evolution of database technology, focusing on query optimization. Based on this, we analyzed the current system and presented improvements for its query processing abilities. We implemented a new query optimizer and experimented with it, seeing how it performed on several queries, concluding that it is a successful improvement
    [Show full text]