IBM Data Warehousing. with IBM Business .Pdf
Total Page:16
File Type:pdf, Size:1020Kb
IBM® Data Warehousing with IBM Business Intelligence Tools Michael L. Gonzales The WILEY Dear Valued Customer, advantage We realize you’re a busy professional with deadlines to hit. Whether your goal is to learn a new technology or solve a critical problem, we want to be there to lend you a hand. Our primary objective is to provide you with the insight and knowledge you need to stay atop the highly competitive and ever- changing technology industry. Wiley Publishing, Inc., offers books on a wide variety of technical categories, including security, data warehousing, software development tools, and networking — everything you need to reach your peak. Regardless of your level of expertise, the Wiley family of books has you covered. • For Dummies – The fun and easy way to learn • The Weekend Crash Course –The fastest way to learn a new tool or technology • Visual – For those who prefer to learn a new topic visually • The Bible – The 100% comprehensive tutorial and reference • The Wiley Professional list – Practical and reliable resources for IT professionals The book you now hold, IBM Data Warehousing: With IBM Business Intelligence Tools, is the first comprehensive guide to the complete suite of IBM tools for data warehousing. Written by a leading expert, with contributions from key members of the IBM development teams that built these tools, the book is filled with detailed examples, as well as tips, tricks and workarounds for ensuring maximum performance. You can be assured that this is the most complete and authoritative guide to IBM data warehousing. Our commitment to you does not end at the last page of this book. We’d want to open a dialog with you to see what other solutions we can provide. Please be sure to visit us at www.wiley.com/compbooks to review our complete title list and explore the other resources we offer. If you have a comment, suggestion, or any other inquiry, please locate the “contact us” link at www.wiley.com. Finally, we encourage you to review the following page for a list of Wiley titles on related topics. Thank you for your support and we look forward to hearing from you and serving your needs again in the future. Sincerely, Richard K. Swadley Vice President & Executive Group Publisher Wiley Technology Publishing more information on related titles The Next Step in Data Warehousing Available from Wiley Publishing 0471384291 Create more 0471202436 powerful, flexible The official guide, data sharing written by the applications using a authors of the new XML-based Common Warehouse standard Metamodel INTERMEDIATE/ADVANCED 0471200522 An introduction to 0471219711 the standard for data The comprehensive warehouse guide to implement- integration ing SAP BW BEGINNER Available at your favorite bookseller or visit www.wiley.com/compbooks Advance Praise for IBM Data Warehousing “This book delivers both depth and breadth, a highly unusual combination in the business intelligence field. It not only describes the intricacies of var- ious IBM products, such as IBM DB2, IBM Intelligent Miner, and IBM DB2 OLAP, but it also sets the context for these products by providing a com- prehensive overview of data warehousing architecture, analytics, and data management.” Wayne Eckerson Director of Research, The Data Warehousing Institute “Organizations today are faced with a ‘data deluge’ about customers, sup- pliers, partners, employees and competitors. To survive and to prosper requires an increasing commitment to information management solutions. Michael Gonzales’ book provides an outstanding look at business intelli- gence software from IBM that can help companies excel through quicker, better-informed business decisions. In addition to a comprehensive explo- ration of IBM’s data warehouse, OLAP, data mining and spatial analysis capabilities, Michael clearly explains the organizational and data architec- ture underpinnings necessary for success in this information-intensive age.” Jeff Jones Senior Program Manager, IBM Data Management Solutions “IBM leads the way in delivering integrated, easy-to-use data warehous- ing, analysis and data management technology. This book delivers what every data warehousing professional needs most: a thorough overview of business intelligence fundamentals followed by solid practical advice on using IBM’s rich product suite to build, maintain and mine data warehouses.” Thomas W. Rosamilia Vice President, IBM Data Management (DB2) Worldwide Development IBM® Data Warehousing with IBM Business Intelligence Tools Michael L. Gonzales Publisher: Joe Wikert Executive Editor: Robert M. Elliott Assistant Developmental Editor: Emilie Herman Managing Editor: Micheline Frederick Media Development Specialist: Travis Silvers Text Design & Composition: Wiley Composition Services Designations used by companies to distinguish their products are often claimed as trade- marks. In all instances where Wiley Publishing, Inc., is aware of a claim, the product names appear in initial capital or ALL CAPITAL LETTERS. Readers, however, should contact the appro- priate companies for more complete information regarding trademarks and registration. This book is printed on acid-free paper. ∞ Copyright © 2003 by Michael L. Gonzales. Copyright © 2003 IBM. Some text and illustrations. Published by Wiley Publishing, Inc., Indianapolis, Indiana Published simultaneously in Canada No part of this publication may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, electronic, mechanical, photocopying, recording, scanning, or otherwise, except as permitted under Section 107 or 108 of the 1976 United States Copyright Act, without either the prior written permission of the Publisher, or authorization through payment of the appropriate per-copy fee to the Copyright Clearance Center, Inc., 222 Rose- wood Drive, Danvers, MA 01923, (978) 750-8400, fax (978) 750-4470. Requests to the Pub- lisher for permission should be addressed to the Legal Department, Wiley Publishing, Inc., 10475 Crosspointe Blvd., Indianapolis, IN 46256, (317) 572-3447, fax (317) 572-4447, E-mail: [email protected]. Limit of Liability/Disclaimer of Warranty: While the publisher and author have used their best efforts in preparing this book, they make no representations or warranties with respect to the accuracy or completeness of the contents of this book and specifically disclaim any implied warranties of merchantability or fitness for a particular purpose. No warranty may be created or extended by sales representatives or written sales materials. The advice and strategies contained herein may not be suitable for your situation. You should consult with a professional where appropriate. Neither the publisher nor author shall be liable for any loss of profit or any other commercial damages, including but not limited to special, inci- dental, consequential, or other damages. For general information on our other products and services please contact our Customer Care Department within the United States at (800) 762-2974, outside the United States at (317) 572-3993 or fax (317) 572-4002. Wiley also publishes its books in a variety of electronic formats. Some content that appears in print may not be available in electronic books. Library of Congress Cataloging-in-Publication Data: ISBN: 0-471-13305-1 Printed in the United States of America 10 9 8 7 6 5 4 3 2 1 To AMG2. Contents Acknowledgments xx Introduction xxiii Part One Fundamentals of Business Intelligence and the Data Warehouse 1 Chapter 1 Overview of the BI Organization 3 Overview of the BI Organization Architecture 4 Providing Information Content 10 Planning for Information Content 10 Designing for Information Content 13 Implementing Information Content 15 Justifying Your BI Effort 18 Linking Your Project to Known Business Requirements 18 Measuring ROI 18 Applying ROI 19 Questions for ROI Benefits 21 Making the Most of the First Iteration of the Warehouse 22 IBM and The BI Organization 22 Seamless Integration 23 Data Mining 24 Online Analytic Processing 24 Spatial Analysis 25 Database-Resident Tools 25 Simplified Data Delivery System 26 Zero-Latency 27 Summary 28 vii viii Contents Chapter 2 Business Intelligence Fundamentals 29 BI Components and Technologies 31 Business Intelligence Components 31 Data Warehouse 31 Data Sources 32 Data Targets 32 Warehouse Components 36 Extraction, Transformation, and Loading 37 Extraction 38 Transformation/Cleansing 39 Data Refining 39 Data Management 40 Data Access 40 Meta Data 41 Analytical User Requirements 42 Reporting and Querying 43 Online Analytical Processing 43 Multidimensional Views 44 Calculation-Intensive Capabilities 45 Time Intelligence 45 Statistics 46 Data Mining 46 Dimensional Technology and BI 47 The OLAP Server 48 MOLAP 49 ROLAP 50 Defining the Dimensional Spectrum 50 Touch Points 52 Zero-Latency and Your Warehouse Environment 53 Closed-Loop Learning 53 Historical Integrity 54 Summary 58 Chapter 3 Planning Data Warehouse Iterations 59 Planning Any Iteration 61 Building Your BI Plan 62 Enterprise Strategy 63 Designing the Technical Architecture 64 Designing the Data Architecture 66 Implementing and Maintaining the Warehouse 69 Planning the First Iteration 70 Aligning the Warehouse with Corporate Strategy 71 Conducting a Readiness Assessment 71 Resource Planning 74 Identifying Opportunities with the DIF Matrix1 77 Determining the Right Approach 78 Applying the DIF Matrix 78 Antecedent Documentation and Known Problems 80 Contents ix IT JAD Sessions 80 Select Candidate Iteration Opportunities 80 Get IT Scores 81