Index

Symbols ADO.NET, 137 <> (angle brackets), SSRS mock-ups, 345 SSIS, 195 <<>> (angle brackets-double), SSRS SSRS, 330 mock-ups, 345 Adventure Works Cycles, 6 \\ \\ (back slashes), SSRS mock-ups, 345 bus matrix, 21 – 22, 36 – 37 { } (curly brackets), SSRS mock-ups, 345 , 472 – 488 ( ) (parentheses), SSRS mock-ups, 345 dimensions, 69 – 72 + (plus sign), ETL schematics, 191 attributes, 72 – 74 [ ] (square brackets), SSRS mock-ups, 345 dimensional model, 69 – 78 SCDs, 75 – 76 A enterprise-level business requirements accumulating snapshots, 52 – 53 documentation, 19 – 20 ETL facts, 72 – 74, 76 – 77 fact providers, 238 interview documentation, 16 , 229 preparation, 12 – 13 partitions, 229 prioritization grid, 23 SSAS cubes, 229 project planning, 26 – 27 actions, 288 – 289 SharePoint BI portal, 405 – 406 Active Directory, 185, 530 SSRS template, 353 , 535 subcategory tables, 50 SharePoint affinity grouping, 436 BI portal, 419 aggregates security, 396 dimensional model, 53 – 54 SSAS, 506 ETL, 190 Windows Integrated Security, 496 fact providers, 238 – 239 Activity Monitor, 588 – 590 fact table, 239 Activity Viewer, 593 MDX, 53 ad hoc reporting, 369 – 372 OLAP, 53 Excel, 377 SSAS, 53, 239 PivotTable, 377 tables, relational , 150 – 151 Add Wizard, 268 usage complexity, 103 additivity, 32


640388bindex.indd 625 1/31/11 6:44:40 PM 626 Index n A–B

aggregate dimensions ASP.NET, 131 dimensional model, 49 – 51 association, 436, 449 ETL, 50 atomic level, 33 SSAS, 51 attributes, 72 – 74 Aggregation Design Wizard, 297 – 298 conformed, 169 aggregations degenerate dimensions, 43 cubes, 292 dimensions, 34 performance Adventure Works Cycles, 72 – 74 real-time, 310 – 311 dimensional model, 65 – 66 SSAS OLAP, 296 – 298 surrogate keys, 39 query performance, 248 – 249 domains, 182 SSAS OLAP, 248 – 249, 296 – 298 ETL dimension manager, 238 agile software development, 5 freeform, 183 algorithms, 432 incomplete, 167 – 168 business tasks, 438 – 439 junk dimensions, 51, 224 classification, 434 MDS, 179 clustering, 436 – 437 properties, 266 – 267 data mining, 445 – 450 SCDs, 40 estimation, 435 SSAS OLAP, 262 – 263, 266 – 267, 270 – 273 hashing, 499 standard dimensions, 258 – 259 All(), 388 – 389 audit columns allocations, 76 ETL, 200 ALTER , 155 source systems, 200 ALTER PARTITION FUNCTION, 162 audit dimension, 141 alternate access mapping, 415 ETL, 215 – 216 Analysis Management Objects (AMO), 528 FK, 215 data mining, 443 – 444 master packages, 215 .NET, 532 SSIS packages, 215 SSAS, 569 audit keys, 215 Analysis Services. See SQL Server Analysis authentication Services Kerberos, 505 analytics SharePoint BI portal, 420 application developer, 120 SQL Server, 504 bus matrix, 21 Windows Integrated Security, 496 business requirements, 15 mashup, 381 B PowerPivot, 385 – 387, 399 backups SSRS, 326 compression, 145 announcements, 408 deployment, 565 anomaly detection, 438 ETL, 240 architecture planning, 606 – 610 data mining, 445 relational databases, 606 – 608 dimensional model, 58 SSAS, 569, 609 – 610 NUMA, 109 SSIS, 608 – 609 PowerPivot, 378 – 380 SSRS, 610 SharePoint BI portal, 412 – 416 Bayesian method, 445 SQL Server data mining, 440 – 445 BCG. See Boston Consulting Group SSIS packages, 197 – 198 Berry, Michael J.A., 433, 450 SSRS, 330 – 332 BETWEEN, 140, 237 archiving BETWEEN RowStartDate and , 203 RowEndDate, 237 ETL, 190


BI applications Browser, Dimension Designer, 273 – 274 Business Dimensional Lifecycle, 617 bubble charts, 65 extending, 586 – 587 bulk loads, 204 – 206 SSRS, 323 – 373 bus matrix, 18 value, 326 – 328 Adventure Works Cycles, 21 – 22, 36 – 37 BI portal analytics, 21 announcements, 408 business processes, 20 – 21 architecture, 412 – 416 dimensional model, 36 – 38 building, 409 – 411 enterprise-level business requirements, 38 Business Dimensional Lifecycle, 617 Business Dimensional Lifecycle, 136, 584 business processes, 407 – 408 BI applications, 617 calendars, 408 BI portal, 617 completing, 424 – 425 business requirements, 616 feedback, 409 databases, 616 – 617 forum, 408 deployment, 618 hierarchies, 405 phases, 615 – 619 HTML, 411 problems, 615 – 619 maintenance, 585 – 586 SSAS, 245 – 246 metadata, 408 Business Intelligence Development Studio personalization, 408 (BIDS), 79, 81, 95 – 97 planning, 405 – 411 BIDS Helper, 193, 528 – 529, 548 search, 408 SSAS, 570 SharePoint, 403 – 427 hierarchies, 95 Active Directory, 419 Preview tab, 356 announcements, 408 Report Designer, 130, 333 architecture, 412 – 416 SSAS, 95, 117, 253 authentication, 420 SSIS, 95, 611 building, 409 – 411 SSRS, 95, 356 business processes, 407 – 408 Visual Studio, 88, 95, 549 calendars, 408 Business Intelligence Wizard, 286 – 287 feedback, 409 business keys, 39 forum, 408 business metadata, 525 hierarchies, 405 Business Objects, 501 HTML, 411 business phase, data mining, 451 – 453 metadata, 408 business processes, 9 – 10 personalization, 408 bridge tables, 47 planning, 405 – 411, 419 – 420 bus matrix, 20 – 21 product keys, 419 business requirements document, 18 – 19 search, 408 dimensional model, 29 – 78 SharePoint, 403 – 427 interviews, 10, 15, 17 templates, 425 – 426 Kimball Lifecycle, 18 testing, 421 – 426 prioritization, 23, 25 versions, 419 SharePoint BI portal, 407 – 408 BIDS. 
See Business Intelligence summary, 18 Development Studio business requirements, 3 – 28 BIDS Helper, 193, 528 – 529, 548 analytics, 15 SSAS, 570 Business Dimensional Lifecycle, 616 Boston Consulting Group (BCG), 24 data profiling, 16 bridge tables, 45 dimensional model, 56 – 57 business processes, 47 documentation, 18 – 22 dimensions, 46 enterprise-level, 8 – 22 multi-valued dimensions, ETL, 235 ETL, 189 SSIS, 235 executive , 19


group sessions, 27 classification interviews, 13 – 15, 17 algorithm, 44 Kimball Lifecycle, 4 data mining, 434 prioritization, 22 – 25 classification matrix, 458 – 459 project planning, 25 – 28 Clay, Ryan, 514 scorecards, 19 cleaning. See data cleaning sponsorship, 7 – 8 closed loop applications, 587 SSRS, 328 – 330 SSRS, 326 strategic goals, 14 clustering, 436 – 437, 448 value, 5 – 22 COBOL, 202 business review, data mining, 460 Cognos, 501 business rules columns. See also specific column types error tables, 214 DSV, 256 screens, 207 error tables, 214 business task summary, 438 – 439 extended properties, 142 bXtrctOK, 213 PivotTable databases, 390 – 391 PK, 223 C relational databases, 137 cache renaming, 210 PowerPivot, 396 sorting, dimension tables, 141 proactive, 292, 319 column screens, 206 uncached lookups, 237 transforms, 207 CALCULATE(), 387 – 389 Command Line Actions, 289 calculations Command transform, OLE DB, 228, 234 cubes, 282 – 286 Common Warehouse Metamodel MDX, 284 – 286 (CWM), 527 PivotTable, 386 – 387 compliance, 189, 243 PowerPivot, 386 – 387 compression. See data compression SSAS OLAP, 248 computed columns, PivotTable Calculations tab, Cube Designer, 283 – 284 databases, 390 – 391 calendars, 408 concatenated keys, 269 Cascading Lookups conditional formatting derived column transforms, 237 Excel, 397 late arriving data handler, 236 SSRS, 330, 365 surrogate key pipeline, 230 – 232, 237 – 238 The Conditions of Learning and Theory of case, 432 Instruction (Gagné), 578 case sets, 432, 455 – 456 Configuration Manager cast, 211 SQL Server, 123 CDC. See change data capture SSRS, 131 cell security, 512 – 513 conformed attributes, 169 Census Bureau, 40, 381 conformed dimensions, 21, 33 Central Administration, 398, 423 dimensional model, 36 – 38 change data capture (CDC) master data, 169 ETL, 200 – 202 shrunken dimensions, 225 replication, 201 conformed facts, 77 Change Tracking, 136, 198, 387 conforming. 
See data conforming ETL, 201 consolidated requirements, 18 char, 139 constraints, relational databases, 142 – 153 CheckSignatureOnLoad, 520 Content Manager, 503 child packages, 198 continuous variables, 432 – 433 audit keys, 215 control flow precedence arrows, 197 CoSort, 242


COUNT(), 387 halting package execution, 211 – 214 COUNTA(), 387 nulls, 207 COUNTROWS(), 387 SSIS transforms, 211 CRC. See cyclic redundancy checksum surrogate keys, 230 CREATE, 609 data compression CREATE PARTITION FUNCTION, 155 – 156 backups, 145 CREATE TABLE, 152 – 153 pages, 144 – 145 credentials, 504 relational databases, 144 – 145 CRISP. See Cross Industry Standard Process rows, 144 for Data Mining SQL Server, 144 Cross Industry Standard Process for Data data conforming Mining (CRISP), 450 data mining, 453 – 454 Cross Validation, 459 dimensions, 217 CSV, 331, 338 drill across, 217 – 218 cubes ETL, 217 – 218 aggregations, 292 fact table, 230 calculations, 282 – 286 SSIS packages, 217 dimensions, 278 surrogate keys, 230 ETL, 190 (DDL), 137 OLAP, 239 indexes, 153 measures, 359 permissions, 518 OLAP data destinations, 196 ETL, 239 data extensions, 330 SSAS, 250 – 252, 262, 274 – 291, 299 data extraction PowerPivot, 396 archiving, 203 properties, 278 – 279 data flow, 203 SSAS, 507 ETL, 199 – 206 accumulating snapshots, 229 packages, 202 OLAP, 250 – 252, 262, 274 – 291, 299 push model, 202 Cube Designer, 275 – 276 transforms, 202 Calculations tab, 283 – 284 data flow Partitions and Aggregations tab, 289 data cleaning, 207 – 211 Cube Wizard, 262, 275 data destinations, 196 customer segmentation, 436 – 437 data extraction, 203 CWM. See Common Warehouse Metamodel data sources, 196 cyclic redundancy checksum (CRC), 202 halting package execution, 212 packages, 212 D pipelines, 195– 197 dashboards ETL, 197 Excel, 377 quality screens, 206 – 207 executive, 19 SSIS performance, 197 Management Dashboard, 379, 397 – 398 pseudo-partitions, 227 SSRS, 326 SSIS, 195 – 196 Data Analysis Expressions (DAX), 385 transforms, 196 data auditing. See data profiling SSIS, 207, 216 Data Center Edition, 90 – 92 data governance, 167 data cleaning data integration, 623. 
See also SQL Server data flows, 207 – 211 Integration Services data mining, 453 – 454 master data, 168 – 170 data type conversions, 207 data latency, 359 – 361, 444 database joins, 234 ETL, 190 ETL, 206 – 214 MOLAP, 293 fact table, 230 real-time, 305 – 319


data loading. See also extract, transform, Data Mining Extensions (DMX), 94, 443, 590 and load Data Mining Techniques: For Marketing, Sales, error flows, 204 and Customer Relationship Management ETL, 204 – 206 (Berry and Linoff), 433, 450 data mining, 429 – 489 Data Mining Wizard, 440 Adventure Works Cycles, 472 – 488 data modelers, 55 – 56 affinity grouping, 436 data pipeline, 110 – 111 algorithms, 445 – 450 data presentation, 218 – 240 AMO, 443, 444 data profiling, 10 anomaly detection, 438 business requirements, 16 association, 436, 449 data sources, 14 business phase, 451 – 453 dimensional model, 60 – 63 business review, 460 ETL, 199 business task summary, 438 – 439 project planning, 27 business uses, 433 – 439 source systems, 16 case study, 472 – 488 SQL Server Profiler, 528 – 529, 594 classification, 434 SSIS, 199 classification matrix, 458 – 459 SSRS, 61 clustering, 436 – 437, 448 data propagation manager, 240 data cleaning, 453 – 454 data quality data conforming, 453 – 454 ETL, 189 data mining phase, 453 – 460 testing, 557 – 558 defined, 430 – 432 data security, 498 – 502 description and profiling, 438 data sensitivity directed, 431 – 432 hashing algorithm, 499 environment, 442 itemizing, 500 examples, 463 – 472 Data Source View (DSV) Excel, 443 Mining Structure, 442 exploratory, 431 SSAS OLAP, 255 – 257 extensibility, 444 data sources external variables, 454 – 455 data flow, 196 metadata, 462 – 463 data profiling, 14 model, 456 – 460 lookups, 231 model developer, 120 PowerPivot, 395 – 396 operations phase, 460 – 461 SSAS, 363 opportunity, 452 – 453 SSRS, 357 – 361 phase, 453 – 460 data storage prediction, 435 – 436 relational databases, 142 – 153 preparation, 453 SAN, 112 – 113 process, 450 – 463 SSAS, 127 – 128, 292 – 293 regression, 434 – 435 SSIS, 130 sequence clustering, 448 system setup, 110 – 113 SQL Server, 440 – 445 data track, 29, 187, 245 SSAS, 80, 92, 443, 457 data types SSIS, 443 – 444 casting, 210 SSRS, 325 conversions, 207 stored procedures, 
444 error tables, 214 terminology, 432 – 433 data validation usage complexity, 103 BI applications, 327 user-defined functions, 444 dimensional model, 68 validation, 457 data volume, 101 – 102 Data Mining Designer, 442, 457 Database Diagram, 137


databases. See also relational databases deployment, 545 – 582 Business Dimensional Lifecycle, 616 – 617 backups, 565 engines, 196 – 197 Business Dimensional Lifecycle, 618 in-memory, column-store, 378 desktop readiness, 580 – 581 joins documentation, 573 – 576 data cleaning, 234 environment, 546 – 549 late arriving data handler, 237 Kimball Lifecycle, 545 surrogate key pipeline, 233 – 234 MDS, 572 PivotTable columns, 390 – 391 relational databases, 565 – 567 PowerPivot, 380 – 392 SSAS, 568 – 570 recovery model, SQL Server, 125 – 126 SSIS packages, 567 – 568 SCD, 222 SSRS, 350 – 351, 571 – 572 server, 121 testing, 550 – 564 SharePoint, 414 training, 576 – 579 SSAS OLAP, 254 – 255 user support, 579 structures, MDS, 173 Visual Studio, 566 training, 578 Deployment Wizard, 567, 569 datasets derived columns, 76 Report Builder, 370 transforms, Cascading Lookups, 237 SSRS, 355, 357 – 361 derived facts, 32 Date, 44 Description, 142 date and time dimensions, 224 design surfaces, 194 – 197 DateCalendarYear, 361 Design tab, Report Designer, 362 dates Destination Editor, 206 dimensional model, 42 – 43 Developer Edition, 90 – 92 PowerPivot, 392 development environment. See surrogate keys, 42 – 43 environment DAX. See Data Analysis Expressions developmental testing, 551 – 554 DDL. See data definition language deviation, 433 debriefing, 14 – 15 dictionary compression, 144 – 145 Decision Trees, 434, 446 – 447, 458 dimensions. 
See also specific dimension types deduplication, 216 Adventure Works Cycles, 69 – 72 DefaultBufferMaxRows, 242 attributes, 72 – 74 DefaultBufferSize, 242 attributes, 34 definition step, 10 – 11 Adventure Works Cycles, 72 – 74 degenerate dimensions dimensional model, 65 – 66 Adventure Works Cycles dimensional real-time, 310 model, 77 surrogate keys, 39 attributes, 43 bridge tables, 46 dimensional model, 43 cubes, 278 DELETE FROM, 161 data conforming, 217 deletes, 200, 303 date and time, 224 denormalization denormalization, 34 dimensions, 34 dimensional model, 33 – 34 ETL, 44 ETL, 218 – 222 SSAS OLAP standard dimensions, 258 facts dependencies Analysis Services, 43 dimension tables, 197 dimensional model, 34 – 38 ETL, 242 RI, 229 Dependency Network, 483 – 484 fact table, 45 – 46 dependent variables, 432 FK, 36


hierarchies, 34, 266, 562 drill across, 36 – 38 indexes, 147 – 149 environment, 58 – 59 junk, 224 ETL, 29 lookups, 225 facts, 34 – 38 many-to-many, 46 – 47 measures, 65 – 66 master data, 165 – 166 fact table, 31 – 33, 52 – 53 groups, 281 hierarchies, 47 – 49 normalization, 34 junk dimensions, 51 – 52 occurrences, 33 – 34 Kimball Lifecycle, 29 – 30 page compression, 145 many-to-many, 44 – 47 PK, 35 naming conventions, 59 – 60 properties, 265 – 266 participants, 55 – 56 re-using, 36 PowerPivot, 392 security, 507 – 511 preparation, 55 – 60 SSAS OLAP, 261 – 274 process, 54 – 68 surrogate keys, 38 review, 68 Time and Account, 267 – 268 RI, 230 updates, 219 roles, 55 – 56 late arriving data handler, 236 SCDs, 39 – 41 SSAS OLAP, 302 – 303 simplicity, 30 – 31 Dimension Designer, 264, 273 – 274 snowflaking, 43 – 44 dimension manager, 238 source systems, 60 – 63 dimension members, 235 – 236 surrogate keys, 38 – 39 dimension tables testing, 68 dependencies, 197 validation, 68 DSV, 256 directed data mining, 431 – 432 ETL, 218 discrete variables, 432 – 433 fact table, FK, 145 – 146 disk drive. See also Redundant Array of partitions, 154 Independent Disks PK, 138 performance, 111 RI, 229 space, 602 – 603 sorting columns, 141 DISTINCT(), 387 updates, ETL fact providers, 239 Distinct, 61 Dimension Usage, 279 – 280 DMX. See Data Mining Extensions dimensional model DNS. 
See Domain Name Server Adventure Works Cycles, 69 – 78 documentation SCDs, 75 – 76 business requirements, 18 – 22 aggregates, 53 – 54 deployment, 573 – 576 aggregate dimensions, 49 – 51 enterprise-level business architecture, 58 requirements, 10 – 16 building, 63 – 66 Adventure Works Cycles, 19 – 20 bus matrix, 36 – 38 interviews, 14 – 15 business processes, 29 – 78 SSRS, 345 – 347 business requirements, 56 – 57 Domain Name Server (DNS), 415 concepts, 30 – 54 domains, attributes, 182 conformed dimensions, 36 – 38 double-hop problem, 420 data profiling, 60 – 63 DQ Continue, 213 dates, 42 – 43 drill across degenerate dimensions, 43 data conforming, 217 – 218 detail development, 66 – 67 dimensional model, 36 – 38 dimensions, 33 – 38 fact table, 36 dimensions, attributes, 65 – 66 SELECT, 217 – 218


drill-down, 345, 574 events DSV. See Data Source View error event schema, ETL, 214 – 215 Dynamic Management Views, 529, 589 source systems, 32 dynamic security, 513 – 514 Excel, 81 ad hoc reporting, 377 E conditional formatting, 397 Einstein, Albert, 30 dashboards, 377 EM. See Expectation-Maximization data mining, 443 ENDOFYEAR(), 392 hierarchies, 223 – 224 EngineThreads, 242 macros, 377 Enterprise Edition, 90 – 92 PowerPivot, 89, 375 – 401 Enterprise Resource Planning (ERP), 20 report designer, 119 enterprise-level business SSAS, 376 – 377, 620 requirements, 8 – 22 SSRS, 331, 338 Adventure Works Cycles, 19 – 20 Execute SQL task, 195 bus matrix, 38 Execution Log, 529 definition step, 10 – 11 executive dashboard, 19 flow chart, 10 Expectation-Maximization (EM), 448 documentation, 10 – 16 exploratory data mining, 431 Adventure Works Cycles, 19 – 20 export interviews, 10 MDS, 177 – 178, 185 preparation, 11 – 13 SSRS, 338 prioritization, 25 extended properties, 142, 529 project scope, 9 – 10 extensibility, 444 Entity Data Model, 137 external reports, 501 entity integrity, 145 – 147 external variables, 454 – 455 environment extract, transform, and load (ETL), 86 – 87 data mining, 442 accumulating snapshots fact table, 229 deployment, 546 – 549 aggregates, 190, 239 dimensional model, 58 – 59 aggregate dimensions, 50 ETL, 240 – 243 archiving, 190 PowerPivot, 399 – 400 audit columns, 200 security, 497 – 498 audit dimension, 215 – 216 SharePoint, 421 available skills, 190 SSRS, 351 – 352 backups, 240 ERP. 
See Enterprise Resource Planning bulk loads, 204 – 206 error event business needs, 189 ETL, 214 – 215 CDC, 200 – 202 logging, 208 Change Tracking, 201 schema, 214 – 215 compliance, 189, 243 error flows cubes, 190 data loading, 204 data cleaning, 206 – 214 OLE DB, 196 data conforming, 217 – 218 SSIS, 196 – 197 data extraction, 199 – 206 error levels, 215 data flow pipelines, 197 error tables, 214, 230 data latency, 190 error types, 214 data loading, 204 – 206 ER/Studio, 137 data presentation, 218 – 240 ERwin, 137 data profiling, 199 estimation algorithms, 435 data propagation manager, 240 ETL. See extract, transform, and load data quality, 189 Event Viewer, 594 data volume, 101


deduplication, 216 system developer, 118 – 119 denormalization, 44 triggers, 200 dependencies, 242 versions, 241 dimensions, 218 – 222 workflow monitor, 241 dimension manager, 238 extraction. See data extraction dimension tables, 218 dimensional model, 29 F directed data mining, 431 facts, 32 – 33 environment, 240 – 243 additivity, 32 error event schema, 214 – 215 Adventure Works Cycles, 72 – 74, 76 – 77 fact providers, 238 – 239 conformed, 77 fact table, 32, 218, 225 – 229 derived, 32 full diff, 201 – 202 dimensions hierarchies, 223 – 224 Analysis Services, 43 job scheduler, 240 dimensional model, 34 – 38 late arriving data handler, 235 – 238 RI, 229 lights-out, 169 late arriving data handler, 236 – 238 lineage, 190, 242 measures, dimensional model, 65 – 66 Metadata Repository Manager, 243 real-time, 310 multi-valued dimensions bridge semi-additive, 32, 33 tables, 235 MDX, 33 non SQL Server techniques, 201 SSAS partitions, 292 nulls, 140 updates, 303 OLAP cubes, 239 fact providers, ETL, 238 – 239 packages, 611 fact table parallelizing, 242 accumulating snapshots, ETL, 229 partitions, 162 aggregates, 239 periodic snapshot fact table, 228 Cube Wizard, 275 pipelines, 242 data cleaning, 230 error event schema, 214 data conforming, 230 planning, 191 – 192 dimensions, 45 – 46 problem escalation, 242 dimension tables, 145 – 146 recovery and restart, 241 dimensional model, 31 – 33, 52 – 53 relational databases, 114 – 115 drill across, 36 replication, 200 ETL, 32, 218, 225 – 229 requirements, 188 – 190 fact providers, 238 SCDs, 40 – 41 factless, 32 schematics, 191 – 192 FK, 32, 146 security, 243 RI, 229 snowflaking, 223 granularity, 33, 269 sorting, 241 – 242 transactions, 226 – 228 special dimensions manager, 224 – 225 indexes, 149 SSIS, 80, 114 – 115, 187 – 244 intermediate, 47 scripts, 213 – 214 ledgered, 303 SSRS, 339, 368 measure groups, 277 staging tables, 163 nulls, 140 subsystems, 198 – 218 packages surrogate keys, 39 RI, 225 generator, 223 SSAS, 226 pipeline, 229 
– 234 page compression, 145


partitions, 162, 227 G periodic snapshot, 52 – 53 Gagné, Robert, 578 ETL, 228 granularity table partitions, 228 fact table, 33, 269 PK, 32 measure groups, 280 – 281 PK index, 146 shrunken dimensions, 225 surrogate key pipeline, 224 group sessions, 27 transaction, 52 – 53 GUID transaction granularity, 228 SSIS, 533 types, 52 – 53 surrogate keys, 39 updates, ETL fact providers, 239 factless fact table, 32 H farms, SharePoint, 414 hard deletes, 200 feedback hashing algorithm, 499 BI applications, 327 Health Analyzer, 424 SharePoint BI portal, 409 hierarchies files. See also flat files; raw files BIDS, 95 partitions, 154 – 155 dimensions, 34, 266, 562 relational databases, 142 – 144 dimensional model, 47 – 49 SQL Server, 123 – 124 ETL, 223 – 224 file groups Excel, 223 – 224 partitions, 154 – 155 fixed, 223 relational databases, 142 – 144 frequently changing, 49 SQL Server, 123 – 124 Master Data Manager, 175 , 156 MDS, 224 filtered reports, 504 one-to-many, 47 – 48 filtering security, 517 – 519 parent-child, SSRS, 49 fixed hierarchies, 223 ragged, 223 FK. See foreign keys RI, 182, 269 flat files security, 511 error tables, 214 SharePoint BI portal, 405 PowerPivot, 381 SSAS, 251 foreign keys (FK) OLAP, 258 – 259, 268 audit dimensions, 215 SSRS, parent-child, 49 dimensions, 36 variable-depth, 48 – 49 dimension tables, 145 – 146 web applications, 415 DSV, 256 high availability, 114 – 115 error tables, 214 Hist, 191 fact table, 32, 145 – 146 HOLAP. See Hybrid OLAP RI, 229 housekeeping columns, 140 – 141 relational databases, 137 HTML forms authentication, 505 scripts, 289 forums, 408 SharePoint BI portal, 411 freeform attributes, 183 SSRS, 331 frequently changing hierarchies, 49 HTTP, 330 full diff, 201 – 202 Hybrid OLAP (HOLAP), 292 – 293 full processing, 292, 300 Fuzzy Grouping, 216 I Fuzzy Lookup, 216 ICD-10. See International Classification of Diseases - Tenth Modification IDENTITY, 223


IIS. See Internet Information Services J impact analysis, 527 job scheduler, 240 import, 176 – 177 joins INCLUDE, 148 database incomplete attributes, 167 – 168 data cleaning, 234 Incr, 191 late arriving data handler, 237 incremental processing, 292, 299, 300 – 302 surrogate key pipeline, 233 – 234 PowerPivot, 392 outer, 210 independent variables, 432 star join optimization, 31 indexes tables, 211 DDL, 153 transforms, 211 dimensions, 147 – 149 junk dimensions, 224 fact table, 149 attributes, 51, 224 partitions, 160 dimensional model, 51 – 52 PK, 146 rows, 224 – 225 relational databases, 147 – 150 SSAS, 51 SQL Server, 149 SSAS, 150 K statistics, 149 KDD. See Knowledge Discovery and Data views, 151, 248 Mining Information Provision system, 502 Kerberos, 420, 504 in-memory, column-store authentication, 505 database, 378 keys. See specific key types INSERT, 176 Key Performance Indicators (KPIs), 276 insert streams, 204 MDX, 287 – 288 Installation Wizard, 567 SSAS, 288 integers, 138 Kimball Bus Matrix, 36 integration. See data integration; SQL Kimball Lifecycle, 2 – 3, 80 Server Integration Services business processes, 18 Integration Services. See SQL Server business requirements, 4 Integration Services deployment, 545 interaction list, 346 – 347 dimensional model, 29 – 30 Interactive Sorting, 366 SSAS, 245 – 246 intermediate fact table, 47 SSRS, 340 International Classification of Diseases system setup, 100 Tenth Modification (ICD-10), 47 Kimball Method, 250, 257 – 258, 279 Internet Information Services (IIS), 414 metadata, 229 interviews SCD, 222 business processes, 10, 15, 17 Knowledge Discovery and Data Mining business requirements, 13 – 15, 17 (KDD), 430 debriefing, 14 – 15 KPIs. 
See Key Performance Indicators documentation, 14 – 15 Adventure Works Cycles, 16 L enterprise-level business requirements, 10 late arriving data handler project planning, 26 database joins, 237 relationships, 12 dimensions source systems, 14 members, 235 – 236 project planning, 27 updates, 236 ISEMPTY, 361 ETL, 235 – 238 isnull, 211 fact providers, 238 IsRowCurrent, 219 facts, 236 – 238 issues list, 66 SSAS, 236 Union All transform, 237


latency. See data latency incomplete attributes, 167 – 168 LazyAggregations, 302 master reference data, 166 – 171 ledgered fact table, 303 system integration, 170 – 171 libraries, 416 Master Data Manager lifestyle types, 437 hierarchies, 175 lights-out ETL, 169 MDS, 174 lineage user interface, 174 – 176 analysis, 527 Master Data Services (MDS), 81, 83, 88, ETL, 190, 242 92, 622 transparency, 197 application creation, 179 – 185 Linoff, Gordon S., 433, 450 attributes, 179 lists, 416 database structures, 173 loading. See data loading; extract, deduplication, 216 transform, and load deployment, 572 Log Analyzer, SSIS, 529 export, 177 – 178, 185 log scraper tools, 201 hierarchies, 224 lookups. See also Cascading Lookups import, 176 – 177 data sources, 231 Master Data Manager, 174 dimensions, 225 metadata, 534 outer joins, 210 nulls, 185 RI, 231 RI, 185 surrogate key pipeline, 230 – 232 security, 173 – 174 uncached, 237 SharePoint, 81 violation errors, 211 SQL Server, 171 – 179 Lookup transform, 234 staging tables, 176 subcategories, 183 – 184 M updates, 176 – 177 macros, 377 user interface, 172 – 173 Manage Relationships, 384 master packages, 198, 215 Management Dashboard, 379 – 380, 397 – 398 master reference data, 166 – 171 Management (MDW), matrix. See also bus matrix 529, 594 – 598 classification, 458 – 459 Management Studio Kimball Bus Matrix, 36 permissions, 503 two-by-two, 24 relational databases, 117 – 118 mdm.tblStgMemberAttributes, 184 SSAS, 506 MDS. See Master Data Services many-to-many MDW. See Management Data Warehouse dimensions, 46 – 47 MDX. 
See Multidimensional Expressions dimensional model, 44 – 47 measures fact table and dimensions, 45 – 46 cubes, 359 PK, 46 dimensional model, 65 – 66 SSAS, 47 PivotTable, 386 – 387 OLAP, 259 – 260 PowerPivot, 386 – 387 surrogate keys, 46 SSAS OLAP, 32 market basket analysis, 436 measure groups mashup, 381 dimensions, 281 massively parallel processing (MPP), 109 fact table, 277 master data, 165 – 186 granularity, 280 – 281 conformed dimensions, 169 partitions, 294 – 295 data integration, 168 – 170 SSAS, 33 dimensions, 165 – 166 tables, 274


memory MPP. See massively parallel processing in-memory, column-store database, 378 Multidimensional Expressions (MDX), 94 NUMA, 109 aggregates, 53 SSAS, 128 – 129 calculations, 284 – 286 MERGE, 228 ISEMPTY, 361 MERGE RANGE, 162 KPIs, 287 – 288 metadata, 523 – 544, 621 RANK, 361 Active Directory, 535 security, 510 categories, 525 semi-additive facts, 33 data mining, 462 – 463 SSRS, 358 Kimball Method, 229 UserName, 514 Master Data Services, 534 Multidimensional OLAP ongoing management, 543 (MOLAP), 292 – 293 purpose, 524 – 525 data latency, 293 RDL, 527 data volume, 101 relational databases, 163 – 164 multi-valued dimensions relational engine, 527 bridge tables, ETL, 235 Report Builder, 534 SSAS OLAP, 259 – 260 repository, 526 SharePoint, 534 N BI portal, 408 Naïve Bayes, 434, 447 – 448, 458 SQL Server, 527 – 535 naming conventions SSAS, 128, 532, 538 – 539 dimensional model, 59 – 60 OLAP, 247 – 248 PowerPivot, 392 SSIS, 533 natural keys, 39 SSRS, 345, 365, 533 – 534 navigation framework, 347 standards, 526 – 527 .NET System Monitor, 534 – 535 AMO, 532 user-oriented, 247 – 248 data extraction, 202 Metadata Repository Manager, 243 Neural Network, 434, 449 – 450 mini-dimensions, 40 Nielsen Claritas, 434, 437 mining. See data mining non SQL Server techniques, ETL, 201 Mining Accuracy Chart non-uniform memory architecture Cross Validation, 459 (NUMA), 109 Data Mining Designer, 457 normalization Mining Model Prediction, 443 dimensions, 34 Mining Model Viewer, 442 SSAS OLAP standard dimensions, 258 Mining Structure, 433, 442 North American Industrial Classification mixed granularity partitions, 161 System, 47 mock-ups, 345 – 346 NOT nulls, 156 MOLAP. See Multidimensional OLAP NSort, 242 monitoring NTLM, 420 performance, 593 – 600 nulls PowerPivot, 600 data cleaning, 207 relational databases, 594 – 596 MDS, 185 SSAS, 597 – 598 quality screens, 210 SSIS, 598 – 600 relational databases, 140 SSRS, 598 surrogate keys, 231 security, 521 – 522 NUMA. 
See non-uniform memory usage, 600 – 602 architecture


O fact table occurrences, 33 – 34 RI, 225 OLAP. See OnLine Analytic Processing SSAS, 226 OLE DB, 195 master, 198, 215 Command transform, 234 raw files, 205 data extraction, 202 security, 520 – 521 error flows, 196 SSIS, 193, 520 – 521, 533 fact table, 228 architecture, 197 – 198 SSRS, 330 audit dimension, 215 one-to-many, 44, 47 – 48 data conforming, 217 OnLine Analytic Processing (OLAP) deployment, 567 – 568 aggregates, 53 variables, 198 cubes, 239 page compression, 144 – 145 HOLAP, 292 – 293 parallel structures, 160 MOLAP, 292 – 293 parallelizing, 242 data latency, 293 parameterized labeling, 365 data volume, 101 parent-child, 48 ROLAP, 292 – 293 SSAS, 49 SSAS, 31, 83 – 85, 92, 245 – 304 SSAS OLAP, 258 – 259 aggregations, 248 – 249 SSRS, 49 calculations, 248 partitions cubes, 251 – 252 accumulating snapshots, 229 measures, 32 CREATE PARTITION metadata, 247 – 248 FUNCTION, 155 – 156 performance, 247 dimension tables, 154 planning, 250 – 252 ETL, 162 query performance, 248 fact table, 162, 227 Report Builder, 247 files, 154 – 155 Reporting Services, 247 file groups, 154 – 155 security, 248 indexes, 160 structure design, 250 – 303 measure groups, 294 – 295 usage complexity, 103 mixed granularity, 161 open access, 498 – 500 parallel structures, 160 operating systems, 122 pseudo-partitions, 159 – 160 security, 495 – 497 relational databases, 115 operations phase, data mining, 460 – 461 scheme, 156 opportunity, data mining, 452 – 453 SSAS, 115, 154 optimizer, 149 facts, 292 Oracle, 202 OLAP, 294 – 296 OrderDateKey, 156 tables, 153 – 163 outer joins, 210 dropping old data, 161 outrigger tables, 44 limitations, 162 – 163 management, 158 – 161 P periodic snapshot fact table, 228 packages Partitions and Aggregations tab, Cube child, 198, 215 Designer, 289 data cleaning, 211 – 212 , 156 data extraction, 202 PDF, 331, 338 data flows, 212 performance ETL, 611 aggregations real-time, 310 – 311 SSAS OLAP, 296 – 298


disk drives, 111 ETL, 191 – 192 monitoring, 593 – 600 interviews, 26 PowerPivot, 600 preparation, 26 relational databases, 594 – 596 SharePoint BI portal, 405 – 411, 419 – 420 SSAS, 597 – 598 source systems interviews, 27 SSIS, 598 – 600 SSAS OLAP, 250 – 252 SSRS, 598 SSRS, 351 – 354 OLAP, 85 policy statements, 499 SSAS, 247 portals. See also BI portal query reporting portal developer, 119 – 120 aggregations, 248 – 249 SSRS, 326 BI applications, 327 PowerDesigner, 137 SSAS OLAP, 248, 268 PowerPivot, 81 testing, 561 analytics, 385 – 387, 399 SCD, 221 architecture, 378 – 380 SSAS OLAP, 247 cache, 396 SSIS data flow pipelines, 197 calculations, 386 – 387 SSRS, 339 cubes, 396 testing, 559 – 561 data sources, 395 – 396 tuning, 604 – 606 databases, 380 – 392 Performance tool, System Monitor, 530 dates, 392 periodic snapshot fact table, 52 – 53 dimensional model, 392 ETL, 228 environment, 399 – 401 table partitions, 228 Excel, 89, 375 – 401 permissions, 516 – 517 flat files, 381 DDL, 518 incremental processing, 392 Management Studio, 503 Manage Relationships, 384 Report Manager, 503 Management Dashboard, SSAS, 501 379 – 380, 397 – 398 Perspective, 290 measures, 386 – 387 pipelines, 110 – 111. See also surrogate key naming conventions, 392 pipeline performance monitoring, 600 data flow, 195 – 197 PivotTable, 385 – 386 ETL, 197 relational databases, 381 quality screens, 206 – 207 relationships, 383 – 385 SSIS performance, 197 report designer, 119 ETL, 242 security, 514 error event schema, 214 SharePoint, 81, 89, 92, 327 – 380, 378 – 380, PivotTable 392 – 399, 413, 422 ad hoc reporting, 377 monitoring and management, 397 – 399 calculations, 386 – 387 SSAS, 381, 532 database columns, 390 – 391 standard reports, 395 measures, 386 – 387 tables, 381 – 385 PowerPivot, 385 – 386 workbook publishing, 398 – 399 PK. See primary key preparation placeholders, 230, 233 Adventure Works Cycles, 12 – 13 planning. 
See also project planning Adventure Works Cycles, 26 – 27 backups, 606 – 610 business requirements, 25 – 28 data profiling, 27 ERP, 20
data mining, 453 dimensional model, 55 – 60 enterprise-level business requirements, 11 – 13 project planning, 26 SSRS, 349, 351 – 354


Preview tab, BIDS, 356 Query Designer, 358 PREVIOUSYEAR(), 392 , 85 primary key (PK) query performance columns, 223 aggregations, 248 – 249 dimensions, 35 BI applications, 327 dimension tables, 138 SSAS OLAP, 248, 268 DSV, 256 testing, 561 fact table, 32, 146 indexes, 146 R many-to-many, 46 ragged hierarchies, 223 relational databases, 137 RAID. See Redundant Array of tables, 145 – 146 Independent Disks primary testing environment, 547 RANK, 361 prioritization raw files business processes, 23, 25 error tables, 214 business requirements, 22 – 25 packages, 205 enterprise-level business requirements, 25 SSIS, 204 – 205 two-by-two matrix, 24 RC-Xtrct, 213 PRIZM, 434, 437 RDL. See Report Definition Language proactive caching, 292, 319 real-time, 305 – 320 process metadata, 525, 541 – 542 data latency, 305 – 319 ProcessingMode, 302 facts, 310 processors, 113 performance aggregations, 310 – 311 ProClarity, 620 relational databases, 317 – 318 ProdCount, 213 scenarios and solutions, 311 ProdLowLimit, 213 SSAS, 318 – 319 product keys, 419 SSIS, 319 profiling. See data profiling recovery. 
See backups project planning recursive subroutines, 48 Adventure Works Cycles, 26 – 27 Redundant Array of Independent Disks business requirements, 25 – 28 (RAID) data profiling, 27 operating systems, 122 interviews, 26 SQL Server, 123 – 124 preparation, 26 SSAS, 127 – 128 source systems interviews, 27 system setup, 112 promotion response rate, 19 reference dimensions, 44 properties referential integrity (RI) attributes, 266 – 267 dimensions, 229 cubes, 278 – 279 dimension tables, 229 dimensions, 265 – 266 dimensional model, 230 extended, 142, 529 error tables, 214, 230 pseudo-partitions, 159 – 160 facts, 229 data flows, 227 fact table standalone tables, 227 FK, 229 Publisher, 503 packages, 225 pull model, 336 hierarchies, 182, 269 push model lookups, 231 data extraction, 202 MDS, 185 Report Manager, 338 placeholders, 230, 233 relational databases, 145 – 147 Q rows, 230 quality screens surrogate key pipeline, 225 data flow pipelines, 206 – 207 regression, 433 – 435 nulls, 210


relational databases, 91. See also SQL Server Report Builder, 12, 80, 620 aggregate tables, 150 – 151 datasets, 370 backups, 606 – 608 metadata, 534 columns, 137 SSAS OLAP, 247 constraints, 142 – 153 SSRS, 88, 333, 530 CREATE TABLE, 152 – 153 Report Definition Language (RDL), 331, 333 creating, 135 – 164 metadata, 527 data compression, 144 – 145 Report Designer, 119 data storage, 142 – 153 BIDS, 130, 333 deployment, 565 – 567 Design tab, 362 entity integrity, 145 – 147 Report Manager ETL, 114 – 115 permissions, 503 extended properties, 142 push model, 338 files, 142 – 144 SSRS, 334, 403, 529 file groups, 142 – 144 Report Part Gallery, 371 – 372 FK, 137 Report Server, 370 housekeeping columns, 140 – 141 Report Wizard, 353 – 354 indexes, 147 – 150 resilience, 222 Management Studio, 93, 117 – 118 Resource Governor, 592 metadata, 163 – 164 resource pools, 592 nulls, 140 Response_Likelihood, 434 – 435 object names, 136 restart points, 203 partitions, 115 Restricted Access, 498 performance monitoring, 594 – 596 RI. See referential integrity physical design, 137 – 142 ROLAP. See Relational OLAP PK, 137 roles PowerPivot, 381 dimensional model, 55 – 56 real-time, 317 – 318 relational databases, 515 – 520 RI, 145 – 147 SSAS, 506 – 513 roles, 515 – 520 SSRS, 504 SQL Server, 31 role-playing dimensions, 42, 260 – 261 security, 123 rows SSAS, 239 compression, 144 string columns, 138 – 139 ETL dimension manager, 238 surrogate keys, 138 junk dimensions, 224 – 225 system management, 588 – 589 RI, 230 tables security, 517 – 519 partitions, 153 – 163 RowChangeReason, 141 views, 151 – 152 RowEndDate, 140 – 141, 219 unknown member row, 152 RowIsCurrent, 141 views, 151 – 152, 516 relational engine, 80, 144, 197, 368, 621 – 622 S metadata, 527 SAMEPERIODLASTYEAR(), 392 Resource Governor, 592 SAN. See storage area networks Relational OLAP (ROLAP), 292 – 293 SAS. See Statistical Analysis System relational security, 514 scalability, 85 testing, 519 – 520 SCDs. 
See slowly changing dimensions relational stores, 86 scheduled processing, 300 rendering, 331 SchemaCompare, 565 replication, 200, 201 scorecards, 19, 326 scripts ETL SSIS, 213 – 214


HTML, 289 calendars, 408 testing, 567 – 568 feedback, 409 SCRUBS, 530, 533 forum, 408 search, 408 hierarchies, 405 second level business processes, 18 HTML, 411 security, 493 – 522 metadata, 408 cells, 512 – 513 personalization, 408 data, 498 – 502 planning, 405 – 411, 419 – 420 dimensions, 507 – 511 product keys, 419 dynamic, 513 – 514 search, 408 environment, 497 – 498 SharePoint, 403 – 427 ETL, 243 templates, 425 – 426 external reports, 501 testing, 421 – 426 filtering, 517 – 519 versions, 419 hierarchies, 511 Central Administration, 398, 423 manager, 494 – 495 databases, 414 MDS, 173 – 174 environment, 421 MDX, 510 farms, 414 monitoring, 521 – 522 Health Analyzer, 424 open access, 498 – 500 installation, 417 – 426 operating systems, 495 – 497 MDS, 81 packages, 520 – 521 metadata, 534 PowerPivot, 514 PowerPivot, 81, 89, 92, 378 – 380, 392 – 399, relational, 514, 519 – 520 413, 422 rows, 517 – 519 monitoring and management, 397 – 399 SharePoint, 396, 504 – 505 security, 396 sponsorship, 499 SharePoint BI portal, 403 – 427 SQL Server relational databases, 123 SSIS standard reports, 504 SSAS, 505 – 520 SSRS, 336, 424 OLAP, 248, 268 security, 504 – 505 SSIS, 520 – 521 terminology, 414 – 417 SSRS, 332, 338, 501 – 505 SharePoint Server, 422 subscriptions, 501 – 502 shrunken dimensions, 50, 225 Windows Integrated Security, 496 – 497 Silverlight, 395 SELECT, 12, 61, 68, 176 simultaneous users, 104 – 105 drill across, 217 – 218 sites, 414 SELECT DISTINCT, 199 site collections, 414 self-referencing tables, 48 SLAs. 
See Service Level Agreements semi-additive facts, 32 slowly changing dimensions (SCDs), 75 – 76 MDX, 33 Adventure Works Cycles dimensional sequence clustering, 448 model, 75 – 76 service application, 413 attributes, 40 Service Level Agreements (SLAs), customization, 221 559 – 560, 603 – 604 databases, 222 SharePoint, 81, 530 dimensional model, 39 – 41 BI portal ETL, 40 – 41, 218 – 222 Active Directory, 419 Kimball Method, 222 announcements, 408 performance, 221 architecture, 412 – 416 resilience, 222 authentication, 420 SSIS, 218 – 222 building, 409 – 411 TableDifference, 222 business processes, 407 – 408 wizard, 220


small static dimensions, 225 Fast Track Data Warehouse, 111 SmartDiff, 548 files, 123 – 124 Smith, Bryan, 514 file groups, 123 – 124 SMO. See SQL Management Objects indexes, 149, 248 SMP. See symmetric multiprocessing MDS, 171 – 179 snapshots. See accumulating snapshots; metadata, 527 – 535 periodic snapshot fact table RAID, 123 – 124 snowflaking relational databases, 31, 122 – 126 dimensional model, 43 – 44 security, 123 ETL, 223 system setup, 122 – 126 SSAS, 44 views, 248 views, 152 SQL Server 2008 MDX Step by Step (Smith SOAP, 330 and Clay), 514 AMO, 443 SQL Server Agent, 123, 528 – 529, 608 Report Manager, 335 SQL Server Analysis Services (SSAS), software. See also specific programs 80, 622 installation/configuration, system actions, 288 – 289 setup, 115 – 131 Active Directory, 506 operating systems, 122 Activity Viewer, 593 testing, 120 – 121 aggregates, 53, 150 – 151, 239 Solid-state drives (SSDs), 111, 128 aggregate dimensions, 51 Solution Explorer, 95 – 96, 278 backups, 609 – 610 sorting BIDS, 95, 117, 253 dimension table columns, 141 Business Dimensional Lifecycle, 245 – 246 ETL, 241 – 242 cubes, 229, 507 Interactive Sorting, SSRS, 366 data mining, 80, 92, 443, 457 source systems, 12 data sources, 363 audit columns, 200 data storage, 127 – 128, 292 – 293 data profiling, 16 data volume, 101 dimensional model, 60 – 63 deployment, 568 – 570 events, 32 disadvantages, 249 – 250 full diff, 202 Dynamic Management Views, 529 interviews, 14 Excel, 376 – 377, 620 project planning, 27 fact dimensions, 43 surrogate keys, 38 fact table packages, 226 special dimensions manager, 224 – 225 hierarchies, 251 sp_estimate_data_compression_ indexes, 150 savings, 145 junk dimensions, 51 SPLIT RANGE, 162 Kimball Lifecycle, 245 – 246 sponsorship KPIs, 288 business requirements, 7 – 8 late arriving data handler, 236 security, 499 Management Studio, 93 – 94, 506 SPSS. 
See Statistical Package for the Social many-to-many, 47 Sciences MDW, 529 SQL Management Objects (SMO), 528, 532 measure groups, 33 SQL Server memory, 128 – 129 authentication, 504 metadata, 128, 532, 538 – 539 Configuration Manager, 123 object model, 529 data compression, 144 OLAP, 31, 83 – 85, 92, 245 – 304 data mining, 440 – 445 aggregations, 248 – 249 database designer, 118 attributes, 262 – 263, 266 – 267, 270 – 273 database recovery model, 125 – 126 calculations, 248 DMX, 443 cube content, 251 – 252


cube granularity, 252 bridge tables, 235 cube usage, 252 data flow, 195 – 196 cubes, 262, 274 – 291, 299 pipelines, 197 databases, 254 – 255 transforms, 207, 216 dimensions, 261 – 274 data mining, 443 – 444 dimension updates, 302 – 303 data profiling, 199 DSV, 255 – 257 design surfaces, 194 – 197 hierarchies, 268 Destination Editor, 206 many-to-many, 259 – 260 error flows, 196 – 197 measures, 32 ETL, 114 – 115, 187 – 244 metadata, 247 – 248 scripts, 213 – 214 multi-valued dimensions, 259 – 260 GUID, 533 partitions, 294 – 296 Log Analyzer, 529 performance, 247 Management Studio, 94 performance aggregations, 296 – 298 metadata, 533 physical design, 291 – 303 packages, 193, 520 – 521, 533 planning, 250 – 252 architecture, 197 – 198 query performance, 248, 268 audit dimension, 215 Report Builder, 247 data conforming, 217 Reporting Services, 247 deployment, 567 – 568 role-playing dimensions, 260 – 261 testing, 554 security, 248, 268 performance monitoring, 598 – 600 standard dimensions, 258 – 259 raw files, 204 – 205 structure design, 250 – 303 real-time, 319 views, 253 – 254 SCD, 218 – 222 parent-child, 49 security, 520 – 521 partitions, 115, 154 SharePoint, standard reports, 504 facts, 292 string columns, 139 performance monitoring, 597 – 598 surrogate key pipeline, 230 permissions, 501 system setup, 129 – 130 Perspective, 290 Term Extraction, 444 PowerPivot, 381, 532 transaction grain fact table, 228 RAID, 127 – 128 transforms, data cleaning, 211 real-time, 318 – 319 XML, 568 relational databases, 239 SQL Server Management Studio (SSMS), 12, report designer, 119 79, 92 – 94 roles, 506 – 513 Activity Monitor, 588 – 590 security, 505 – 520 Database Diagram, 137 snowflaking, 44 SQL Server Metadata Toolkit, 528 string columns, 139 SQL Server Profiler, 528 – 529, 594 synonyms, 42 SQL Server Reporting Services (SSRS), system management, 589 – 591 80, 92 system setup, 126 ad hoc reporting, 369 – 372 translations, 290 – 291 ADO.NET, 330 usage complexity, 103, 104 
architecture, 330 – 332 views, 517 assessment, 339 – 340 Windows Integrated Security, 496 backups, 610 SQL Server Integration Services (SSIS), 80, 91 BI applications, 323 – 373 ADO.NET, 195 BIDS, 95, 356 backups, 608 – 609 browsers, 336 BIDS, 95, 611 build, 349


business requirements, 328 – 330 Windows Integrated Security, 496 conditional formatting, 330, 365 workarounds, 355 Configuration Manager, 131 XML, 333, 338 data extensions, 330 SSAS. See SQL Server Analysis Services data profiling, 61 SSDs. See Solid-state drives data sources, 357 – 361 SSIS. See SQL Server Integration Services datasets, 355, 357 – 361 SSMS. See SQL Server Management Studio delivery, 332 SSRS. See SQL Server Reporting Services deployment, 350 – 351, 571 – 572 SSRS template, Adventure Works design, 341 – 348 Cycles, 353 development, 348 – 351 StageProduct, 210 documentation, 345 – 347 staging tables, 163, 176 environment, 351 – 352 standalone tables, 227 ETL, 339, 368 standard dimensions, 258 – 259 Execution Log, 529 Standard Edition, 90 – 92 export, 338 standard reports, 356 – 368, 504 interaction list, 346 – 347 PowerPivot, 395 Interactive Sorting, 366 security, 504 Kimball Lifecycle, 340 SharePoint SSIS, 504 Management Studio, 94 SSRS, 326, 504 MDX, 358 star join optimization, 31 metadata, 345, 365, 533 – 534 , 31 mock-ups, 345 – 346 STARTOFMONTH(), 392 navigation framework, 347 Statistical Analysis System (SAS), 431 navigation portal, 350 Statistical Package for the Social Sciences OLE DB, 330 (SPSS), 431 parameterized labeling, 365 storage. See data storage parent-child hierarchies, 49 storage area networks (SAN), 112 – 113 performance, 339 stored procedures, 444, 529 monitoring, 598 string columns, 138 – 139 planning, 351 – 354 structure screens, 206, 208 preparation, 349, 351 – 354 subcategories rendering, 331 MDS, 183 – 184 Report Builder, 88, 333, 530 tables, 50 report creation, 354 – 368 Adventure Works Cycles, 50 Report Manager, 335, 403, 529 subscriptions report model, 370 – 371 security, 501 – 502 report parts, 371 – 372 SSRS, 368 roles, 504 sub-sites, 414 security, 332, 338, 501 – 505 subsystems, ETL, 198 – 218 SharePoint, 336, 424 SUM, 12 security, 504 – 505 summary tables. 
See aggregates SSAS OLAP, 247 surrogate keys as standard reporting tool, 332 – 339 data cleaning, 230 subscriptions, 368 data conforming, 230 system management, 591 dates, 42 – 43 system setup, 130 – 131 dimensions, 38 templates, 343 – 345, 352 – 354 attributes, 39 testing, 350, 366 – 367 dimensional model, 38 – 39 usage complexity, 103 ETL, 39 user review, 348 dimension manager, 238


generator, 223 T GUID, 39 tables. See also specific table types IDENTITY, 223 aggregates, relational databases, 150 – 151 integers, 138 DSV, 256 many-to-many, 46 extended properties, 142 nulls, 231 joins, 211 relational databases, 138 measure groups, 274 source systems, 38 partitions, 153 – 163 surrogate key pipeline, 39 dropping old data, 161 Cascading Lookups, 230 – 232, 237 – 238 limitations, 162 – 163 database joins, 233 – 234 management, 158 – 161 ETL, 229 – 234 periodic snapshot fact table, 228 fact table, 224 PK, 145 – 146 lookups, 230 – 232 PowerPivot, 381 – 385 RI, 225 staging, 163 SSIS, 230 subcategories, 50 symmetric multiprocessing (SMP), 109 views, 151 – 152 SyncSort, 242 Table Description, 142 synonyms, 42 TableDifference, 222 sysssispackages, 520 tblStgMember, 176 System Administrator, 503 tblStgMemberAttribute, 176 system management, 587 – 612 tblSTGRelationship, 176 relational databases, 588 – 589 TDE. See transparent database encryption SSAS, 589 – 591 technical metadata, 525, 542 SSRS, 591 templates System Monitor, 121, 594 SharePoint BI portal, 425 – 426 metadata, 534 – 535 SSRS, 343 – 345, 352 – 354 Performance tool, 530 Term Extraction, 444 system setup, 99 – 131 testing configuration, 105 – 115 data quality, 557 – 558 data pipeline, 110 – 111 deployment, 550 – 564 data storage, 110 – 113 developmental, 551 – 554 data volume, 101 – 102 dimensional model, 68 disk drive performance, 111 performance, 559 – 561 high availability, 114 – 115 query performance, 561 Kimball Lifecycle, 100 relational security, 519 – 520 processors, 113 scripts, 567 – 568 RAID, 112 SharePoint BI portal, 421 – 426 SAN, 112 – 113 software, 120 – 121 simultaneous users, 104 – 105 SSIS packages, 554 size, 100 – 105 SSRS, 350, 366 – 367 software installation/ system, 555 – 556 configuration, 115 – 131 usability, 562 SQL Server, 122 – 126 text mining, 444 SSAS, 126 Thoreau, Henry David, 79 SSIS, 129 – 130 TIFF, 338 SSRS, 130 – 131 Time and Account, 267 – 268 system 
availability requirements, 105 usage complexity, 102 – 104 system testing, 555 – 556 Systems Center, 121
Time Series, 449 toolset, 79 – 97 BIDS, 95 – 97 ETL, 86 – 87


MDS, 88 V relational stores, 86 validation SQL Server, 92 – 97 BI applications, 327 SSAS, 84 – 85 Cross Validation, 459 top assemblies, 48 data mining model, 457 training dimensional model, 68 databases, 578 varchar, 139 deployment, 576 – 579 variables, 432 – 433 transaction fact table, 52 – 53 external, 454 – 455 transaction granularity, fact table, 226 – 228 packages, 198 transforms. See also extract, transform, and variable-depth hierarchies, 48 – 49 load; slowly changing dimensions versions column screens, 207 ETL, 241 data extraction, 202 SharePoint BI portal, 419 data flow, 196 VertiPaq, 378 SSIS, 207 views derived columns, Cascading Lookups, 237 Dynamic Management Views, 529, 589 joins, 211 indexes, 151 SSIS, 207, 211 SQL Server, 248 structure screens, 208 relational databases, 516 translations, 290 – 291 SharePoint, 416 transparency, 197 snowflaking, 152 transparent database encryption (TDE), 515 SSAS, 517 triggers, 200 OLAP, 253 – 254 two-by-two matrix, 24 tables, 151 – 152 violation errors, 211 U virtual machines, 418 UAT. See User Acceptance Test Visual Studio, 81 uncached lookups, 237 BIDS, 88, 95, 549 Unicode, 139 deployment, 566 Union All transform, 234, 237 Visual Totals, 511 unit testing. See testing unknown member row, 152 W UPDATE, 228 web applications, 414 updates hierarchies, 415 dimensions, 219 Web Front End (WFE), 421 late arriving data handler, 236 Web Parts, 416 dimension tables, ETL fact providers, 239 WFE. See Web Front End facts, 303 WHERE, 295 fact table, ETL fact providers, 239 Windows Integrated Security, 496 – 497 MDS, 176 – 177 WMI, 335 usability testing, 562 workflow monitor, 241 User Acceptance Test (UAT), 548, 557 World Health Organization, 47 user interface Master Data Manager, 174 – 176 X MDS, 172 – 173 XML user support, 584 – 587 SSIS, 568 deployment, 579 SSRS, 333, 338 user-defined functions, 444 XML for Analysis (XMLA), 94, 569 – 570 UserName, 514 XMLA. See XML for Analysis user-oriented metadata, 247 – 248 XtrctLowLimit, 213
