![Part 23 - Data Mining](https://data.docslib.org/img/3a60ab92a6e30910dab9bd827208bcff-1.webp)
<p> Part 23</p><p>Data Mining OLAP (On-Line Analytical Processing)</p><p>Oracle OLAP Products Oracle Express Oracle Express Web Agent Oracle Express Objects Oracle Express Analyzer Oracle Financial Analyzer Oracle Financial Controller Oracle Sales Analyzer</p><p>OLAP Uses Lets users play with information stored in the data warehouse Can view the data from a variety of angles Can analyze sales over time, by region, by product type</p><p>Copyright ã 1971-2002 Thomas P. Sturm Data Mining Part 23, Page 2 Categories of OLAP Tools</p><p>DOLAP - Desktop OLAP PC tools with limited analysis of small warehouses ROLAP - Relational OLAP Server-based tools for savvy relational database users MOLAP - Multidimensional OLAP Server-based tools for read-only pre-computed cubes of data</p><p>Copyright ã 1971-2002 Thomas P. Sturm Data Mining Part 23, Page 3 Star Schemas</p><p>Copyright ã 1971-2002 Thomas P. Sturm Data Mining Part 23, Page 4 Data Mining</p><p>Discover hidden patterns in data Also known as Knowledge Discovery in Databases (KDD) Pattern has not yet been identified by the end user</p><p>Discover anomalies in the data One pattern is inconsistent with other patterns Identify bad data, bad reporting, bad modeling, vs. real differences that need to be dealt with or exploited</p><p>Pattern: A pattern is an inherent relationship and/or distributions in the data under consideration</p><p>Data: Data is the collection stored in the Data Warehouse environment</p><p>Copyright ã 1971-2002 Thomas P. Sturm Data Mining Part 23, Page 5 Data Mining Initial Process</p><p>Problem identification Determine the scope of the problem Determine if the data is available to address the problem</p><p>Data preparation Data collection Data cleaning Data reduction, including removal of useless variables and sampling of large datasets Data transformation to a limited set of numerical values</p><p>Model formulation Select a model and technique for mining the data</p><p>Data analysis/Pattern exploration/Data classification Exact approach will depend on model</p><p>Copyright ã 1971-2002 Thomas P. Sturm Data Mining Part 23, Page 6 Data Mining Continuing Process</p><p>Validation and iteration Test model against unknown cases that were not used to generate the model Verify validity of model Modify as necessary Must be able to predict unusual cases</p><p>Knowledge acquisition Run model against all cases Generate final statistics</p><p>Interpretation Translate numbers into words Build a theory of behavior based on subject matter knowledge that is consistent with the facts</p><p>Implementation Develop a plan of action that is theoretically sound under the theory developed</p><p>Copyright ã 1971-2002 Thomas P. Sturm Data Mining Part 23, Page 7 Fields used by Data Mining</p><p>Statistics Means, variances, normal distribution, regression (single and multiple, linear and non-linear), ANOVA</p><p>Artificial Intelligence Pattern recognition, deductive reasoning, vision algorithms, expert systems, genetic algorithms</p><p>Databases Clustering, pattern bit maps</p><p>Operations Research Linear programming, Lagrangian multipliers, branch and bound, integer programming</p><p>Copyright ã 1971-2002 Thomas P. Sturm Data Mining Part 23, Page 8 Business Applications of Data Mining</p><p>Market segmentation Market basket analysis Fraud detection Customer retention Forecasting Investment management Manufacturing and production</p><p>Copyright ã 1971-2002 Thomas P. Sturm Data Mining Part 23, Page 9 Data Mining Tasks</p><p>Classification Clustering Association Prediction Summarization Visualization</p><p>Copyright ã 1971-2002 Thomas P. Sturm Data Mining Part 23, Page 10 Data Mining Methods</p><p>Statistical regression Neural networks Decision trees Multivariate statistical methods Mathematical programming Case-based reasoning Genetic algorithms</p><p>Copyright ã 1971-2002 Thomas P. Sturm Data Mining Part 23, Page 11 Copyright ã 1971-2002 Thomas P. Sturm Data Mining Part 23, Page 12</p>
Details
-
File Typepdf
-
Upload Time-
-
Content LanguagesEnglish
-
Upload UserAnonymous/Not logged-in
-
File Pages12 Page
-
File Size-