Part 23 - Data Mining

Part 23 - Data Mining

<p> Part 23</p><p>Data Mining OLAP (On-Line Analytical Processing)</p><p>Oracle OLAP Products Oracle Express Oracle Express Web Agent Oracle Express Objects Oracle Express Analyzer Oracle Financial Analyzer Oracle Financial Controller Oracle Sales Analyzer</p><p>OLAP Uses Lets users play with information stored in the data warehouse Can view the data from a variety of angles Can analyze sales over time, by region, by product type</p><p>Copyright ã 1971-2002 Thomas P. Sturm Data Mining Part 23, Page 2 Categories of OLAP Tools</p><p>DOLAP - Desktop OLAP PC tools with limited analysis of small warehouses ROLAP - Relational OLAP Server-based tools for savvy relational database users MOLAP - Multidimensional OLAP Server-based tools for read-only pre-computed cubes of data</p><p>Copyright ã 1971-2002 Thomas P. Sturm Data Mining Part 23, Page 3 Star Schemas</p><p>Copyright ã 1971-2002 Thomas P. Sturm Data Mining Part 23, Page 4 Data Mining</p><p>Discover hidden patterns in data Also known as Knowledge Discovery in Databases (KDD) Pattern has not yet been identified by the end user</p><p>Discover anomalies in the data One pattern is inconsistent with other patterns Identify bad data, bad reporting, bad modeling, vs. real differences that need to be dealt with or exploited</p><p>Pattern: A pattern is an inherent relationship and/or distributions in the data under consideration</p><p>Data: Data is the collection stored in the Data Warehouse environment</p><p>Copyright ã 1971-2002 Thomas P. Sturm Data Mining Part 23, Page 5 Data Mining Initial Process</p><p>Problem identification Determine the scope of the problem Determine if the data is available to address the problem</p><p>Data preparation Data collection Data cleaning Data reduction, including removal of useless variables and sampling of large datasets Data transformation to a limited set of numerical values</p><p>Model formulation Select a model and technique for mining the data</p><p>Data analysis/Pattern exploration/Data classification Exact approach will depend on model</p><p>Copyright ã 1971-2002 Thomas P. Sturm Data Mining Part 23, Page 6 Data Mining Continuing Process</p><p>Validation and iteration Test model against unknown cases that were not used to generate the model Verify validity of model Modify as necessary Must be able to predict unusual cases</p><p>Knowledge acquisition Run model against all cases Generate final statistics</p><p>Interpretation Translate numbers into words Build a theory of behavior based on subject matter knowledge that is consistent with the facts</p><p>Implementation Develop a plan of action that is theoretically sound under the theory developed</p><p>Copyright ã 1971-2002 Thomas P. Sturm Data Mining Part 23, Page 7 Fields used by Data Mining</p><p>Statistics Means, variances, normal distribution, regression (single and multiple, linear and non-linear), ANOVA</p><p>Artificial Intelligence Pattern recognition, deductive reasoning, vision algorithms, expert systems, genetic algorithms</p><p>Databases Clustering, pattern bit maps</p><p>Operations Research Linear programming, Lagrangian multipliers, branch and bound, integer programming</p><p>Copyright ã 1971-2002 Thomas P. Sturm Data Mining Part 23, Page 8 Business Applications of Data Mining</p><p>Market segmentation Market basket analysis Fraud detection Customer retention Forecasting Investment management Manufacturing and production</p><p>Copyright ã 1971-2002 Thomas P. Sturm Data Mining Part 23, Page 9 Data Mining Tasks</p><p>Classification Clustering Association Prediction Summarization Visualization</p><p>Copyright ã 1971-2002 Thomas P. Sturm Data Mining Part 23, Page 10 Data Mining Methods</p><p>Statistical regression Neural networks Decision trees Multivariate statistical methods Mathematical programming Case-based reasoning Genetic algorithms</p><p>Copyright ã 1971-2002 Thomas P. Sturm Data Mining Part 23, Page 11 Copyright ã 1971-2002 Thomas P. Sturm Data Mining Part 23, Page 12</p>

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    12 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us