[2 Points] Dataset Description, and Initial Data Preprocessing If Any (At Most ½ Page)

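<p>CS548 Knowledge Discovery and Data Mining. Spring 2014. Project 4: Clustering. Student’s Name: write your name here</p>
<p>[2 points] Dataset Description, and initial data preprocessing if any (at most ½ page):</p>
<p>[3 points] Three Guiding Questions about the dataset domain (at most ½ page):</p>
<p>1. …</p>
<p>2. …</p>
<p>3. …</p>
<p>[5 points] Summary of Experiments with Partitional Clustering (k-means). At most ½ page.</p>
<table>
<tr><th></th><th>Tool</th><th># clusters</th><th>Distance function</th><th># iterations</th><th>SSE</th><th>% of instances per cluster</th><th>Observations about experiment<br>Observations about visualization<br>Interpretation of centroids<br>Classes to cluster evaluation?</th><th>You can add other columns</th></tr>
<tr><td>P1</td><td>Weka? RapidMiner? Matlab?</td><td></td><td></td><td></td><td></td><td></td><td></td><td></td></tr>
<tr><td>P2</td><td>…</td><td></td><td></td><td></td><td></td><td></td><td></td><td></td></tr>
<tr><td>P3</td><td>…</td><td></td><td></td><td></td><td></td><td></td><td></td><td></td></tr>
<tr><td>…</td><td>…</td><td>…</td><td>…</td><td>…</td><td>…</td><td>…</td><td>…</td><td>…</td></tr>
</table>
<p>[5 points] Summary of Experiments with EM. At most ½ page.</p>
<table>
<tr><th></th><th>Tool</th><th>Pre-process</th><th># clusters</th><th>Distance function</th><th># iterations</th><th>Log likelihood</th><th>Observations about experiment<br>Observations about visualization<br>Interpretation of means &amp; std dev<br>Classes to cluster evaluation?</th><th>You can add columns</th></tr>
<tr><td>E1</td><td>Weka? RapidMiner? Matlab?</td><td></td><td></td><td></td><td></td><td></td><td></td><td></td></tr>
<tr><td>E2</td><td>…</td><td></td><td></td><td></td><td></td><td></td><td></td><td></td></tr>
<tr><td>E3</td><td>…</td><td></td><td></td><td></td><td></td><td></td><td></td><td></td></tr>
<tr><td>…</td><td>…</td><td>…</td><td>…</td><td>…</td><td>…</td><td>…</td><td>…</td><td>…</td></tr>
</table>
<p>[6 points] Summary of Experiments with Hierarchical Clustering (single link, complete link, average, centroid, Ward). At most 1 page.</p>
<table>
<tr><th></th><th>Tool</th><th>Pre-process</th><th># clusters</th><th>Link type</th><th># iterations</th><th>Time taken</th><th>Observations about experiment<br>Observations about visualization<br>Classes to cluster evaluation?</th><th>You can add other columns</th></tr>
<tr><td>H1</td><td>Weka? RapidMiner? Matlab?</td><td></td><td></td><td></td><td></td><td></td><td></td><td></td></tr>
<tr><td>H2</td><td>…</td><td></td><td></td><td></td><td></td><td></td><td></td><td></td></tr>
<tr><td>H3</td><td>…</td><td></td><td></td><td></td><td></td><td></td><td></td><td></td></tr>
<tr><td>…</td><td>…</td><td>…</td><td>…</td><td>…</td><td>…</td><td>…</td><td>…</td><td>…</td></tr>
</table>
<p>[7 points] Analysis of Results:</p>
<p>1. Analyze the effect of varying parameters/experimental settings on the results.</p>
<p>2. Analyze the results from the point of view of the Domain, and discuss the answers that the experiments provided to your guiding questions.</p>
<p>3. Include and explain (some of) the best / most interesting results you obtained in your experiments.</p>
<p>4. Include a visualization of the best k-means clustering, the best hierarchical clustering, and the best EM clustering you obtained.</p>
<p>[7 points] Advanced Topic: &lt;include name of the topic here&gt;</p>
<p>List of sources/books/papers used for this topic (include URLs if available):</p>
<p>…</p>
<p>…</p>
<p>…</p>
<p>…</p>
<p>Description of the topic and summary of what you learned:</p>
<p>How does this topic relate to clustering and the material covered in this course?</p>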
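<p>As an illustration only, the SSE and "% of instances per cluster" columns of the k-means summary table could be computed by hand along the following lines. This is a minimal sketch in plain Python, not part of the assignment: it assumes squared Euclidean distance (the template leaves the distance function open), and the function names <code>sse</code> and <code>cluster_percentages</code> as well as the toy points, labels, and centroids are hypothetical. Weka, RapidMiner, and Matlab report these values themselves.</p>

```python
from collections import Counter


def sse(points, labels, centroids):
    """Sum of squared Euclidean distances from each point to its assigned centroid.

    Assumption: squared Euclidean distance; other distance functions
    would need a different per-point term.
    """
    total = 0.0
    for point, k in zip(points, labels):
        centroid = centroids[k]
        total += sum((a - b) ** 2 for a, b in zip(point, centroid))
    return total


def cluster_percentages(labels):
    """Percentage of instances assigned to each cluster label."""
    counts = Counter(labels)
    n = len(labels)
    return {k: 100.0 * counts[k] / n for k in sorted(counts)}


# Hypothetical toy data: four 2-D points already assigned to two clusters.
points = [(1.0, 1.0), (1.0, 3.0), (5.0, 5.0), (7.0, 5.0)]
labels = [0, 0, 1, 1]
centroids = [(1.0, 2.0), (6.0, 5.0)]

print("SSE:", sse(points, labels, centroids))
print("% per cluster:", cluster_percentages(labels))
```

<p>In this toy case every point lies at distance 1 from its centroid, so the SSE is 4.0 and each cluster holds 50% of the instances.</p>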
