236620 Spring 2013 Syllabus

236620 Spring 2013 Syllabus

<p> 236620 Spring 2013 Syllabus</p><p>1. Chapter 1: Introduction (two weeks)  Week 1: course administration and logistics; intro lecture describing course outline, sampling a little bit of everything.  Week 2:  Principles of large scale systems: scalability, consistency, fault-tolerance, workload optimization.  Data centers: high level architecture, physical layout and implications of data locality, intra-machine communication etc. 2. Chapter 2: Batch data processing and analytics – Map/Reduce, Hadoop and beyond  Week 3: real-life problem motivating Map-Reduce (indexing), and the Map-Reduce programming paradigm at a high-level.  Week 4: Map-Reduce implementation in depth.  Week 5: high-level analytical languages (e.g. Pig, Hive). High-level abstractions (select, join, aggregate), and how they map to Map-Reduce.  Week 6: ad-hoc/interactive analytics, e.g. OLAP and data cubes, with systems such as BigQuery/Hive/Shark. 3. Chapter 3: incremental processing via OLTP/NoSQL databases  Week 7: example of incremental computation on large data that requires scalable key/value stores – online page importance computation.  Week 8: OLTP basics, BigTable, HBASE.  Week 9: motivation for incremental processing, Transactions for NoSQL (in a nutshell), Percolator, Omid. 4. Chapter 4: Stream processing  Week 10: canonical stream mining problems (approximate counting, frequent itemsets)  Week 11: online optimization: multi-armed bandits  Week 12: Storm, S4. 5. Chapter 5: Misc  Week 13: Controlled Experiments on the Web, A/B testing.  Week 14: Cloud Computing; short course summary. </p>

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    1 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us