Did You Notice That DB2 11 and Hadoop Are Now Closer To

Did You Notice That DB2 11 and Hadoop Are Now Closer To

Did you notice that DB2 11 and Hadoop are now closer to each other than ever before? DB2 11 includes features that allow integration with Hadoop/IBM BigInsights, which simply means that DB2 can work with Big Data much easier. This presentation will make a brief overview of the technologies behind this relationship and will show a step-by-step guideline of how to start exploring DB2 11 and Hadoop/BigInsights integration. 1 2 3 4 5 6 7 8 9 10 Technologies from left to right, up to bottom: Apache Hbase – distributed database modeled after Google’s Big table, runs on top of HDFS; http://hbase.apache.org/ Apache Pig - a platform for analyzing large data sets, programs written in Pig Latin; http://pig.apache.org/ Apache Zookeeper – distributed configuration and synchronization service; http://zookeeper.apache.org/ Oozie – worklfow scheduler to manage Hadoop jobs; http://oozie.apache.org/ Apache Chukwa – data collection system for monitoring distributed systems, built on top of Hadoop; https://chukwa.apache.org/ Apache Hive – a data warehouse infrastructure built on top of Hadoop, provides SQL like language – HiveQL; https://hive.apache.org/ Apache Hadoop - Open source software framework for storage and large scale parallel processing; http://hadoop.apache.org/ IBM Infosphere BigInsights – Hadoop based big data platform; http://www- 01.ibm.com/software/data/infosphere/biginsights/ Apache mahout – distributed machine learning algorithms for filtering, clustering, classification; https://mahout.apache.org/ Jaql – data processing and query language for big data; http://www- 11 01.ibm.com/software/data/infosphere/hadoop/jaql/ Json - an open standard format that uses human-readable text to transmit data objects consisting of attribute–value pairs; http://json.org/ Apache avro – serialization framework for Hadoop; http://avro.apache.org/ Sqoop – an interface for transferring the data between relational databases and Hadoop, Sqoop = SQL to Hadoop; http://sqoop.apache.org/ Apache flume – a service for for collecting, aggregating, and moving large amount of data; http://flume.apache.org/ 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70.

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    71 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us