Part II (c) – Desktop Installation
© Net Serpents LLC, USA

Installation

Desktop Installation
• Supported Platforms
• Required Software
• Releases & Mirror Sites
• Install
• Configure
• Format
• Start/Stop
• Verify

Supported Platforms
• GNU/Linux is supported for
  - Development
  - Production
  - Demonstrated on a 2000-node cluster
• Win32
  - Development only
  - Not supported as a production platform

Required Software
The following must be installed first:
• Java 1.6.x or higher
• ssh
  - Ubuntu: ssh and rsync
  - Windows: openssh

Releases and Mirror Sites

Releases
• http://hadoop.apache.org/releases.html
• Stable release: 2.7.1 (July 2015)
• Stable release: 2.6.0 (November 2014)
• Earlier good releases:
  - 2.4.0 (April 2014)
  - 2.2.0 (GA release, October 2013)
  - 1.0.0 (December 2011)

Mirror Sites
• Downloads are available at several mirror sites
• Suggested by Apache:
  - http://www.apache.org/dyn/closer.cgi/hadoop/common
• Other mirror sites at:
  - http://www.apache.org/dyn/closer.cgi/hadoop/common

Install - Overview
Install Hadoop 2.6.0 on Ubuntu 14.04
(Ubuntu is the most popular Linux distribution.)

Step 1 – Install Java
Step 2 – Create a dedicated hadoop user
Step 3 – Install ssh
Step 4 – Create ssh certificates
Step 5 – Install Hadoop
Step 6 – Set up configuration files
Step 7 – Format
Step 8 – Start/Stop
Step 9 – Verify

Step 1 – Install Java

Log in as an admin user:

    $ cd ~

    # Update the source list
    $ sudo apt-get update
    $ sudo apt-get install default-jdk

    # Verify that the Java version is 1.6.0 or higher
    $ java -version
    java version "1.7.0_65"
    OpenJDK Runtime Environment (IcedTea 2.5.3) (7u71-2.5.3-0ubuntu0.14.04.1)
    OpenJDK 64-Bit Server VM (build 24.65-b04, mixed mode)

Step 2 – Create a dedicated Hadoop user

    # Create a hadoop group
    $ sudo addgroup hadoop
    Adding group `hadoop' (GID 1009) …
    Done.

    # Create the hadoop user
    $ sudo adduser --ingroup hadoop huser
    Adding user `huser' ...
    Adding new user `huser' (1001) with group `hadoop' ...
    Creating home directory `/home/huser' ...
    Copying files from `/etc/skel' ...
    Enter new UNIX password:
    Retype new UNIX password:
    passwd: password updated successfully
    Changing the user information for huser
    Enter the new value, or press ENTER for the default
    Full Name []:
    Room Number []:
    Work Phone []:
    Home Phone []:
    Other []:
    Is the information correct? [Y/n] Y

    # Add the new user to sudoers
    $ sudo adduser huser sudo
    [sudo] password for admin:
    Adding user `huser' to group `sudo' ...
    Adding user huser to group sudo
    Done.

Step 3 – Install SSH

    $ sudo apt-get install ssh

    # Verify that SSH is installed
    $ which ssh
    /usr/bin/ssh
    $ which sshd
    /usr/sbin/sshd

Step 4 – Create SSH Certificates

    $ sudo su huser

    # Generate a key pair
    $ ssh-keygen -f ~/.ssh/id_rsa -t rsa -P ""
    Generating public/private rsa key pair.
    Enter file in which to save the key (/home/huser/.ssh/id_rsa):
    Created directory '/home/huser/.ssh'.
    Your identification has been saved in /home/huser/.ssh/id_rsa.
    Your public key has been saved in /home/huser/.ssh/id_rsa.pub.
    The key fingerprint is:
    20:6c:f3:ff:0f:33:bf:30:72:c3:22:70:24:cc:2d:d3 huser@laptop
    The key's randomart image is:
    +--[ RSA 2048]----+
    | .oo.o |

    # Add the public key to the list of authorized keys to avoid being prompted for a password
    $ cat $HOME/.ssh/id_rsa.pub >> $HOME/.ssh/authorized_keys

Step 5 – Install Hadoop

    # Download the distribution from a mirror site
    $ sudo wget http://mirrors.sonic.net/apache/hadoop/common/hadoop-2.6.0/hadoop-2.6.0.tar.gz

    # Extract the files
    $ tar xvzf hadoop-2.6.0.tar.gz
    $ cd hadoop-2.6.0

    # Move the files to /usr/local/hadoop
    $ sudo mkdir /usr/local/hadoop
    $ sudo mv * /usr/local/hadoop

    # Change ownership to the hadoop user
    $ sudo chown -R huser:hadoop /usr/local/hadoop

Step 6 – Configure

    # Update links to point to Java
    $ update-alternatives --config java
    There is only one alternative in link group java (providing /usr/bin/java): /usr/lib/jvm/java-7-openjdk-amd64/jre/bin/java
    Nothing to configure.
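The path printed by update-alternatives already contains the JDK root. As a minimal sketch, assuming the Debian/Ubuntu directory layout shown in the output above, a small shell helper can strip the trailing components to recover that root. The function name java_home_from is hypothetical and is not part of Hadoop or Ubuntu.

```shell
# Hypothetical helper: derive the JDK root from a resolved java/javac path.
# Assumes the layout <root>/bin/javac or <root>[/jre]/bin/java.
java_home_from() {
    jh_path=$1
    jh_path=${jh_path%/bin/java}    # drop a trailing /bin/java, if present
    jh_path=${jh_path%/bin/javac}   # drop a trailing /bin/javac, if present
    jh_path=${jh_path%/jre}         # drop a trailing /jre, if present
    printf '%s\n' "$jh_path"
}

# Example with the path reported by update-alternatives:
#   java_home_from /usr/lib/jvm/java-7-openjdk-amd64/jre/bin/java
# On a live system the argument could come from: readlink -f "$(which java)"
```

For the path shown on this slide, the helper prints /usr/lib/jvm/java-7-openjdk-amd64.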
    # Note down the JAVA_HOME value
    $ which javac
    /usr/bin/javac
    $ readlink -f /usr/bin/javac
    /usr/lib/jvm/java-7-openjdk-amd64/bin/javac

(Note: JAVA_HOME is everything before /bin/javac.)

    # Add variables to the end of .bashrc
    $ vi ~/.bashrc
    export JAVA_HOME=/usr/lib/jvm/java-7-openjdk-amd64
    export HADOOP_INSTALL=/usr/local/hadoop
    export PATH=$PATH:$HADOOP_INSTALL/bin
    export PATH=$PATH:$HADOOP_INSTALL/sbin
    export HADOOP_MAPRED_HOME=$HADOOP_INSTALL
    export HADOOP_COMMON_HOME=$HADOOP_INSTALL
    export HADOOP_HDFS_HOME=$HADOOP_INSTALL
    export YARN_HOME=$HADOOP_INSTALL
    export HADOOP_COMMON_LIB_NATIVE_DIR=$HADOOP_INSTALL/lib/native
    export HADOOP_OPTS="-Djava.library.path=$HADOOP_INSTALL/lib"

    # Execute the commands in .bashrc
    $ source ~/.bashrc

    # Configure hadoop-env.sh: change the JAVA_HOME variable
    $ vi /usr/local/hadoop/etc/hadoop/hadoop-env.sh
    export JAVA_HOME=/usr/lib/jvm/java-7-openjdk-amd64

    # Configure core-site.xml
    # First create a tmp folder for hadoop
    $ sudo mkdir -p /app/hadoop/tmp
    $ sudo chown huser:hadoop /app/hadoop/tmp

    # Modify core-site.xml
    $ vi /usr/local/hadoop/etc/hadoop/core-site.xml

Modify as follows:

    <configuration>
      <property>
        <name>hadoop.tmp.dir</name>
        <value>/app/hadoop/tmp</value>
        <description>A base for other temporary directories.</description>
      </property>
      <property>
        <name>fs.default.name</name>
        <value>hdfs://localhost:54310</value>
        <description>The name of the default file system. A URI whose scheme and authority determine the FileSystem implementation. The uri's scheme determines the config property (fs.SCHEME.impl) naming the FileSystem implementation class. The uri's authority is used to determine the host, port, etc. for a filesystem.</description>
      </property>
    </configuration>

    # Configure mapred-site.xml
    # First copy the file from the template provided
    $ cp /usr/local/hadoop/etc/hadoop/mapred-site.xml.template /usr/local/hadoop/etc/hadoop/mapred-site.xml

    # Modify mapred-site.xml
    $ vi /usr/local/hadoop/etc/hadoop/mapred-site.xml

Modify as follows:

    <configuration>
      <property>
        <name>mapred.job.tracker</name>
        <value>localhost:54311</value>
        <description>The host and port that the MapReduce job tracker runs at.</description>
      </property>
    </configuration>

    # Configure hdfs-site.xml
    # First create the directories for the data node and the name node
    $ sudo mkdir -p /usr/local/hadoop_store/hdfs/namenode
    $ sudo mkdir -p /usr/local/hadoop_store/hdfs/datanode
    $ sudo chown -R huser:hadoop /usr/local/hadoop_store

    # Modify hdfs-site.xml
    $ vi /usr/local/hadoop/etc/hadoop/hdfs-site.xml

Modify as follows:

    <configuration>
      <property>
        <name>dfs.replication</name>
        <value>1</value>
        <description>This is a default value for block replication. This could be different from the value specified when the file is created. This value is just a default if none is specified at file creation.</description>
      </property>
      <property>
        <name>dfs.namenode.name.dir</name>
        <value>file:/usr/local/hadoop_store/hdfs/namenode</value>
      </property>
      <property>
        <name>dfs.datanode.data.dir</name>
        <value>file:/usr/local/hadoop_store/hdfs/datanode</value>
      </property>
    </configuration>

Step 7 – Format

    $ hadoop namenode -format
    DEPRECATED: Use of this script to execute hdfs command is deprecated.
    Instead use the hdfs command for it.
    15/04/18 14:43:03 INFO namenode.NameNode: STARTUP_MSG:
    /************************************************************
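
The deprecation notice in the output points to the hdfs command; the equivalent invocation is hdfs namenode -format. Formatting fails if the directories from Step 6 are missing or not writable, so a quick check beforehand can save a retry. The following is a minimal sketch, assuming the directory paths used in Step 6; the function name check_hdfs_dirs is hypothetical and is not part of Hadoop.

```shell
# Hypothetical helper: report which of the given directories are missing
# or not writable by the current user. Returns non-zero if any check fails.
check_hdfs_dirs() {
    chk_status=0
    for chk_dir in "$@"; do
        if [ ! -d "$chk_dir" ]; then
            echo "missing: $chk_dir"
            chk_status=1
        elif [ ! -w "$chk_dir" ]; then
            echo "not writable: $chk_dir"
            chk_status=1
        fi
    done
    return $chk_status
}

# Usage as huser (paths from Step 6); format only when the check passes:
#   check_hdfs_dirs /app/hadoop/tmp \
#       /usr/local/hadoop_store/hdfs/namenode \
#       /usr/local/hadoop_store/hdfs/datanode \
#     && hdfs namenode -format
```

Chaining the format command with && keeps it from running when any directory is reported.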