Configuring a Hadoop Cluster on Multiple Machines
Agenda
Modify your hosts file
Set up passwordless SSH from the master to all slaves
Set up passwordless SSH from all slaves to the master
Edit the masters file
Edit the slaves file
Modify the hadoop-env.sh file
Modify the core-site.xml file
Modify the hdfs-site.xml file
Modify the mapred-site.xml file
Format the name node
Start the Hadoop cluster
Stop the Hadoop cluster
Modify your hosts file
The hosts file contains mappings of IP addresses to hostnames. Edit your hosts file by typing the command below in your terminal:
sudo vi /etc/hosts
Add entries for the master and all slaves, as in the example below.
Repeat the same step on every machine in the cluster (master and slaves).
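A minimal set of example entries; the IP addresses here are placeholders for illustration, and slave2 assumes a cluster with more than one slave:

192.168.1.100   master
192.168.1.101   slave1
192.168.1.102   slave2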
The master needs to communicate with each slave machine
There should be passwordless SSH from the master machine to every slave machine. Run the following 3 commands to set up passwordless SSH from master to slave:

username@master:~> ssh-keygen -t rsa
username@master:~> ssh username@slave1 mkdir -p .ssh
username@master:~> cat .ssh/id_rsa.pub | ssh username@slave1 'cat >> .ssh/authorized_keys'

Repeat the same steps for each slave machine.
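A quick way to confirm the setup worked; if passwordless SSH is configured correctly, this prints the slave's hostname without asking for a password:

username@master:~> ssh username@slave1 hostname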
Each slave needs to communicate with the master machine
There should be passwordless SSH from each slave machine to the master machine. Run the following 3 commands to set up passwordless SSH from slave to master:

username@slave1:~> ssh-keygen -t rsa
username@slave1:~> ssh username@master mkdir -p .ssh
username@slave1:~> cat .ssh/id_rsa.pub | ssh username@master 'cat >> .ssh/authorized_keys'

Repeat the same steps on each slave machine.
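The same check as before, run in the other direction; this should print the master's hostname without a password prompt:

username@slave1:~> ssh username@master hostname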
Edit masters file
Open the masters file (HADOOP_HOME/conf/masters)
Add the master machine's entry to the file
Save the masters file
Make these changes on each machine in the cluster (master and slaves)
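With the hostnames used in the hosts file above, the complete masters file is a single line:

master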
Edit slaves file
Open the slaves file (HADOOP_HOME/conf/slaves)
Add an entry for every slave machine, one slave per line
Save the slaves file
Make these changes on each machine in the cluster (master and slaves)
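Continuing the example hosts file above (slave2 is an assumed second slave), the slaves file would look like this:

slave1
slave2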
Modify hadoop-env.sh file
The hadoop-env.sh file contains system-level environment variables. Make the following entries in HADOOP_HOME/conf/hadoop-env.sh:
export JAVA_HOME=/usr
export HADOOP_HOME=/home/neeraj/local_cluster_home/hadoop-1.0.3

Make these changes on each machine in the cluster (master and slaves)
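A quick sanity check that JAVA_HOME actually points at a Java installation; this should print the Java version rather than an error:

$JAVA_HOME/bin/java -version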
Modify core-site.xml file
We need to make the following entry in core-site.xml:
<configuration>
  <property>
    <name>fs.default.name</name>
    <value>hdfs://master:9000</value>
  </property>
  <property>
    <name>hadoop.tmp.dir</name>
    <value>/home/neeraj/local_cluster_home/hadoop1.0.3/hdfs_temp</value>
  </property>
</configuration>

Make these changes on each machine in the cluster (master and slaves)
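Hadoop normally creates the hadoop.tmp.dir directory itself, but creating it up front is a simple way to confirm the path exists and is writable by your user:

mkdir -p /home/neeraj/local_cluster_home/hadoop1.0.3/hdfs_temp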
Modify hdfs-site.xml file
We need to make the following entry in hdfs-site.xml:

<configuration>
  <property>
    <name>dfs.replication</name>
    <value>1</value>
    <description>The number of times each block of a file is replicated across the cluster. The default is 3.</description>
  </property>
  <property>
    <name>dfs.data.dir</name>
    <value>/home/neeraj/local_cluster_home/hadoop1.0.3/hdfs_data</value>
  </property>
</configuration>

Make these changes on each machine in the cluster (master and slaves)
Modify mapred-site.xml file
We need to make the following entry in mapred-site.xml:
<configuration>
  <property>
    <name>mapred.job.tracker</name>
    <value>master:9001</value>
    <description>The host and port that the MapReduce job tracker runs at.</description>
  </property>
</configuration>

Make these changes on each machine in the cluster (master and slaves)
Format your Namenode
Run the following command on your master machine, from the HADOOP_HOME/bin directory:

./hadoop namenode -format

Format the name node only once, when setting up a new cluster; reformatting an existing cluster erases the HDFS metadata.
Start your Hadoop cluster
Run the following command on the master machine, from the HADOOP_HOME/bin directory:

./start-all.sh

There is no need to start anything on the slave machines; the master starts the daemons on each slave over SSH.
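Assuming the default Hadoop 1.x ports, you can also check the cluster from a browser:

http://master:50070   (NameNode web UI)
http://master:50030   (JobTracker web UI)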
Check Hadoop daemons
Run the jps command on the master machine
Run the jps command on the slave machines
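With the configuration above, the expected daemons look roughly like this; the process IDs are illustrative and will differ on your machines:

username@master:~> jps
4850 NameNode
5021 SecondaryNameNode
5110 JobTracker
5400 Jps

username@slave1:~> jps
3200 DataNode
3310 TaskTracker
3520 Jps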
Stop your Hadoop cluster
Run the following command on the master machine, from the HADOOP_HOME/bin directory:

./stop-all.sh

There is no need to stop anything on the slave machines; the master stops the daemons on each slave over SSH.
Thanks
Contact Point: www.bispsolutions.com