✅ PART 1: Install Java and Hadoop on Ubuntu
🧰 Step 1: Install Java (JDK)
sudo apt update
sudo apt install openjdk-11-jdk -y
java -version
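java -version confirms the runtime is on your PATH. To find the JDK's install directory (needed for JAVA_HOME in the next step), you can resolve the javac symlink. On a stock amd64 Ubuntu install this prints /usr/lib/jvm/java-11-openjdk-amd64, though the exact path varies by architecture:
readlink -f "$(which javac)" | sed 's:/bin/javac::'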
📦 Step 2: Download and Configure Hadoop (Standalone Mode)
🔽 Download Hadoop
cd ~
wget https://downloads.apache.org/hadoop/common/hadoop-3.3.6/hadoop-3.3.6.tar.gz
tar -xzf hadoop-3.3.6.tar.gz
mv hadoop-3.3.6 hadoop
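(Optional) Apache publishes a SHA-512 checksum file next to each release tarball, so you can verify the download before trusting it. If sha512sum rejects the checksum file's format, compare the printed hash by eye instead:
wget https://downloads.apache.org/hadoop/common/hadoop-3.3.6/hadoop-3.3.6.tar.gz.sha512
sha512sum -c hadoop-3.3.6.tar.gz.sha512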
🔧 Set Environment Variables
Edit ~/.bashrc:
nano ~/.bashrc
Add these at the end:
export HADOOP_HOME=~/hadoop
export PATH=$PATH:$HADOOP_HOME/bin
export JAVA_HOME=/usr/lib/jvm/java-11-openjdk-amd64
(Older guides also add export HADOOP_CLASSPATH=$JAVA_HOME/lib/tools.jar, but tools.jar was removed in JDK 9, so it does not exist in OpenJDK 11 and is not needed here.)
Apply the changes:
source ~/.bashrc
✅ Test:
hadoop version
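hadoop version also prints build details; the first line should match the release you unpacked:
Hadoop 3.3.6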
✅ PART 2: Write the WordCount Java Code
Create a folder and Java file:
mkdir ~/wordcount
cd ~/wordcount
nano WordCount.java
import java.io.IOException;
import java.util.StringTokenizer;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

public class WordCount {

  // Mapper: splits each input line into tokens and emits (word, 1) per token.
  public static class TokenizerMapper
      extends Mapper<Object, Text, Text, IntWritable> {

    private final static IntWritable one = new IntWritable(1);
    private Text word = new Text();

    @Override
    public void map(Object key, Text value, Context context)
        throws IOException, InterruptedException {
      StringTokenizer itr = new StringTokenizer(value.toString());
      while (itr.hasMoreTokens()) {
        word.set(itr.nextToken());
        context.write(word, one);
      }
    }
  }

  // Reducer: sums the counts for each word. The same class doubles as the
  // combiner, pre-aggregating each mapper's output before the shuffle.
  public static class IntSumReducer
      extends Reducer<Text, IntWritable, Text, IntWritable> {

    private IntWritable result = new IntWritable();

    @Override
    public void reduce(Text key, Iterable<IntWritable> values, Context context)
        throws IOException, InterruptedException {
      int sum = 0;
      for (IntWritable val : values) {
        sum += val.get();
      }
      result.set(sum);
      context.write(key, result);
    }
  }

  public static void main(String[] args) throws Exception {
    Configuration conf = new Configuration();
    Job job = Job.getInstance(conf, "word count");
    job.setJarByClass(WordCount.class);
    job.setMapperClass(TokenizerMapper.class);
    job.setCombinerClass(IntSumReducer.class);
    job.setReducerClass(IntSumReducer.class);
    job.setOutputKeyClass(Text.class);
    job.setOutputValueClass(IntWritable.class);
    FileInputFormat.addInputPath(job, new Path(args[0]));   // args[0]: input directory
    FileOutputFormat.setOutputPath(job, new Path(args[1])); // args[1]: output directory (must not exist)
    System.exit(job.waitForCompletion(true) ? 0 : 1);
  }
}
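To see how the pieces fit together, trace one line of input through the job: TokenizerMapper emits a (word, 1) pair per token, the framework groups pairs by key during the shuffle, and IntSumReducer sums each group (the combiner runs the same summing logic on each mapper's local output first):
map input:     "hadoop mapreduce hadoop"
map output:    (hadoop, 1)  (mapreduce, 1)  (hadoop, 1)
after shuffle: hadoop -> [1, 1]   mapreduce -> [1]
reduce output: (hadoop, 2)  (mapreduce, 1)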
✅ PART 3: Compile and Run the Program
🔧 Step 1: Compile
mkdir classes
javac -classpath "$HADOOP_HOME/share/hadoop/common/*:$HADOOP_HOME/share/hadoop/mapreduce/*" -d classes WordCount.java
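If compilation succeeds, the classes directory should hold one .class file per class defined in the source, with the nested classes named Outer$Inner:
ls classes
WordCount.class  WordCount$IntSumReducer.class  WordCount$TokenizerMapper.class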
📦 Step 2: Create a JAR
jar -cvf wordcount.jar -C classes/ .
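You can confirm the class files made it into the archive (jar also adds a META-INF/MANIFEST.MF entry):
jar -tf wordcount.jar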
✅ PART 4: Run WordCount Job (Standalone)
📁 Step 1: Create Input File
mkdir input
echo "hadoop mapreduce hadoop word count word count" > input/test.txt
▶️ Step 2: Run MapReduce Job
hadoop jar wordcount.jar WordCount input output
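If you re-run the job, delete the output directory first; Hadoop refuses to overwrite an existing output path and fails with FileAlreadyExistsException:
rm -rf output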
📄 Step 3: View Output
cat output/part-r-00000
count 2
hadoop 2
mapreduce 1
word 2