Practical 2c

Uploaded by

rodylogin69

Practical 2-1: Write a program to calculate word count using the MapReduce framework.

Input file – input.txt

What do you mean by Object
What do you know about Java
What is Java Virtual Machine
How Java enabled High Performance

import java.io.IOException;
import java.util.StringTokenizer;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

public class WordCount {

    // Mapper: splits each input line into whitespace-separated tokens
    // and emits the pair (token, 1) for every token.
    public static class TokenizerMapper extends Mapper<Object, Text, Text, IntWritable> {
        private final static IntWritable one = new IntWritable(1);
        private Text word = new Text();

        public void map(Object key, Text value, Context context)
                throws IOException, InterruptedException {
            StringTokenizer itr = new StringTokenizer(value.toString());
            while (itr.hasMoreTokens()) {
                word.set(itr.nextToken());
                context.write(word, one);
            }
        }
    }

    // Reducer: adds up all counts received for one word and emits (word, total).
    public static class IntSumReducer extends Reducer<Text, IntWritable, Text, IntWritable> {
        private IntWritable result = new IntWritable();

        public void reduce(Text key, Iterable<IntWritable> values, Context context)
                throws IOException, InterruptedException {
            int sum = 0;
            for (IntWritable val : values) {
                sum += val.get();
            }
            result.set(sum);
            context.write(key, result);
        }
    }

    // Driver: args[0] is the HDFS input path, args[1] is the HDFS output path.
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        Job job = Job.getInstance(conf, "word count");
        job.setJarByClass(WordCount.class);
        job.setMapperClass(TokenizerMapper.class);
        // The reducer doubles as a combiner, pre-summing counts on the map side.
        job.setCombinerClass(IntSumReducer.class);
        job.setReducerClass(IntSumReducer.class);
        job.setOutputKeyClass(Text.class);
        job.setOutputValueClass(IntWritable.class);
        FileInputFormat.addInputPath(job, new Path(args[0]));
        FileOutputFormat.setOutputPath(job, new Path(args[1]));
        System.exit(job.waitForCompletion(true) ? 0 : 1);
    }
}
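To make the map step concrete, here is a minimal plain-Java sketch (no Hadoop required) of what TokenizerMapper would emit for the first line of input.txt. The class name MapStepSketch and the method emit are hypothetical names chosen for illustration; only the StringTokenizer usage mirrors the program above.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.StringTokenizer;

// Illustration only: MapStepSketch and emit are hypothetical names,
// not part of Hadoop or of the WordCount program.
final class MapStepSketch {

    // Splits a line on whitespace (the StringTokenizer default)
    // and pairs every token with the count 1, separated by a tab.
    static List<String> emit(String line) {
        List<String> pairs = new ArrayList<>();
        StringTokenizer itr = new StringTokenizer(line);
        while (itr.hasMoreTokens()) {
            pairs.add(itr.nextToken() + "\t1");
        }
        return pairs;
    }

    public static void main(String[] args) {
        // Prints one (word, 1) pair per token of the first input line.
        for (String pair : emit("What do you mean by Object")) {
            System.out.println(pair);
        }
    }
}
```

Tokens are compared exactly as written, so "Java" and "java" would be counted as different words, and punctuation attached to a word stays part of the token.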
Compilation and Execution:
Let us assume we are in the home directory of the Hadoop user (for example, /home/hadoop).

Follow the steps given below to compile and execute the above program.
Step 1 − Use the following command to create a directory to store the compiled Java classes.
$ mkdir units

Step 2 − Download hadoop-core-1.4.0.jar, which is needed to compile and run the MapReduce program.
You can download the jar from mvnrepository.com.
Let us assume the jar is saved in /home/hadoop/.

Step 3 − Use the following commands to compile the WordCount.java program and to create a jar for the
program.
$ javac -classpath hadoop-core-1.4.0.jar -d units WordCount.java
$ jar -cvf units.jar -C units/ .

Step 4 − Use the following command to create an input directory in HDFS.

$HADOOP_HOME/bin/hadoop fs -mkdir input_dir

Step 5 − Use the following command to copy the input file named input.txt into the input directory of HDFS.
$HADOOP_HOME/bin/hadoop fs -put /home/hadoop/input.txt input_dir

Step 6 − Use the following command to verify the files in the input directory.
$HADOOP_HOME/bin/hadoop fs -ls input_dir/

Step 7 − Use the following command to run the word count application, taking input files from the input
directory. The main class is WordCount, which the program above declares without a package.
$HADOOP_HOME/bin/hadoop jar units.jar WordCount input_dir output_dir
Wait until the job finishes. After execution, the console output reports the number of input splits, Map
tasks, and Reducer tasks.
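The two directory names on the command line reach main as args[0] and args[1], and the program above reads them without checking. A hedged sketch of a guard that could run before the job is configured follows; ArgsCheckSketch and parse are hypothetical names, and the usage text is only a suggestion.

```java
// Illustration only: ArgsCheckSketch and parse are hypothetical names.
final class ArgsCheckSketch {

    // Returns {inputPath, outputPath} or fails with a usage message
    // when the command line does not hold exactly two arguments.
    static String[] parse(String[] args) {
        if (args.length != 2) {
            throw new IllegalArgumentException(
                    "Usage: WordCount <input path> <output path>");
        }
        return new String[] { args[0], args[1] };
    }

    public static void main(String[] args) {
        String[] paths = parse(new String[] { "input_dir", "output_dir" });
        System.out.println("input=" + paths[0] + " output=" + paths[1]);
    }
}
```

Note also that the output directory must not exist before the job starts; Hadoop refuses to overwrite an existing output path.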

Step 8 − Use the following command to verify the resultant files in the output folder.
$HADOOP_HOME/bin/hadoop fs -ls output_dir/

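Step 9 − Use the following command to see the output. With the org.apache.hadoop.mapreduce API used above, the reducer writes its result to a file named part-r-00000 in the output directory (part-00000 is the name used by the older mapred API).
$HADOOP_HOME/bin/hadoop fs -cat output_dir/part-r-00000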
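Following is the output generated by the MapReduce program. Each line holds a word and its count separated by a tab. The lines are sorted by key, with uppercase letters ahead of lowercase.
High	1
How	1
Java	3
Machine	1
Object	1
Performance	1
Virtual	1
What	3
about	1
by	1
do	2
enabled	1
is	1
know	1
mean	1
you	2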
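
The counts above can be cross-checked without a cluster. The following plain-Java sketch runs the same tokenize-and-sum logic over the four lines of input.txt; LocalWordCount and count are hypothetical names, and the TreeMap only stands in for Hadoop's sort of keys between map and reduce (for these ASCII words the two orderings agree).

```java
import java.util.Map;
import java.util.StringTokenizer;
import java.util.TreeMap;

// Illustration only: a local stand-in for the MapReduce job, not Hadoop code.
final class LocalWordCount {

    // Tokenizes every line on whitespace and sums the occurrences per word.
    // TreeMap keeps the keys sorted, uppercase letters before lowercase.
    static Map<String, Integer> count(String[] lines) {
        Map<String, Integer> counts = new TreeMap<>();
        for (String line : lines) {
            StringTokenizer itr = new StringTokenizer(line);
            while (itr.hasMoreTokens()) {
                counts.merge(itr.nextToken(), 1, Integer::sum);
            }
        }
        return counts;
    }

    public static void main(String[] args) {
        String[] input = {
            "What do you mean by Object",
            "What do you know about Java",
            "What is Java Virtual Machine",
            "How Java enabled High Performance"
        };
        // Prints one "word<TAB>count" line per distinct word.
        for (Map.Entry<String, Integer> e : count(input).entrySet()) {
            System.out.println(e.getKey() + "\t" + e.getValue());
        }
    }
}
```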
