FACULTY OF ENGINEERING & TECHNOLOGY
Big-Data Analysis (203105348)
B. Tech. 7th Sem
PRACTICAL-1
AIM: To understand the overall programming architecture using the MapReduce API.
A MapReduce task is mainly divided into two phases: the map phase and the reduce phase.
1. map(), filter(), and reduce() in Python.
2. These functions are most commonly used with lambda functions.
1. map():
“A map function executes a given instruction or function on every item of an iterable, which could be a list, tuple, set, etc.”
SYNTAX:
map(function, iterable)
example:
items=[1,2,3,4,5]
a=list(map((lambda x: x**3), items))
print(a)
output:
[1, 8, 27, 64, 125]
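Two supplementary points beyond the example above: map() also accepts an ordinary named function instead of a lambda, and it can take more than one iterable, in which case the function must accept that many arguments:

def cube(x):
    return x ** 3

print(list(map(cube, [1, 2, 3])))                       # [1, 8, 27]
print(list(map(lambda x, y: x + y, [1, 2], [10, 20])))  # [11, 22]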
2. filter():
“A filter function in Python tests a specific user-defined condition against each element of an iterable and returns an iterable of the elements that satisfy the condition or, in other words, for which the function returns True.”
SYNTAX:
filter(function, iterable)
example:
a=[1,2,3,4,5,6]
b=[2,5,0,7,3]
c=list(filter(lambda x: x in a, b))
print(c)
output:
[2, 5, 3]
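A small supplement to the example above: passing None as the function keeps only the truthy elements of the iterable:

print(list(filter(None, [0, 1, '', 'py', None, 3])))  # [1, 'py', 3]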
3. reduce():
“The reduce function applies a function of two arguments cumulatively to the items of an iterable and gives back a single value as the result.”
We have to import the reduce function from the functools module using the statement from functools import reduce.
SYNTAX:
reduce(function, iterable)
example:
from functools import reduce
a=reduce((lambda x, y: x*y), [1,2,3,4])
print(a)
output:
24
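As a bridge to the MapReduce model used in Practical-2, the following sketch chains map(), filter(), and reduce() to count words; the sample text and the length filter are illustrative assumptions, not part of the manual:

from functools import reduce

text = "big data map reduce map data data"  # made-up sample text

# Map: emit a (word, 1) pair for every word
pairs = list(map(lambda w: (w, 1), text.split()))

# Filter: keep only words longer than two characters
pairs = list(filter(lambda p: len(p[0]) > 2, pairs))

# Reduce: fold the pairs into a word -> count dictionary;
# the third argument ({}) is reduce()'s optional initializer
counts = reduce(lambda acc, p: {**acc, p[0]: acc.get(p[0], 0) + p[1]}, pairs, {})
print(counts)  # {'big': 1, 'data': 3, 'map': 2, 'reduce': 1}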
PRACTICAL-2
AIM: To write a word count program in MapReduce over HDFS.
Description:
MapReduce is a framework for processing large datasets using a large number of computers (nodes), collectively referred to as a cluster. Processing can occur on data stored in a file system (HDFS). It is a method for distributing computation across multiple nodes: each node processes the data that is stored at that node.
It consists of two main phases:
1. Mapper phase
2. Reducer phase
The input data set is split into independent blocks that are processed in parallel. Each input split is converted into key-value pairs. The mapper logic processes each key-value pair and produces intermediate key-value pairs based on the implementation logic; these can be of a different type from the input key-value pairs. The output of the mapper is passed to the reducer: the output of the map function is the input for the reduce function. The intermediate key-value pairs are sorted and grouped by key, the reducer logic is applied to each key's values, and the output is produced in the desired format and stored in HDFS.
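To make this data flow concrete, here is a minimal pure-Python sketch of the three stages (map, shuffle/sort, reduce); the input lines are made up for illustration, and no Hadoop is involved:

from itertools import groupby

# Two hypothetical input records (lines of text)
lines = ["hello world", "hello hadoop"]

# Mapper phase: turn each record into intermediate (key, value) pairs
intermediate = [(word, 1) for line in lines for word in line.split()]

# Shuffle/sort: group the intermediate pairs by key, as the framework does
intermediate.sort(key=lambda kv: kv[0])

# Reducer phase: apply the reducer logic (summing) to each key's values
for word, group in groupby(intermediate, key=lambda kv: kv[0]):
    print(word, sum(v for _, v in group))

# Output:
# hadoop 1
# hello 2
# world 1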
Execution Steps:
Download the training VM: http://content.udacity-data.com/courses/ud617/Cloudera-Udacity-Training-VM-4.1.1.c.zip
Create the jar file of this program and name it wordcount.jar.
Create the input directory in HDFS and copy the input file into it:
hadoop fs -mkdir /input
hadoop fs -put /home/training/Desktop/sample.txt /input
Run the jar file (the output directory must not already exist):
hadoop jar wordcount.jar wordcount /input /output
Output:
hadoop fs -cat /output/part-00000
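For illustration, if sample.txt contained the single line "hello world hello" (a made-up example, not from the manual), the tab-separated part-00000 output would look like:
hello 2
world 1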
Word Count Java Program
import org.apache.hadoop.conf.Configured;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapred.FileInputFormat;
import org.apache.hadoop.mapred.FileOutputFormat;
import org.apache.hadoop.mapred.JobClient;
import org.apache.hadoop.mapred.JobConf;
import org.apache.hadoop.util.Tool;
import org.apache.hadoop.util.ToolRunner;
public class wordcount extends Configured implements Tool {
@Override
public int run(String[] args) throws Exception {
if(args.length<2)
{
System.out.println("Plz Give Input Output Directory Correctly");
return -1;
}
// Configure the job and set the input/output paths
JobConf conf = new JobConf(wordcount.class);
FileInputFormat.setInputPaths(conf, new Path(args[0]));
FileOutputFormat.setOutputPath(conf, new Path(args[1]));
// Wire up the mapper and reducer classes
conf.setMapperClass(wordmapper.class);
conf.setReducerClass(wordreducer.class);
// Intermediate and final (key, value) types: (word, count)
conf.setMapOutputKeyClass(Text.class);
conf.setMapOutputValueClass(IntWritable.class);
conf.setOutputKeyClass(Text.class);
conf.setOutputValueClass(IntWritable.class);
// Submit the job and wait for it to finish
JobClient.runJob(conf);
return 0;
}
public static void main(String args[]) throws Exception
{
int exitcode = ToolRunner.run(new wordcount(), args);
System.exit(exitcode);
}
}
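Note: ToolRunner.run() parses the standard Hadoop command-line options before invoking run(), which is why the driver extends Configured and implements Tool. The mapper and reducer below are public classes, so each belongs in its own source file.

Word Count Mapper (wordmapper.java):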
import java.io.IOException;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapred.MapReduceBase;
import org.apache.hadoop.mapred.Mapper;
import org.apache.hadoop.mapred.OutputCollector;
import org.apache.hadoop.mapred.Reporter;
public class wordmapper extends MapReduceBase implements
Mapper<LongWritable,Text,Text,IntWritable>
{
public void map(LongWritable key, Text value,
OutputCollector<Text, IntWritable> output, Reporter r)
throws IOException {
// Split the line into words on single spaces
String s = value.toString();
for(String word : s.split(" "))
{
if(word.length()>0)
{
output.collect(new Text(word), new IntWritable(1));
}
}
}
}
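The mapper emits a (word, 1) pair for every non-empty word in the line; the framework then groups these pairs by word before the reducer runs.

Word Count Reducer (wordreducer.java):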
import java.io.IOException;
import java.util.Iterator;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapred.MapReduceBase;
import org.apache.hadoop.mapred.OutputCollector;
import org.apache.hadoop.mapred.Reducer;
import org.apache.hadoop.mapred.Reporter;
public class wordreducer extends MapReduceBase implements
Reducer<Text,IntWritable,Text,IntWritable>
{
public void reduce(Text key, Iterator<IntWritable> values,
OutputCollector<Text, IntWritable> output, Reporter r)
throws IOException {
int count = 0;
// Sum all the counts emitted for this key (word)
while(values.hasNext())
{
IntWritable i= values.next();
count+= i.get();
}
output.collect(key, new IntWritable(count));
}
}
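The reducer receives each word together with an iterator over all of its 1 values, sums them, and emits the final (word, count) pair; these pairs form the part-00000 file read with hadoop fs -cat above.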