0% found this document useful (0 votes)

30 views3 pages

Bda Practical 2

data analyst

Uploaded by

varmavikash990

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

30 views3 pages

Bda Practical 2

data analyst

Uploaded by

varmavikash990

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 3

Faculty of Engineering & Technology

Big Data Analytics (203105348)

B. Tech CSE 4rd Year 7th Semester

PRACTICAL 2

Aim: Write a program of Word Count in Map Reduce over HDFS.

Description:
MapReduce is a framework for processing large datasets using a large number of computers
(nodes), collectively referred to as a cluster. Processing can occur on data stored in a file
system (HDFS).A method for distributing computation across multiple nodes. Each node
processes the data that is stored at that node.

Consists of two main phases

Mapper Phase

Reduce phase

Input data set is split into independent blocks – processed in parallel. Each input split is
converted in Key Value pairs. Mapper logic processes each key value pair and produces and
intermediate key value pairs based on the implementation logic. Resultant key value pairs can
be of different type from that of input key value pairs. The output of Mapper is passed to the
reducer. Output of Mapper function is the input for Reducer. Reducer sorts the intermediate
key value pairs. Applies reducer logic upon the key value pairs and produces the output in
desired format. Output is stored in HDFS

Enrollment No.: 2203051057106

Roll Number: 68
Div: 7A9(CSE)
Faculty of Engineering & Technology
Big Data Analytics (203105348)
B. Tech CSE 4rd Year 7th Semester
Code:

import urllib.request
import random
from operator import itemgetter

current_word={}
current_count=0
story ='http://sixty-north.com/c/t.txt'
request=urllib.request.urlopen(story)
response=urllib.request.urlopen(story)

each_word=[]
words=None
count=1
same_words={}
word=[]

for line in response:

line_words=line.split()
for word in line_words:
each_word.append(word)

for words in each_word:

if words.lower() not in same_words.keys():
same_words[words.lower()]=1
else:
same_words[words.lower()]+=1

for each in same_words.keys():

print("word =",each,"count =",same_words[each])

Enrollment No.: 2203051057106

Roll Number: 68
Div: 7A9(CSE)
Faculty of Engineering & Technology
Big Data Analytics (203105348)
B. Tech CSE 4rd Year 7th Semester
Output:

Enrollment No.: 2203051057106

Roll Number: 68
Div: 7A9(CSE)

EDM Dumps
20% (10)
EDM Dumps
4 pages
key นายสิบตำรวจ อำนวยการ ตม 6 PDF
No ratings yet
key นายสิบตำรวจ อำนวยการ ตม 6 PDF
17 pages
Module2 C MapReduceParadigm
No ratings yet
Module2 C MapReduceParadigm
74 pages
Through The Language Glass Why The World PDF
0% (6)
Through The Language Glass Why The World PDF
7 pages
BDA Lab Manual 200305105108
No ratings yet
BDA Lab Manual 200305105108
44 pages
MapReduce for Big Data Developers
No ratings yet
MapReduce for Big Data Developers
9 pages
Map Reduce
No ratings yet
Map Reduce
30 pages
Installing Ubuntu Server
100% (1)
Installing Ubuntu Server
13 pages
Unit 3 - Map Reduce Applications
No ratings yet
Unit 3 - Map Reduce Applications
25 pages
02 The MapReduce Computational Model 22-04
No ratings yet
02 The MapReduce Computational Model 22-04
12 pages
Hadoop and Spark Overview
No ratings yet
Hadoop and Spark Overview
34 pages
MapReduce Programming Model Guide
No ratings yet
MapReduce Programming Model Guide
55 pages
Lecture 4: Mapreduce and Hadoop: Indranil Gupta (Indy)
No ratings yet
Lecture 4: Mapreduce and Hadoop: Indranil Gupta (Indy)
37 pages
MIL Module 2
No ratings yet
MIL Module 2
2 pages
Performance Tuning Guide As400
No ratings yet
Performance Tuning Guide As400
136 pages
Assignment 04 - Saiful Islam
No ratings yet
Assignment 04 - Saiful Islam
6 pages
Alteryx Webinar Lecture 1 - Slides PDF
100% (1)
Alteryx Webinar Lecture 1 - Slides PDF
56 pages
Wikipedia Consensus
No ratings yet
Wikipedia Consensus
6 pages
Practical-1 AIM: To Understand The Overall Programming Architecture Using Map Reduce Api
No ratings yet
Practical-1 AIM: To Understand The Overall Programming Architecture Using Map Reduce Api
7 pages
Mathematics Exercise Solutions
No ratings yet
Mathematics Exercise Solutions
17 pages
Module2 C MapReduceParadigm
No ratings yet
Module2 C MapReduceParadigm
74 pages
Final ETI Micro Project Report
0% (1)
Final ETI Micro Project Report
17 pages
Bda Lab Exercises Lab Mannual - 2023
No ratings yet
Bda Lab Exercises Lab Mannual - 2023
72 pages
Paper Map Reduce
No ratings yet
Paper Map Reduce
16 pages
Bda Megh
No ratings yet
Bda Megh
50 pages
Map-Reduce For Parallel Computing: Amit Jain
No ratings yet
Map-Reduce For Parallel Computing: Amit Jain
72 pages
Robotics Lab Manual
No ratings yet
Robotics Lab Manual
33 pages
Big Data 4 Vivek
No ratings yet
Big Data 4 Vivek
3 pages
Chapter 9 - Processing Big Data With Mapreduce
No ratings yet
Chapter 9 - Processing Big Data With Mapreduce
157 pages
ECS765P - W2 - The MapReduce Programming Model
No ratings yet
ECS765P - W2 - The MapReduce Programming Model
53 pages
084 Liza Bda File
No ratings yet
084 Liza Bda File
23 pages
BDA Mayur
No ratings yet
BDA Mayur
43 pages
Ir MR 1
No ratings yet
Ir MR 1
34 pages
Bda Lab
No ratings yet
Bda Lab
11 pages
MapReduce Algorithms Assignment
No ratings yet
MapReduce Algorithms Assignment
6 pages
Introduction To MapReduce
No ratings yet
Introduction To MapReduce
43 pages
Map Reduce Programming
No ratings yet
Map Reduce Programming
74 pages
BDA Practical
No ratings yet
BDA Practical
18 pages
Nti Serimux S 16 Ds
No ratings yet
Nti Serimux S 16 Ds
4 pages
E-Wallet Adoption and Impact Study
No ratings yet
E-Wallet Adoption and Impact Study
30 pages
Exp5 BDI 60004200124
No ratings yet
Exp5 BDI 60004200124
5 pages
CC Unit-7
No ratings yet
CC Unit-7
16 pages
Lec 8
No ratings yet
Lec 8
24 pages
BDA Manual SHUBHAM
No ratings yet
BDA Manual SHUBHAM
22 pages
Bda Unit III r20csm
No ratings yet
Bda Unit III r20csm
54 pages
Map Reduce - 3
No ratings yet
Map Reduce - 3
23 pages
09b - MapReduce
No ratings yet
09b - MapReduce
44 pages
Hadoop Architecture & MapReduce Guide
No ratings yet
Hadoop Architecture & MapReduce Guide
7 pages
BDA - Manual - 1to6 Ayushi
No ratings yet
BDA - Manual - 1to6 Ayushi
22 pages
Practical-2 Aim: Write A Program of Word Count in Map Reduce Over HDFS. Description
No ratings yet
Practical-2 Aim: Write A Program of Word Count in Map Reduce Over HDFS. Description
6 pages
BigData-Assignment3-CSP 554
No ratings yet
BigData-Assignment3-CSP 554
5 pages
Jhilick Latest
No ratings yet
Jhilick Latest
4 pages
Lec 8
No ratings yet
Lec 8
19 pages
3.Map-Reduce Framework - 1
No ratings yet
3.Map-Reduce Framework - 1
47 pages
Lez.d-01-Hadoop (A) Intro
No ratings yet
Lez.d-01-Hadoop (A) Intro
58 pages
BDA Final Manual 1-8 Sourav
No ratings yet
BDA Final Manual 1-8 Sourav
43 pages
Mapreduce 190419130907
No ratings yet
Mapreduce 190419130907
12 pages
Big Data Practical 2
No ratings yet
Big Data Practical 2
11 pages
PGP Machine Learning Brochure
No ratings yet
PGP Machine Learning Brochure
12 pages
DE 3000 Brochure
No ratings yet
DE 3000 Brochure
4 pages
INTERNAL
No ratings yet
INTERNAL
11 pages
MapReduce & Hadoop for CS Students
No ratings yet
MapReduce & Hadoop for CS Students
25 pages
CS-702 (D) BigData
No ratings yet
CS-702 (D) BigData
61 pages
Map Reduce Design and Execution Framework Part 1
No ratings yet
Map Reduce Design and Execution Framework Part 1
19 pages
Hdfs MR Wordcount
No ratings yet
Hdfs MR Wordcount
16 pages
Big Data Infrastructure: Week 2: Mapreduce Algorithm Design (2/2)
No ratings yet
Big Data Infrastructure: Week 2: Mapreduce Algorithm Design (2/2)
55 pages
REAKTOR 6 What Is New English 072220
No ratings yet
REAKTOR 6 What Is New English 072220
34 pages
Monitoring Plant Health Andd Detection of Plant Disease Using Iot
No ratings yet
Monitoring Plant Health Andd Detection of Plant Disease Using Iot
15 pages
Cloud Computing & MapReduce Basics
No ratings yet
Cloud Computing & MapReduce Basics
55 pages
Bda Lab Manual
No ratings yet
Bda Lab Manual
20 pages
Parlab Parallel Boot Camp Cloud Computing With Mapreduce and Hadoop
No ratings yet
Parlab Parallel Boot Camp Cloud Computing With Mapreduce and Hadoop
49 pages
Features of Hadoop: - Suitable For Big Data Analysis
No ratings yet
Features of Hadoop: - Suitable For Big Data Analysis
6 pages
HTML Cheatsheet
No ratings yet
HTML Cheatsheet
6 pages
10 IPS 4 - Akun Office 365
No ratings yet
10 IPS 4 - Akun Office 365
1 page
Elkhoukhi 2019
No ratings yet
Elkhoukhi 2019
13 pages
MapReduce for Data Engineers
No ratings yet
MapReduce for Data Engineers
30 pages
GMC 300E Plus User Guide
No ratings yet
GMC 300E Plus User Guide
24 pages
Presentation 3 PDF
No ratings yet
Presentation 3 PDF
8 pages
Lab Manual Big Data Analytics Lab (LC-CSE-410G) : Department of Computer Science and Engineering
No ratings yet
Lab Manual Big Data Analytics Lab (LC-CSE-410G) : Department of Computer Science and Engineering
28 pages
Fire & Gas System Module Guide
No ratings yet
Fire & Gas System Module Guide
10 pages
OrionSX-Datasheet 083022
No ratings yet
OrionSX-Datasheet 083022
2 pages
Electronics Engineer Internship Letter
No ratings yet
Electronics Engineer Internship Letter
2 pages
Hadoop and MR Programming: DR G Sudha Sadasivam Professor Cse, PSGCT
No ratings yet
Hadoop and MR Programming: DR G Sudha Sadasivam Professor Cse, PSGCT
71 pages
Cellular Gateway Release Notes Xe 17 11 X
No ratings yet
Cellular Gateway Release Notes Xe 17 11 X
6 pages
Mapreduce Programming Framework
No ratings yet
Mapreduce Programming Framework
23 pages
Barani Institute of Management Sciences: Final-Term Exam Fall-2019
No ratings yet
Barani Institute of Management Sciences: Final-Term Exam Fall-2019
2 pages
HTSO by Tosif Ghazi
No ratings yet
HTSO by Tosif Ghazi
11 pages
Mapreduce Programming Model and Design Patterns: Andrea Lottarini January 17, 2012
No ratings yet
Mapreduce Programming Model and Design Patterns: Andrea Lottarini January 17, 2012
23 pages

Bda Practical 2

Uploaded by

Bda Practical 2

Uploaded by

Faculty of Engineering & Technology

Big Data Analytics (203105348)

Aim: Write a program of Word Count in Map Reduce over HDFS.

Consists of two main phases

Enrollment No.: 2203051057106

for line in response:

for words in each_word:

for each in same_words.keys():

Enrollment No.: 2203051057106

Enrollment No.: 2203051057106

You might also like