Name – Amit Shukla
Roll No. – 2200971640010
Branch – AIML
Subject – Machine Learning Technique Lab
Practical-06
AIM - Assuming a set of documents that need to be classified, use the naïve Bayesian Classifier model to
perform this task. Built-in Java classes/API can be used to write the program. Calculate the accuracy,
precision, and recall for your data set.
Theory:- The Naïve Bayesian Classifier is a probabilistic machine learning model used for text classification
tasks, such as spam detection or sentiment analysis. It is based on Bayes' Theorem, with the "naïve" assumption
that all features (words in a document) are independent of each other given the class label. Despite this
simplification, it performs remarkably well in practical applications.
Key Concepts:
Bayes’ Theorem:
It provides a way to calculate the probability of a hypothesis given the evidence (see the formula after this list).
Prior Probability P(H):
Probability of a class (e.g., positive or negative) before seeing the data.
Likelihood P(E∣H):
Probability of observing a word in a document, given the class.
Posterior Probability P(H∣E):
Final probability of the class given the observed features (words).
Feature Independence Assumption:
Assumes each word in the document contributes independently to the class probability.
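In symbols, Bayes' Theorem ties these quantities together:
P(H∣E) = P(E∣H) × P(H) / P(E)
As a worked example with purely illustrative numbers: if P(pos) = 0.5, P("great" ∣ pos) = 0.2, and
P("great") = 0.12, then P(pos ∣ "great") = (0.2 × 0.5) / 0.12 ≈ 0.83.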
How the Naïve Bayesian Classifier Works for Document Classification:
1. Preprocess the Text:
Convert documents into tokens (words), remove stopwords, and vectorize the data using techniques
like Bag of Words or TF-IDF.
2. Training Phase:
Use the training documents and their labels to calculate the prior and likelihood probabilities for
each class (a minimal hand-rolled sketch of this step appears after this list).
3. Prediction Phase:
For a new/unseen document, compute the posterior probability for each class, and assign the class
with the highest probability.
4. Evaluation:
Use metrics such as Accuracy, Precision, and Recall to evaluate model performance.
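To make steps 2 and 3 concrete, here is a minimal hand-rolled sketch in Python (the toy corpus and
its word counts are invented for illustration): it estimates the class prior and Laplace-smoothed
word likelihoods from the training documents, then assigns the most probable class.

from collections import Counter

# Toy labelled corpus (invented examples for illustration)
train_docs = [("good great movie", "pos"),
              ("great acting", "pos"),
              ("bad boring movie", "neg")]

# Training phase: count class frequencies and per-class word frequencies
class_counts = Counter(label for _, label in train_docs)
word_counts = {c: Counter() for c in class_counts}
for text, label in train_docs:
    word_counts[label].update(text.split())
vocab = {w for counts in word_counts.values() for w in counts}

def posterior(text, c):
    # Prior P(c) times Laplace-smoothed likelihoods P(w|c)
    p = class_counts[c] / sum(class_counts.values())
    n_c = sum(word_counts[c].values())
    for w in text.split():
        p *= (word_counts[c][w] + 1) / (n_c + len(vocab))
    return p

# Prediction phase: choose the class with the highest posterior
print(max(class_counts, key=lambda c: posterior("great movie", c)))  # -> pos

Laplace (add-one) smoothing keeps a single unseen word from zeroing out the whole product;
scikit-learn's MultinomialNB, used in the source code below, applies the same smoothing through
its alpha parameter (alpha=1.0 by default).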
Assumptions of Naïve Bayesian Classifier:
• The features (words) are conditionally independent given the class.
• The training dataset is representative of the real-world distribution.
• The input text is already preprocessed (cleaned and vectorized).
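Formally, the conditional independence assumption lets the likelihood of a document with words
w1, w2, …, wn factorise into a product of per-word likelihoods:
P(w1, w2, …, wn ∣ c) = P(w1 ∣ c) × P(w2 ∣ c) × … × P(wn ∣ c)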
Source Code:-
import pandas as pd

# Load the dataset: each row holds a message and its label ('pos' or 'neg')
msg = pd.read_csv('/content/sample_data/document.csv', names=['message', 'label'])
print("Total Instances of Dataset: ", msg.shape[0])

# Map the text labels to numbers: 'pos' -> 1, 'neg' -> 0
msg['labelnum'] = msg.label.map({'pos': 1, 'neg': 0})

X = msg.message
y = msg.labelnum

# Split into training and test sets (default 75% / 25% split)
from sklearn.model_selection import train_test_split
Xtrain, Xtest, ytrain, ytest = train_test_split(X, y)

# Convert the text into a Bag-of-Words document-term matrix
from sklearn.feature_extraction.text import CountVectorizer
count_v = CountVectorizer()
Xtrain_dm = count_v.fit_transform(Xtrain)
Xtest_dm = count_v.transform(Xtest)
# Inspect the first few rows of the document-term matrix
df = pd.DataFrame(Xtrain_dm.toarray(), columns=count_v.get_feature_names_out())
print(df[0:5])

# Train a Multinomial Naive Bayes classifier and predict on the test set
from sklearn.naive_bayes import MultinomialNB
clf = MultinomialNB()
clf.fit(Xtrain_dm, ytrain)
pred = clf.predict(Xtest_dm)
# Show each test document alongside its predicted label
# (zip over Xtest, not Xtrain: pred was computed on the test set)
for doc, p in zip(Xtest, pred):
    p = 'pos' if p == 1 else 'neg'
    print("%s -> %s" % (doc, p))
# Evaluate the classifier: accuracy, recall, precision, and confusion matrix
from sklearn.metrics import accuracy_score, confusion_matrix, precision_score, recall_score
print('Accuracy Metrics: \n')
print('Accuracy: ', accuracy_score(ytest, pred))
print('Recall: ', recall_score(ytest, pred))
print('Precision: ', precision_score(ytest, pred))
print('Confusion Matrix: \n', confusion_matrix(ytest, pred))
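For reference, the reported metrics follow from the confusion-matrix counts, with 'pos' (label 1)
treated as the positive class (scikit-learn's default pos_label=1 for precision_score and
recall_score):
Accuracy = (TP + TN) / (TP + TN + FP + FN)
Precision = TP / (TP + FP)
Recall = TP / (TP + FN)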