Naive Bayes
Naive Bayes is a supervised machine learning algorithm based on Bayes' theorem. It is a probabilistic classifier, meaning it makes predictions based on probabilities. It is used for spam detection, sentiment analysis, text classification, etc.
Naive: It is called naive because it assumes that the occurrence of a certain feature is independent of the occurrence of the other features.
Example: Movie Genre Classification
Suppose you want to classify movies into genres based on two features:
the presence of keywords in the movie's description and the director of the
movie.
In a "naive" approach, you might assume that the presence of keywords
(e.g., action, romance, comedy) and the director's name are independent
factors when determining a movie's genre. In other words, you treat these
features as if they don't influence each other.
For instance:
If a movie's description contains the keyword "action," you might
immediately assume it's an action movie, without considering the director.
If a movie is directed by a famous director known for making romantic
films, you might classify it as a romance movie, regardless of the
keywords in the description.
The "naive" aspect here is that you're assuming no correlation or
interaction between keywords and the director's influence on the movie's
genre.
Bayes: It is called Bayes because it relies on the principle of Bayes' theorem. First, let's discuss conditional probability.
Conditional probability refers to the probability of an event occurring given that another event has already occurred. In text classification, this means predicting the probability of a particular class (Spam/Not Spam) given the presence of certain features (words) in a document.
Example: rolling two dice together.

Conditional probability:

P(A|B) = P(A ∩ B) / P(B), given P(B) ≠ 0

Rolling two dice gives a sample space of 36 equally likely outcomes. Let A be the event that the first die shows 5, and B the event that the two dice sum to at most 10:

P(A) = P(D1 = 5) = 6/36 = 1/6
P(B) = P(D1 + D2 ≤ 10) = 33/36

Question: what is the probability that D1 = 5, given that D1 + D2 ≤ 10?

A ∩ B requires D1 = 5 and D2 ≤ 5, which leaves 5 outcomes, so P(A ∩ B) = 5/36, and:

P(D1 = 5 | D1 + D2 ≤ 10) = P(A ∩ B) / P(B) = (5/36) / (33/36) = 5/33
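These numbers can be sanity-checked by brute-force enumeration. Below is a minimal Python sketch (not part of the original notes) that counts outcomes directly:

```python
from fractions import Fraction
from itertools import product

# All 36 equally likely outcomes of rolling two dice.
outcomes = list(product(range(1, 7), repeat=2))

# Event A: first die shows 5.  Event B: the sum is at most 10.
A = {(d1, d2) for d1, d2 in outcomes if d1 == 5}
B = {(d1, d2) for d1, d2 in outcomes if d1 + d2 <= 10}

p_B = Fraction(len(B), len(outcomes))            # 33/36
p_A_and_B = Fraction(len(A & B), len(outcomes))  # 5/36

# Conditional probability: P(A|B) = P(A ∩ B) / P(B)
print(p_A_and_B / p_B)  # 5/33
```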
Bayes Theorem for spam filtering
But what does this theorem have to do with our spam filter? We want to find out the likelihood that a specific message is spam. But a message consists of multiple words. In order to find the combined probability of the words, we first have to find the probability of each separate word being a spam word. This is also known as the 'spaminess' of a word, and we can calculate it using a special case of Bayes' theorem where the event is a binary variable.
P(S|W) = P(W|S) * P(S) / (P(W|S) * P(S) + P(W|H) * P(H))
where,
- P(S|W) is the probability that a message is spam, knowing that a specific word W appears in it;
- P(W|S) is the probability that the specific word appears in spam messages;
- P(S) is the overall probability that any given message is spam;
- P(W|H) is the probability that the specific word appears in ham messages;
- P(H) is the overall probability that any given message is ham.
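As a concrete illustration, here is a minimal Python sketch of this word-spaminess formula. The word "offer", its counts, and the helper name `spaminess` are made up for illustration; only the formula itself comes from the notes above.

```python
def spaminess(p_w_given_spam, p_w_given_ham, p_spam, p_ham):
    """P(S|W): probability a message is spam given that word W appears in it,
    computed with the binary-event form of Bayes' theorem."""
    numerator = p_w_given_spam * p_spam
    denominator = p_w_given_spam * p_spam + p_w_given_ham * p_ham
    return numerator / denominator

# Hypothetical counts: "offer" appears in 30 of 40 spam messages
# and in 5 of 60 ham messages; 40 of 100 messages are spam.
p_s_given_w = spaminess(
    p_w_given_spam=30 / 40,
    p_w_given_ham=5 / 60,
    p_spam=40 / 100,
    p_ham=60 / 100,
)
print(f"P(S|W) = {p_s_given_w:.3f}")  # ≈ 0.857
```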
Bayes Theorem

P(A|B) = P(B|A) * P(A) / P(B), valid given P(B) ≠ 0

In words: posterior = (likelihood * prior) / evidence, where P(A|B) is the posterior, P(B|A) is the likelihood, P(A) is the prior, and P(B) is the evidence.

Derivation from conditional probability:

P(A|B) = P(A ∩ B) / P(B)    ...(1)
P(B|A) = P(B ∩ A) / P(A), and since A ∩ B = B ∩ A,
P(A ∩ B) = P(B|A) * P(A)    ...(2)

Substituting (2) into (1):

P(A|B) = P(B|A) * P(A) / P(B)
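As a quick check (not in the original notes), the theorem reproduces the dice result from the earlier section. There, P(B|A) = P(D1 + D2 ≤ 10 | D1 = 5) = 5/6 (with D1 = 5, the sum stays at most 10 for the five values D2 ≤ 5), P(A) = 1/6, and P(B) = 33/36, so:

P(A|B) = (5/6 * 1/6) / (33/36) = (5/36) * (36/33) = 5/33

which matches the direct conditional-probability computation.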
Mathematical Intuition

Let the features be X = (x1, x2, x3, ..., xn) and let y be the class variable. By Bayes' theorem:

P(y|X) = P(X|y) * P(y) / P(X)

Expanding the features:

P(y|x1, ..., xn) = P(x1, ..., xn | y) * P(y) / P(x1, ..., xn)

By the conditional independence (naive) assumption, the likelihood factorizes into per-feature terms:

P(y|x1, ..., xn) = P(x1|y) * P(x2|y) * ... * P(xn|y) * P(y) / P(x1, ..., xn)

The denominator is the same for every class, so we pick the class y that maximizes P(y) * P(x1|y) * ... * P(xn|y).

Example: classifying short messages as Spam or Ham, given the following training sentences.

Sentence | Output
Send us your password | Spam
Send us your review | Spam
Review your password | Ham
Review us | Ham
Send us password | Spam
Send us your account | Spam
The prior probabilities of Spam and Ham (4 of the 6 training sentences are Spam, 2 are Ham) are:
P(Spam) = 4/6
P(Ham) = 2/6
Probability of every word in Spam and Ham:

Vocabulary | Spam | Ham
password | 2/4 | 1/2
review | 1/4 | 2/2
send | 3/4 | 1/2
us | 3/4 | 1/2
your | 3/4 | 1/2
account | 1/4 | 0/2
Now a new message arrives and we have to check whether it is spam or ham:
"review us now"
Conditional Probability
The word "now" is not in the vocabulary, so it is ignored. Over the vocabulary (password, review, send, us, your, account), the message is encoded as the binary vector (0, 1, 0, 1, 0, 0): a present word contributes its word probability and an absent word contributes one minus it.

P(review us now | Spam) = P(0,1,0,1,0,0 | Spam)
= (1 - 2/4) * (1/4) * (1 - 3/4) * (3/4) * (1 - 3/4) * (1 - 1/4)
= (1/2) * (1/4) * (1/4) * (3/4) * (1/4) * (3/4)
= 9/2048

P(review us now | Ham) = P(0,1,0,1,0,0 | Ham)
= (1 - 1/2) * (2/2) * (1 - 1/2) * (1/2) * (1 - 1/2) * (1 - 0/2)
= (1/2) * 1 * (1/2) * (1/2) * (1/2) * 1
= 1/16
Apply Bayes' theorem:

P(Spam | review us now)
= P(review us now | Spam) * P(Spam) / [P(review us now | Spam) * P(Spam) + P(review us now | Ham) * P(Ham)]
= (9/2048 * 4/6) / (9/2048 * 4/6 + 1/16 * 2/6)
= (3/1024) / (3/1024 + 1/48) = 9/73 ≈ 0.123

P(Ham | review us now)
= P(review us now | Ham) * P(Ham) / [P(review us now | Ham) * P(Ham) + P(review us now | Spam) * P(Spam)]
= (1/16 * 2/6) / (1/16 * 2/6 + 9/2048 * 4/6)
= (1/48) / (1/48 + 3/1024) = 64/73 ≈ 0.877
From the predicted probabilities, we can say that our message is Ham (0.877 > 0.123).
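The whole worked example can be reproduced in a few lines of Python. This is a minimal sketch of the Bernoulli-style computation above, with the word probabilities hard-coded from the table (no smoothing, matching the notes):

```python
from fractions import Fraction as F

# Word probabilities from the table: P(word | class).
p_word = {
    "spam": {"password": F(2, 4), "review": F(1, 4), "send": F(3, 4),
             "us": F(3, 4), "your": F(3, 4), "account": F(1, 4)},
    "ham":  {"password": F(1, 2), "review": F(2, 2), "send": F(1, 2),
             "us": F(1, 2), "your": F(1, 2), "account": F(0, 2)},
}
prior = {"spam": F(4, 6), "ham": F(2, 6)}

def score(message, label):
    """Unnormalized P(label) * P(message | label), Bernoulli style:
    present vocabulary words contribute P(w|label), absent ones 1 - P(w|label).
    Words outside the vocabulary (e.g. 'now') are ignored."""
    words = set(message.lower().split())
    likelihood = F(1)
    for w, p in p_word[label].items():
        likelihood *= p if w in words else 1 - p
    return prior[label] * likelihood

msg = "review us now"
scores = {label: score(msg, label) for label in ("spam", "ham")}
total = sum(scores.values())
for label, s in scores.items():
    print(label, float(s / total))  # spam ≈ 0.123, ham ≈ 0.877
```

Running it prints spam ≈ 0.123 and ham ≈ 0.877, matching the hand computation.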
Advantages:
1. Easy to understand and implement.
2. Computationally efficient and requires a small amount of training data.
3. Works well with high-dimensional data.
4. Less prone to overfitting.
5. It can be used in online learning, where the model can be updated with
new data without the need for retraining from scratch.
Disadvantages:
1. It assumes features are independent. In reality, many real-world datasets have correlated features, which can lead to suboptimal results.
2. When a feature doesn't appear in the training data for a particular class, it assigns a probability of zero, which can zero out the whole product and lead to incorrect classifications (see the smoothing sketch after this list).
3. Due to its simplicity, Naive Bayes may not capture complex relationships in the data as well as more advanced models like decision trees or neural networks.
4. It can be sensitive to imbalanced datasets.
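Regarding the zero-frequency problem in point 2, the standard remedy is Laplace (add-one) smoothing. A minimal sketch, reusing the worked example above, where "account" appeared in 0 of 2 ham messages:

```python
from fractions import Fraction as F

def smoothed_prob(word_count, class_count, alpha=1):
    """Laplace-smoothed P(word | class) for binary (present/absent) features:
    add alpha to the count and 2 * alpha to the denominator, so no word
    ever gets probability exactly 0 or 1."""
    return F(word_count + alpha, class_count + 2 * alpha)

# Unsmoothed: P(account | Ham) = 0/2 = 0, which zeroes out any product.
# Smoothed:   (0 + 1) / (2 + 2) = 1/4.
print(smoothed_prob(0, 2))  # 1/4
```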