Bayesian Classification
A naive Bayes classifier is a simple probabilistic classifier based on applying Bayes' theorem with
strong (naive) independence assumptions. A more descriptive term for the underlying probability
model would be "independent feature model".
In simple terms, a naive Bayes classifier assumes that the presence (or absence) of a particular
feature of a class is unrelated to the presence (or absence) of any other feature.
Let X be a data sample (“evidence”): class label is unknown
Let H be a hypothesis that X belongs to class C
Classification is to determine P(H|X), the probability that the hypothesis holds given the
observed data sample X
P(H) (prior probability): the initial probability of the hypothesis, before observing X.
E.g., the probability that X will buy a computer, regardless of age, income, etc.
P(X): the probability that the sample data is observed.
P(X|H) (likelihood): the probability of observing the sample X, given that the hypothesis holds.
E.g., given that X will buy a computer, the probability that X is aged 31...40 with medium income.
Bayes' theorem is useful in that it provides a way of calculating the posterior probability P(H|X)
from P(H), P(X), and P(X|H). Bayes' theorem can be stated as
P(H|X) = \frac{P(X|H)\, P(H)}{P(X)}
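As a quick illustration, the posterior can be computed directly once the three quantities on the right-hand side are known. The snippet below is a minimal sketch in Python; the function name bayes_posterior and the numbers in the example call are illustrative, not taken from the text.

def bayes_posterior(p_x_given_h, p_h, p_x):
    """Compute P(H|X) = P(X|H) * P(H) / P(X) via Bayes' theorem."""
    return p_x_given_h * p_h / p_x

# Hypothetical values: P(X|H) = 0.6, P(H) = 0.3, P(X) = 0.4
print(bayes_posterior(0.6, 0.3, 0.4))  # 0.45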
Naive Bayesian Classification
The naive Bayesian classifier, or simple Bayesian classifier, works as follows:
Let D be a training set of tuples and their associated class labels, and each tuple is
represented by an n-D attribute vector X = (x1, x2, …, xn)
Suppose there are m classes C1, C2, …, Cm.
Classification is to derive the maximum posterior probability, i.e., the maximal P(Ci|X)
This can be derived from Bayes’ theorem
P(C_i|X) = \frac{P(X|C_i)\, P(C_i)}{P(X)}
Since P(X) is constant for all classes, only P(X|Ci) P(Ci) needs to be maximized.
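Put differently, the predicted class is the Ci that maximizes the product P(X|Ci) P(Ci). A minimal sketch of this decision rule; the class labels and probability values in the example call are hypothetical:

def predict(priors, likelihoods):
    """Return the class Ci maximizing P(X|Ci) * P(Ci).

    priors:      dict mapping class label -> P(Ci)
    likelihoods: dict mapping class label -> P(X|Ci)
    """
    return max(priors, key=lambda c: likelihoods[c] * priors[c])

# Hypothetical two-class example
print(predict({"yes": 0.6, "no": 0.4}, {"yes": 0.05, "no": 0.02}))  # yes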
A simplifying assumption: the attributes are conditionally independent given the class, i.e., there
is no dependence relation between attributes, so P(X|Ci) becomes

P(X|C_i) = \prod_{k=1}^{n} P(x_k|C_i) = P(x_1|C_i) \times P(x_2|C_i) \times \cdots \times P(x_n|C_i)
If Ak is categorical, P(xk|Ci) is the number of tuples in Ci having value xk for Ak, divided by
|Ci,D| (the number of tuples of class Ci in D)
If Ak is continuous-valued, P(xk|Ci) is usually computed based on a Gaussian distribution with
mean μ and standard deviation σ:

g(x, \mu, \sigma) = \frac{1}{\sqrt{2\pi}\,\sigma}\, e^{-\frac{(x-\mu)^2}{2\sigma^2}}

and

P(x_k|C_i) = g(x_k, \mu_{C_i}, \sigma_{C_i})
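This density can be evaluated directly from the class-specific mean and standard deviation estimated on the training tuples of Ci. A minimal sketch; the mean, standard deviation, and attribute value in the example call are hypothetical:

import math

def gaussian(x, mu, sigma):
    """Gaussian density g(x, mu, sigma), used as P(xk|Ci) for a continuous attribute Ak."""
    return math.exp(-((x - mu) ** 2) / (2 * sigma ** 2)) / (math.sqrt(2 * math.pi) * sigma)

# Hypothetical: ages of the tuples in class Ci have mean 38 and standard deviation 12
print(gaussian(35, 38, 12))  # estimated density of age = 35 under that class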
Ex: Consider the following training dataset to illustrate classification by predicting the class
label for the situation "a student whose age is less than or equal to 30, with medium income and a
fair credit rating, buys a computer or not".
Understanding the Data
The table represents a dataset used for a classification problem. We're trying to predict whether
someone "buys a computer" (the "Class" column) based on several attributes:
RID (Record ID): A unique identifier for each data point.
age: Categorical age ranges: "<=30", "31...40", ">40".
income: Categorical income levels: "high", "medium", "low".
student: Binary (yes/no) indicating if the person is a student.
credit_rating: Categorical credit rating: "fair", "excellent".
Class: buys_computer: The target variable we want to predict. It's binary (yes/no).
1. Calculate Prior Probabilities P(Ci):
P(Ci): P(buys_computer = “yes”) = 9/14 = 0.643
P(buys_computer = “no”) = 5/14= 0.357
2. Compute Conditional Probabilities P(xk|Ci):
This step calculates the probability of observing each specific attribute value xk given each class
Ci.
Compute P(xk|Ci) for each attribute value of X and each class:
P(age = “<=30” | buys_computer = “yes”) = 2/9 = 0.222
P(age = “<= 30” | buys_computer = “no”) = 3/5 = 0.6
P(income = “medium” | buys_computer = “yes”) = 4/9 = 0.444
P(income = “medium” | buys_computer = “no”) = 2/5 = 0.4
P(student = “yes” | buys_computer = “yes”) = 6/9 = 0.667
P(student = “yes” | buys_computer = “no”) = 1/5 = 0.2
P(credit_rating = “fair” | buys_computer = “yes”) = 6/9 = 0.667
P(credit_rating = “fair” | buys_computer = “no”) = 2/5 = 0.4
3. Compute P(X|Ci) and P(X|Ci)P(Ci) for X = (age <= 30, income = medium, student = yes, credit_rating = fair):
P(X|Ci) : P(X|buys_computer = “yes”) = 0.222 x 0.444 x 0.667 x 0.667 = 0.044
P(X|buys_computer = “no”) = 0.6 x 0.4 x 0.2 x 0.4 = 0.019
P(X|Ci)*P(Ci) : P(X|buys_computer = “yes”) * P(buys_computer = “yes”) = 0.028
P(X|buys_computer = “no”) * P(buys_computer = “no”) = 0.007
Therefore, X belongs to class (“buys_computer = yes”)
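The entire calculation can be reproduced in a few lines of code. The sketch below simply plugs in the priors and conditional probabilities estimated above from the 14 training tuples; the dictionary keys such as "age<=30" are just illustrative labels.

# Priors and conditionals estimated in steps 1 and 2 above
priors = {"yes": 9 / 14, "no": 5 / 14}
conditionals = {
    "yes": {"age<=30": 2 / 9, "income=medium": 4 / 9,
            "student=yes": 6 / 9, "credit=fair": 6 / 9},
    "no":  {"age<=30": 3 / 5, "income=medium": 2 / 5,
            "student=yes": 1 / 5, "credit=fair": 2 / 5},
}

# X = (age <= 30, income = medium, student = yes, credit_rating = fair)
scores = {}
for c in priors:
    p_x_given_c = 1.0
    for p in conditionals[c].values():
        p_x_given_c *= p              # naive independence: multiply the P(xk|Ci)
    scores[c] = p_x_given_c * priors[c]   # P(X|Ci) * P(Ci)

print(scores)                         # roughly {'yes': 0.028, 'no': 0.007}
print(max(scores, key=scores.get))    # yes -> buys_computer = yes

Since only the relative magnitudes of the two scores matter for picking the class, P(X) is never computed, exactly as noted in the derivation above.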