Bayesian Inference
by Hoai Nam Nguyen
September 9, 2017
The setting is the same. Given a population that follows a distribution $P$, where $P$ contains one or more unknown parameters, we want to construct an estimator for each of them. In this course, I consider the simple case where there is only one unknown parameter $\theta$. To do this, we proceed by collecting an i.i.d. sample $X_1, \ldots, X_n \sim P$.
Similar to Maximum Likelihood Estimation, we first find the likelihood function $L(\theta)$:
$$L(\theta) = f_{X_1,\ldots,X_n}(x_1,\ldots,x_n \mid \theta)$$
In Bayesian inference, we treat the parameter $\theta$ as a random variable. That is, $\theta$ follows a probability distribution with pdf $\pi(\theta)$. We call $\pi(\theta)$ the prior distribution of $\theta$.
By Bayes's formula, we have
$$\pi(\theta \mid x_1,\ldots,x_n) = \frac{f_{X_1,\ldots,X_n}(x_1,\ldots,x_n \mid \theta)\,\pi(\theta)}{f_{X_1,\ldots,X_n}(x_1,\ldots,x_n)} \propto f_{X_1,\ldots,X_n}(x_1,\ldots,x_n \mid \theta)\,\pi(\theta)$$
where $\pi(\theta \mid x_1,\ldots,x_n)$ is the pdf of $\theta$ given the sample data. This is called the posterior distribution of $\theta$.
Let me clarify the last step further. The symbol $\propto$ means "proportional to". Since the left-hand side is the distribution of $\theta$ conditional on the sample data $\{x_1,\ldots,x_n\}$, all the $x_i$ are assumed to be known, and the denominator $f_{X_1,\ldots,X_n}(x_1,\ldots,x_n)$ is therefore no more than a constant.
In this setting, we are given the population distribution $P$ and the prior distribution $\pi(\theta)$. We have to find the posterior distribution $\pi(\theta \mid x_1,\ldots,x_n)$. We then use the posterior mean $E[\theta \mid x_1,\ldots,x_n]$ to estimate the unknown parameter $\theta$. That is,
$$\hat{\theta} = E[\theta \mid X_1,\ldots,X_n]$$
NOTE: when calculating $\pi(\theta \mid x_1,\ldots,x_n)$, always use proportionality by removing constants, because this will simplify the calculation a lot.
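To make the recipe concrete, here is a minimal numerical sketch (my own illustration, not part of the course material) that approximates a posterior on a grid; the Bernoulli model, the uniform prior, and the made-up data are all assumptions for the demo.

```python
import numpy as np

# Bayesian recipe on a grid: posterior ∝ likelihood × prior.
x = np.array([1, 0, 1, 1, 0, 1])        # hypothetical i.i.d. Bernoulli sample
theta = np.linspace(0.001, 0.999, 999)  # grid over the parameter space
d = theta[1] - theta[0]                 # grid spacing

likelihood = theta**x.sum() * (1 - theta)**(len(x) - x.sum())
prior = np.ones_like(theta)             # pi(theta) = 1, i.e. Uniform(0, 1)

posterior = likelihood * prior
posterior /= posterior.sum() * d        # the dropped denominator is just a constant

posterior_mean = (theta * posterior).sum() * d  # the Bayesian estimator
print(posterior_mean)  # ≈ (sum(x_i) + 1) / (n + 2) = 5/8, as Example 1 below predicts
```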
Example 1
The population distribution is Bernoulli($p$), where $p \sim \mathrm{Uniform}(0, 1)$. Use Bayesian inference to construct an estimator $\hat{p}$.
The likelihood function is given by:
$$L(p) = \prod_{i=1}^{n} f_{X_i}(x_i \mid p) = \prod_{i=1}^{n} p^{x_i}(1-p)^{1-x_i} = p^{\sum x_i}(1-p)^{n-\sum x_i}$$
The pdf of the prior distribution is $\pi(p) = 1$, for $0 < p < 1$.
Therefore, the posterior distribution is given by:
$$\pi(p \mid x_1,\ldots,x_n) \propto f_{X_1,\ldots,X_n}(x_1,\ldots,x_n \mid p)\,\pi(p) = p^{\sum x_i}(1-p)^{n-\sum x_i}, \quad \text{for } 0 < p < 1$$
Recall the pdf of Beta($\alpha$, $\beta$):
$$f_X(x) = \frac{\Gamma(\alpha+\beta)}{\Gamma(\alpha)\Gamma(\beta)}\, x^{\alpha-1}(1-x)^{\beta-1}, \quad \text{for } 0 < x < 1$$
By comparing, we can see that the posterior distribution of $p$ is Beta($\sum x_i + 1$, $n - \sum x_i + 1$).
We know that the expectation of Beta($\alpha$, $\beta$) is $\frac{\alpha}{\alpha+\beta}$. Therefore, the posterior mean is given by:
$$E[p \mid x_1,\ldots,x_n] = \frac{\sum x_i + 1}{n+2}$$
Thus, $\hat{p} = \frac{\sum X_i + 1}{n+2}$ is the Bayesian estimator for $p$.
Note that we used proportionality when calculating the posterior distribution. By comparing with the pdf of Beta($\alpha$, $\beta$), we can easily recover the missing constant:
$$c = \frac{\Gamma(\alpha+\beta)}{\Gamma(\alpha)\Gamma(\beta)} = \frac{\Gamma(n+2)}{\Gamma\!\left(\sum x_i + 1\right)\Gamma\!\left(n - \sum x_i + 1\right)}$$
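As a sanity check (my own illustration, with made-up data and assuming scipy is available), the closed-form result of Example 1 can be compared against scipy's Beta distribution:

```python
import numpy as np
from scipy import stats

x = np.array([1, 1, 0, 1, 0, 1, 1, 0])  # hypothetical Bernoulli(p) sample
n, s = len(x), x.sum()

# Posterior from Example 1: p | x_1, ..., x_n ~ Beta(sum(x_i) + 1, n - sum(x_i) + 1)
posterior = stats.beta(s + 1, n - s + 1)

print(posterior.mean())   # posterior mean computed by scipy
print((s + 1) / (n + 2))  # closed-form Bayesian estimator; the two should match
```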
Example 2
Same as Example 1, except that $p \sim \mathrm{Beta}(a, b)$, where both $a$ and $b$ are given constants.
The likelihood function stays unchanged:
$$L(p) = p^{\sum x_i}(1-p)^{n-\sum x_i}$$
The pdf of the prior distribution is given by:
$$\pi(p) = \frac{\Gamma(a+b)}{\Gamma(a)\Gamma(b)}\, p^{a-1}(1-p)^{b-1}, \quad \text{for } 0 < p < 1$$
Therefore, the pdf of the posterior distribution is given by:
$$\pi(p \mid x_1,\ldots,x_n) \propto f_{X_1,\ldots,X_n}(x_1,\ldots,x_n \mid p)\,\pi(p) \propto p^{\sum x_i}(1-p)^{n-\sum x_i}\, p^{a-1}(1-p)^{b-1} = p^{\sum x_i + a - 1}(1-p)^{n-\sum x_i + b - 1}, \quad \text{for } 0 < p < 1$$
We recognise this as Beta($\sum x_i + a$, $n - \sum x_i + b$).
The posterior mean is $E[p \mid x_1,\ldots,x_n] = \frac{\sum x_i + a}{n+a+b}$. The Bayesian estimator for $p$ is given by:
$$\hat{p} = \frac{\sum X_i + a}{n+a+b}$$
Again, you can recover the normalising constant in the pdf of the posterior distribution:
$$c = \frac{\Gamma(n+a+b)}{\Gamma\!\left(\sum x_i + a\right)\Gamma\!\left(n - \sum x_i + b\right)}$$
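One useful way to read this estimator is as a compromise between the prior mean $\frac{a}{a+b}$ and the sample mean $\bar{x}$, with the prior acting like $a+b$ extra pseudo-observations. A small sketch (with hypothetical $a$, $b$, and data) makes the equivalence explicit:

```python
import numpy as np

a, b = 2.0, 2.0                          # hypothetical prior Beta(a, b)
x = np.array([1, 0, 1, 1, 1, 0, 1, 1])  # hypothetical Bernoulli sample
n = len(x)

p_hat = (x.sum() + a) / (n + a + b)      # Bayesian estimator from Example 2

# Equivalent form: weighted average of the sample mean and the prior mean.
w = n / (n + a + b)
blend = w * x.mean() + (1 - w) * (a / (a + b))

print(p_hat, blend)  # identical values
```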
Example 3
The population distribution is $N(\mu, \sigma^2)$, where $\mu$ is unknown and $\sigma^2$ is known. The parameter $\mu$ follows a prior distribution $N(\nu, \tau^2)$, where both $\nu$ and $\tau^2$ are given constants. Use Bayesian inference to construct an estimator $\hat{\mu}$.
The likelihood function is given by:
$$L(\mu) = \prod_{i=1}^{n} f_{X_i}(x_i \mid \mu) = \prod_{i=1}^{n} \frac{1}{\sqrt{2\pi\sigma^2}} \exp\left(-\frac{(x_i-\mu)^2}{2\sigma^2}\right) \propto \prod_{i=1}^{n} \exp\left(-\frac{(x_i-\mu)^2}{2\sigma^2}\right), \quad \text{because } \sigma^2 \text{ is known}$$
Also, the pdf of the prior distribution is given by:
$$\pi(\mu) = \frac{1}{\sqrt{2\pi\tau^2}} \exp\left(-\frac{(\mu-\nu)^2}{2\tau^2}\right) \propto \exp\left(-\frac{(\mu-\nu)^2}{2\tau^2}\right), \quad \text{because } \tau^2 \text{ is known}$$
Then, calculate the pdf of the posterior distribution:
$$\begin{aligned}
\pi(\mu \mid x_1,\ldots,x_n) &\propto \left[\prod_{i=1}^{n} \exp\left(-\frac{(x_i-\mu)^2}{2\sigma^2}\right)\right] \exp\left(-\frac{(\mu-\nu)^2}{2\tau^2}\right) \\
&= \exp\left(-\frac{1}{2\sigma^2}\sum_{i=1}^{n}(x_i-\mu)^2\right) \exp\left(-\frac{1}{2\tau^2}(\mu^2 - 2\nu\mu + \nu^2)\right) \\
&\propto \exp\left(-\frac{1}{2\sigma^2}\sum_{i=1}^{n}(x_i-\mu)^2\right) \exp\left(-\frac{1}{2\tau^2}(\mu^2 - 2\nu\mu)\right), \quad \text{by removing } \exp\left(-\frac{\nu^2}{2\tau^2}\right) \\
&= \exp\left(-\frac{1}{2\sigma^2}\sum_{i=1}^{n}(x_i^2 - 2x_i\mu + \mu^2)\right) \exp\left(-\frac{1}{2\tau^2}(\mu^2 - 2\nu\mu)\right) \\
&\propto \exp\left(-\frac{1}{2\sigma^2}\sum_{i=1}^{n}(-2x_i\mu + \mu^2)\right) \exp\left(-\frac{1}{2\tau^2}(\mu^2 - 2\nu\mu)\right), \quad \text{by removing } \exp\left(-\frac{1}{2\sigma^2}\sum_{i=1}^{n} x_i^2\right) \\
&= \exp\left(\frac{\mu}{\sigma^2}\sum_{i=1}^{n} x_i - \frac{n\mu^2}{2\sigma^2}\right) \exp\left(-\frac{1}{2\tau^2}(\mu^2 - 2\nu\mu)\right) \\
&= \exp\left[-\frac{1}{2}\left(\frac{n}{\sigma^2} + \frac{1}{\tau^2}\right)\mu^2 + \left(\frac{\nu}{\tau^2} + \frac{1}{\sigma^2}\sum_{i=1}^{n} x_i\right)\mu\right] \\
&= \exp\left(-A\mu^2 + B\mu\right), \quad \text{where } A = \frac{1}{2}\left(\frac{n}{\sigma^2} + \frac{1}{\tau^2}\right) \text{ and } B = \frac{\nu}{\tau^2} + \frac{1}{\sigma^2}\sum_{i=1}^{n} x_i \\
&= \exp\left(-\frac{\mu^2 - (B/A)\mu}{1/A}\right) \\
&\propto \exp\left(-\frac{\mu^2 - (B/A)\mu + B^2/(4A^2)}{1/A}\right) \\
&= \exp\left[-\frac{\left(\mu - B/(2A)\right)^2}{1/A}\right]
\end{aligned}$$
Comparing with the pdf of a Normal distribution, we deduce that the posterior distribution of $\mu$ is given by:
$$\mu \mid x_1,\ldots,x_n \sim N\left(\frac{B}{2A},\ \frac{1}{2A}\right)$$
Clearly, $E[\mu \mid x_1,\ldots,x_n] = \frac{B}{2A}$. Therefore,
$$\hat{\mu} = \frac{B}{2A} = \frac{\dfrac{1}{\sigma^2}\sum_{i=1}^{n} X_i + \dfrac{\nu}{\tau^2}}{\dfrac{n}{\sigma^2} + \dfrac{1}{\tau^2}}$$
is the Bayesian estimator for $\mu$: a precision-weighted average of the sample information and the prior mean.
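To see these formulas in action, here is a small numerical sketch (my own, with made-up values of $\nu$, $\tau^2$, $\sigma^2$, and data) that evaluates $A$, $B$, and the posterior mean $B/(2A)$:

```python
import numpy as np

sigma2 = 4.0         # known population variance sigma^2
nu, tau2 = 0.0, 1.0  # hypothetical prior: mu ~ N(nu, tau^2)
x = np.array([1.2, 0.5, 2.1, 1.7, 0.9])  # hypothetical sample from N(mu, sigma^2)
n = len(x)

# Coefficients of the completed square: posterior ∝ exp(-A*mu^2 + B*mu)
A = 0.5 * (n / sigma2 + 1 / tau2)
B = nu / tau2 + x.sum() / sigma2

post_mean = B / (2 * A)  # the Bayesian estimator mu-hat
post_var = 1 / (2 * A)   # posterior variance

# Same mean, written as a precision-weighted average of the data and the prior:
check = (x.sum() / sigma2 + nu / tau2) / (n / sigma2 + 1 / tau2)
print(post_mean, check, post_var)
```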
Example 4
Consider the following types of treatment:
Treatment 1: 100% of the patients are cured (3 out of 3)
Treatment 2: 95% of the patients are cured (19 out of 20)
Treatment 3: 90% of the patients are cured (90,000 out of 100,000)
Which one is the best?
Treatment 1 cured 100% of the patients, but the sample was so small that we should cast doubt on the result. On the other hand, Treatment 3's huge sample is very reassuring, but its cure rate is a bit lower.
Let $p$ be the probability that a patient is cured. Then, the probability that a patient is not cured is $1-p$. Therefore, the population follows Bernoulli($p$), where $p$ is an unknown parameter.
In Example 1, we found that $\hat{p} = \frac{\sum x_i + 1}{n+2}$ provides an estimate for $p$.
Treatment 1: $\hat{p} = \frac{3+1}{3+2} = \frac{4}{5} = 0.8$
Treatment 2: $\hat{p} = \frac{19+1}{20+2} = \frac{20}{22} \approx 0.909$
Treatment 3: $\hat{p} = \frac{90000+1}{100000+2} = \frac{90001}{100002} \approx 0.9$
We can see that $\hat{p}$ for Treatment 2 is the highest. Therefore, we predict that Treatment 2 is the best one. Treatment 1, despite curing everyone in the sample, is predicted to be the worst due to its small sample size.
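The arithmetic is easy to reproduce; a short sketch:

```python
# Bayesian estimator from Example 1: p_hat = (cured + 1) / (treated + 2)
treatments = {
    "Treatment 1": (3, 3),
    "Treatment 2": (19, 20),
    "Treatment 3": (90_000, 100_000),
}

for name, (cured, treated) in treatments.items():
    p_hat = (cured + 1) / (treated + 2)
    print(f"{name}: p_hat = {p_hat:.4f}")
# Treatment 2 comes out highest (~0.909), beating the tiny perfect sample
# (0.8) and the large but lower-rate sample (~0.9).
```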