ETC 2420/5242 Lab 10 2016
Souhaib Ben Taieb
Week 10
Purpose
In this lab you will compute conditional probabilities and practice Bayesian inference.
Question 1
A situation where Bayesian analysis is routinely used is the spam filter on your mail server. Each message is
scrutinized for key words that make it likely the message is spam. Let us describe
how one of these filters might work. We imagine that the evidence for spam is that the subject line
of the message contains the sentence “check this out”. We define the events spam (the message is spam) and check
this out (the subject line contains this sentence).
From previous experience we know that 40% of emails are spam, 1% of spam emails have “check this out” in
the subject line, and 0.4% of non-spam emails have this sentence in the subject line.
Explain the different steps to compute the conditional probability P(spam | check this out).
$$P(\text{spam} \mid \text{check this out}) = \frac{P(\text{check this out} \mid \text{spam})\,P(\text{spam})}{P(\text{check this out})}$$

$$P(\text{spam}) = 0.4, \qquad P(\text{check this out} \mid \text{spam}) = 0.01$$

$$
\begin{aligned}
P(\text{check this out}) &= P(\text{check this out} \mid \text{spam})\,P(\text{spam}) + P(\text{check this out} \mid \text{not spam})\,P(\text{not spam}) \\
&= 0.01 \times 0.4 + 0.004 \times 0.6 = 0.0064
\end{aligned}
$$

$$P(\text{spam} \mid \text{check this out}) = \frac{0.004}{0.0064} = \frac{5}{8} = 0.625$$
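As a quick numerical check, the same calculation can be done in R using the probabilities stated above:

p_spam <- 0.4                          # P(spam)
p_check_given_spam <- 0.01             # P(check this out | spam)
p_check_given_notspam <- 0.004         # P(check this out | not spam)
p_check <- p_check_given_spam * p_spam +
  p_check_given_notspam * (1 - p_spam) # P(check this out) = 0.0064
p_check_given_spam * p_spam / p_check  # P(spam | check this out) = 0.625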
Question 2
Let $X_1, \ldots, X_n \sim N(\theta, 9)$.
a. If $\theta \sim N(\mu, \tau^2)$, what is $\pi(\theta \mid x_1, \ldots, x_n)$?
b. What is the posterior mean $E[\theta \mid x_1, \ldots, x_n]$?
c. What is the MLE $\hat{\theta}_{MLE}$?
See the Week 9 lecture slides for the derivations.
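For reference, these are the standard conjugate-normal results (with known variance $\sigma_0^2 = 9$) that the code below implements:

$$\theta \mid x_1, \ldots, x_n \sim N\!\left(\frac{n\bar{x}/\sigma_0^2 + \mu/\tau^2}{n/\sigma_0^2 + 1/\tau^2},\; \frac{1}{n/\sigma_0^2 + 1/\tau^2}\right),
\qquad \hat{\theta}_{MLE} = \bar{x}.$$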
Suppose the “true” value is $\theta = 2$. Consider (1) $\mu = 5$ and $\tau = 1$, and (2) $\mu = 2$ and $\tau = 2$.
For $n \in \{1, 10, 20, 50, 100, 10000\}$:
a. Simulate a data set consisting of $n$ observations.
b. Plot on the same graphic $\pi(\theta)$, $\pi(\theta \mid x_1, \ldots, x_n)$ and $\hat{\theta}_{MLE}$.
Discuss the behavior of $\pi(\theta \mid x_1, \ldots, x_n)$ as $n$ increases and the impact of the prior distribution.
set.seed(1986)
theta <- 2            # true mean
sigma_0 <- 3          # known standard deviation (variance 9)
alln <- c(1, 2, 5, 10, 100, 10000)

for (case in c(1, 2)) {
  # Two priors: case 1 uses N(2, 2^2), case 2 uses N(5, 1^2)
  if (case == 1) {
    prior_mu <- 2
    prior_tau <- 2
  } else if (case == 2) {
    prior_mu <- 5
    prior_tau <- 1
  }
  for (n in alln) {
    x <- rnorm(n, mean = theta, sd = sigma_0)
    x_bar <- mean(x)                        # MLE of theta
    # Conjugate-normal update
    a <- (n * x_bar) / sigma_0^2 + prior_mu / prior_tau^2
    b <- n / sigma_0^2 + 1 / prior_tau^2
    post_mu <- a / b                        # posterior mean
    print(post_mu)
    post_sigma <- sqrt(1 / b)               # posterior standard deviation
    # Evaluate prior and posterior densities on grids centred at their means
    xx <- seq(-5, 5, by = 0.001)
    xx_prior <- xx * prior_tau + prior_mu
    xx_post <- xx * post_sigma + post_mu
    Y <- cbind(dnorm(xx_prior, mean = prior_mu, sd = prior_tau),
               dnorm(xx_post, mean = post_mu, sd = post_sigma))
    X <- cbind(xx_prior, xx_post)
    matplot(X, Y, type = "l", lty = 1, main = paste("n =", n))
    abline(v = x_bar, lty = 1)              # MLE marked as a vertical line
  }
}
[Output and figures, one plot per n, showing the prior density, the posterior density and the sample mean (vertical line). Printed posterior means for n = 1, 2, 5, 10, 100, 10000 were 1.957306, 2.376356, 1.561813, 2.718445, 2.307785, 2.028544 with the prior $N(2, 2^2)$, and 4.495602, 4.879569, 3.914996, 2.632301, 2.529208, 2.069093 with the prior $N(5, 1^2)$.]

As $n$ increases, the posterior concentrates around the true value $\theta = 2$ and its mean approaches the MLE $\bar{x}$: the data dominate and the prior is forgotten. For small $n$ the prior matters: the tight prior $N(5, 1^2)$ pulls the posterior mean towards 5, while the more diffuse prior $N(2, 2^2)$, which is centred at the true value, gives posterior means close to 2 from the start.
Question 3
Suppose there is a $\text{Beta}(4, 4)$ prior distribution on the probability $\theta$ that a coin will yield a “head” when
spun in a specified manner. The coin is independently spun ten times, and “heads” appear fewer than 3 times.
You are not told how many heads were seen, only that the number is less than 3. Calculate your exact
posterior density (up to a proportionality constant) for θ and plot it.
Prior density:
$$\pi(\theta) \propto \theta^3 (1 - \theta)^3$$
Likelihood:
$$
\begin{aligned}
f(\text{data} \mid \theta) &= \binom{10}{0}\theta^0 (1 - \theta)^{10} + \binom{10}{1}\theta^1 (1 - \theta)^9 + \binom{10}{2}\theta^2 (1 - \theta)^8 \\
&= (1 - \theta)^{10} + 10\theta(1 - \theta)^9 + 45\theta^2(1 - \theta)^8
\end{aligned}
$$
Posterior density:
$$\pi(\theta \mid \text{data}) \propto \theta^3(1 - \theta)^{13} + 10\theta^4(1 - \theta)^{12} + 45\theta^5(1 - \theta)^{11}$$
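Equivalently (an optional observation, not required by the question), collecting the Beta normalising constants shows that this posterior is a mixture of Beta densities:

$$\pi(\theta \mid \text{data}) = \tfrac{13}{128}\,\text{Beta}(\theta \mid 4, 14) + \tfrac{40}{128}\,\text{Beta}(\theta \mid 5, 13) + \tfrac{75}{128}\,\text{Beta}(\theta \mid 6, 12)$$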
theta <- seq(0, 1, 0.01)
# Unnormalised posterior density
dens <- theta^3 * (1 - theta)^13 + 10 * theta^4 * (1 - theta)^12 + 45 * theta^5 * (1 - theta)^11
plot(theta, dens, ylim = c(0, 1.1 * max(dens)), type = "l",
     xlab = "theta", ylab = "", xaxs = "i", yaxs = "i", yaxt = "n")
[Figure: unnormalised posterior density of theta.]
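As an additional check (not part of the original solution), the unnormalised density can be normalised numerically on the same grid to approximate, for example, the posterior mean:

theta <- seq(0, 1, 0.01)
dens <- theta^3 * (1 - theta)^13 + 10 * theta^4 * (1 - theta)^12 + 45 * theta^5 * (1 - theta)^11
post <- dens / sum(dens * 0.01)    # approximate normalising constant on the grid
sum(theta * post * 0.01)           # approximate posterior mean, about 0.30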
Question 4
Suppose your prior distribution for θ, the proportion of Californians who support the death penalty, is a beta
distribution with mean 0.6 and standard deviation 0.3.
a. Determine the parameters α and β of your prior distribution. Plot the prior density function.
b. A random sample of 1000 Californians is taken, and 65% support the death penalty. What are your
posterior mean and variance for θ? Plot the posterior density function.
$$\alpha + \beta = \frac{E[\theta](1 - E[\theta])}{\mathrm{Var}(\theta)} - 1 = 1.67$$
$$\alpha = (\alpha + \beta)\,E[\theta] = 1$$
$$\beta = (\alpha + \beta)(1 - E[\theta]) = 0.67$$
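These values can be reproduced with a small R check (not part of the original code), using the prior mean and standard deviation given in the question:

m <- 0.6                      # prior mean
s <- 0.3                      # prior standard deviation
ab <- m * (1 - m) / s^2 - 1   # alpha + beta = 1.67
alpha <- ab * m               # alpha = 1
beta  <- ab * (1 - m)         # beta = 0.67
c(alpha, beta)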
theta <- seq(0, 1, 0.001)
dens <- dbeta(theta, 1, 0.67)       # Beta(1, 0.67) prior density
plot(theta, dens, xlim = c(0, 1), ylim = c(0, 3),
     type = "l", xlab = "theta", ylab = "", xaxs = "i",
     yaxs = "i", yaxt = "n", bty = "n", cex = 2)
# The density is unbounded at theta = 1; mark that edge with a dotted line
lines(c(1, 1), c(0, 3), col = 0)
lines(c(1, 1), c(0, 3), lty = 3)
[Figure: Beta(1, 0.67) prior density.]
Posterior distribution:
$$\pi(\theta \mid \text{data}) = \text{Beta}(\alpha + 650, \beta + 350) = \text{Beta}(651, 350.67)$$
$$E(\theta \mid \text{data}) = 0.6499, \qquad \mathrm{sd}(\theta \mid \text{data}) = 0.015$$
theta <- seq(0, 1, 0.001)
dens <- dbeta(theta, 651, 350.67)   # posterior density
cond <- dens / max(dens) > 0.001    # plot only the region with non-negligible density
plot(theta[cond], dens[cond],
     type = "l", xlab = "theta", ylab = "", xaxs = "i",
     yaxs = "i", yaxt = "n", bty = "n", cex = 2)
[Figure: posterior density Beta(651, 350.67), concentrated around 0.65.]
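A short check of the posterior mean and standard deviation quoted above, using the standard Beta moment formulas:

a_post <- 651
b_post <- 350.67
a_post / (a_post + b_post)                                             # posterior mean, about 0.6499
sqrt(a_post * b_post / ((a_post + b_post)^2 * (a_post + b_post + 1)))  # posterior sd, about 0.015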
Question 5
10 Prussian cavalry corps were monitored for 20 years (200 corps-years) and the number of fatalities due to
horse kicks was recorded:
x = # Deaths    Number of corps-years with x fatalities
0               109
1               65
2               22
3               3
4               1
Let $x_i$, $i = 1, \ldots, 200$, be the number of deaths in observation $i$. Assume that $x_i \overset{\text{i.i.d.}}{\sim} \text{Poisson}(\theta)$.
a. Compute the MLE $\hat{\theta}_{MLE}$.

$$\hat{\theta}_{MLE} = \bar{x} = \frac{122}{200} = 0.61$$
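This sample mean can be computed directly from the table; it is the same calculation as xbar in the code further below:

deaths <- c(0, 1, 2, 3, 4)
years  <- c(109, 65, 22, 3, 1)
sum(deaths * years) / sum(years)   # = 122 / 200 = 0.61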
Suppose $\theta \sim \text{Gamma}(\alpha, \beta)$.
a. What is the prior mean and variance?

$$E[\theta] = \frac{\alpha}{\beta}, \qquad \mathrm{Var}[\theta] = \frac{\alpha}{\beta^2}$$
b. What is the posterior distribution $\pi(\theta \mid x)$?

$$\theta \mid x \sim \text{Gamma}(\alpha + n\bar{x}, \beta + n)$$
c. What is the posterior mean and variance?

$$E[\theta \mid x] = \frac{\alpha + n\bar{x}}{\beta + n}, \qquad \mathrm{Var}[\theta \mid x] = \frac{\alpha + n\bar{x}}{(\beta + n)^2}$$
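For example, with $n = 200$ and $n\bar{x} = 122$ as in this data set, a weak prior with $\alpha = \beta = 1$ gives a posterior mean close to the MLE, while a strong prior with $\alpha = \beta = 100$ (prior mean 1) pulls it noticeably upwards:

$$E[\theta \mid x] = \frac{1 + 122}{1 + 200} \approx 0.612, \qquad E[\theta \mid x] = \frac{100 + 122}{100 + 200} = 0.74$$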
Plot on the same graphic $\pi(\theta)$, $\pi(\theta \mid x)$ and $\hat{\theta}_{MLE}$ for
a. $\alpha = \beta = 0.5$
b. $\alpha = \beta = 1$
c. $\alpha = \beta = 10$
d. $\alpha = \beta = 100$
n <- 200
DT <- data.frame(deaths = c(0, 1, 2, 3, 4), years = c(109, 65, 22, 3, 1))
xbar <- sum(DT[, 1] * DT[, 2]) / n      # MLE = 0.61
x <- seq(0, 2, by = 0.01)               # grid for plotting the densities

for (case in c(1, 2, 3, 4)) {
  if (case == 1) {
    alpha <- beta <- 0.5
  } else if (case == 2) {
    alpha <- beta <- 1
  } else if (case == 3) {
    alpha <- beta <- 10
  } else if (case == 4) {
    alpha <- beta <- 100
  }
  dens <- dgamma(x, shape = alpha, rate = beta)   # prior density
  alpha_posterior <- alpha + n * xbar
  beta_posterior <- beta + n
  dens_posterior <- dgamma(x, shape = alpha_posterior, rate = beta_posterior)  # posterior density
  matplot(x, cbind(dens, dens_posterior), lty = 1, type = "l",
          ylab = "Density", xlab = "theta")
  abline(v = xbar)                                # MLE marked as a vertical line
}
[Figures: prior and posterior densities plotted against theta on [0, 2], with the MLE $\bar{x} = 0.61$ marked by a vertical line, for $\alpha = \beta = 0.5$, 1, 10 and 100.]
TURN IN
• Your .Rmd file
• Your Word (or PDF) file that results from knitting the Rmd
• Make sure your group members are listed as authors; one person per group will turn in the report
• DUE: Wednesday after the lab, by 7am, uploaded to Moodle
Resources
• Lecture slides on Bayesian reasoning