Monte Carlo Methods
∗ Monte Carlo methods are a way of approximating the value
of an integral using large samples of random variables.
∗ These samples of random variables are typically computer
generated.
∗ Since we regularly need to calculate integrals in Bayesian
inference, Monte Carlo methods are very popular in that setting.
6-1
Monte Carlo Integration
∗ Suppose that we need to evaluate
$$I = \int_A h(x)\, dx$$
∗ Let f be a probability density function with support A.
∗ Then we can write
$$I = \int_A \frac{h(x)}{f(x)} f(x)\, dx = \int_A g(x) f(x)\, dx$$
where g(x) = h(x)/f(x).
∗ Now if Y is a random variable with pdf f then we have
I = E[g(Y)].
6-2
Monte Carlo Integration
∗ Hence if we have Y_1, . . . , Y_N iid ∼ f, the Weak Law of Large
Numbers tells us that
$$\hat{I} = \frac{1}{N} \sum_{i=1}^{N} g(Y_i) \xrightarrow{\,p\,} I.$$
∗ We can therefore use Î to approximate I very well for large
enough N.
∗ Furthermore we can estimate the variability in Î using
the sample variance of the random sample g(Y_1), . . . , g(Y_N)
divided by N.
∗ Since N is totally in our control we can choose N to be large
enough to make the variability as low as we desire.
6-3
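The estimator and its standard error take only a few lines of code. A minimal sketch, assuming the toy integral I = ∫₀¹ x² dx = 1/3 with f the Uniform(0, 1) density (so g(x) = h(x)/f(x) = x²); the integrand is an illustrative choice, not from the slides:

```python
import math
import random

random.seed(0)

# Estimate I = ∫_0^1 x^2 dx = 1/3 by drawing Y_i ~ Uniform(0, 1):
# f(x) = 1 on (0, 1), so g(x) = h(x)/f(x) = x^2.
N = 100_000
g_vals = [random.random() ** 2 for _ in range(N)]

I_hat = sum(g_vals) / N  # Monte Carlo estimate of I

# Estimated variance of I_hat: sample variance of the g(Y_i) divided by N.
sample_var = sum((g - I_hat) ** 2 for g in g_vals) / (N - 1)
se_hat = math.sqrt(sample_var / N)

print(I_hat, se_hat)
```

Since the standard error shrinks like 1/√N, halving it requires quadrupling N.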
Generating Uniform Random Variates
∗ Computers are unable to generate truly random numbers.
∗ They can, however, be used to generate pseudo-random numbers.
∗ These are sequences of numbers which are generated from a
deterministic algorithm but which behave like a sequence of
iid random variates.
∗ Typically the numbers generated by a computer can be thought
of as coming from a Uniform(0,1) distribution.
6-4
Generating Non-Uniform Random Numbers
∗ Uniform random variates are rarely what we need for simulation
or Monte Carlo inference.
∗ They are, however, the building blocks for generating random
variates from any other distribution.
∗ Much of this is based on the following theorem
Theorem 6.1 (Probability Integral Transform)
Suppose that U ∼ Uniform(0, 1) and that F is a continuous cdf
with unique inverse F^{-1}. Then the random variable
Y = F^{-1}(U)
has a distribution with cdf F.
6-5
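Theorem 6.1 translates directly into code. A small sketch, assuming an Exponential(rate) target (an illustrative choice), whose cdf F(y) = 1 − exp(−rate·y) has the closed-form inverse F⁻¹(u) = −log(1 − u)/rate:

```python
import math
import random

random.seed(1)

def exponential_variate(rate: float) -> float:
    # F(y) = 1 - exp(-rate * y)  =>  F^{-1}(u) = -log(1 - u) / rate
    u = random.random()  # U ~ Uniform(0, 1)
    return -math.log(1.0 - u) / rate

sample = [exponential_variate(2.0) for _ in range(100_000)]
print(sum(sample) / len(sample))  # should be close to the mean 1/rate = 0.5
```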
Generating Discrete Random Variates
∗ The method as described above requires that we have a continuous cdf.
∗ A similar technique can also be used to generate discrete
random variables.
∗ Suppose that p(y) is the probability mass function and the
support of the random variable is Y = {y : p(y) > 0}. Then
we can define the inverse cdf as
$$F^{-1}(u) = \min\{y \in \mathcal{Y} : F(y) \geqslant u\}$$
∗ Then if U ∼ Uniform(0, 1), the random variable Y = F^{-1}(U)
will be distributed with probability mass function p(y).
6-6
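A minimal sketch of the discrete version, scanning the support for the smallest y with F(y) ⩾ u; the three-point pmf is made up for the example:

```python
import random

random.seed(2)

# Illustrative three-point distribution (not from the slides).
support = [1, 2, 3]
pmf = [0.2, 0.5, 0.3]

def discrete_variate() -> int:
    u = random.random()
    cumulative = 0.0
    for y, p in zip(support, pmf):
        cumulative += p       # cumulative = F(y)
        if cumulative >= u:   # smallest y with F(y) >= u
            return y
    return support[-1]        # guard against floating-point rounding at u ≈ 1

counts = {y: 0 for y in support}
for _ in range(100_000):
    counts[discrete_variate()] += 1
print({y: c / 100_000 for y, c in counts.items()})  # close to the pmf
```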
Special Methods
∗ In many cases, the inverse cdf is not available in closed form
and so this method cannot be used. We can often, however,
use algorithms based on transformations for such situations.
∗ Suppose that U1 and U2 are two independent Uniform(0, 1)
random variables; then it is easy to show that
$$Y_1 = \sqrt{-2 \log U_1}\, \sin(2\pi U_2) \quad \text{and} \quad Y_2 = \sqrt{-2 \log U_1}\, \cos(2\pi U_2)$$
are independent standard normal random variables.
∗ This is known as the Box-Muller Algorithm.
6-7
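The transform is straightforward to implement. A sketch (using 1 − U in place of U to keep the logarithm away from zero; the two forms have the same distribution):

```python
import math
import random

random.seed(3)

def box_muller() -> tuple[float, float]:
    u1 = 1.0 - random.random()  # in (0, 1], keeps log(u1) finite
    u2 = random.random()
    r = math.sqrt(-2.0 * math.log(u1))
    return r * math.sin(2.0 * math.pi * u2), r * math.cos(2.0 * math.pi * u2)

draws: list[float] = []
for _ in range(50_000):
    y1, y2 = box_muller()
    draws.append(y1)
    draws.append(y2)

mean = sum(draws) / len(draws)
var = sum((y - mean) ** 2 for y in draws) / (len(draws) - 1)
print(mean, var)  # should be close to 0 and 1
```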
Accept/Reject Algorithm
∗ A more general technique which is useful when the inverse
cdf method cannot be applied is called the Accept/Reject
Algorithm.
∗ This method relies on generating a different random variable
V which has the same support as the required variable Y .
∗ We also require that the ratio of densities is bounded by a
known constant
$$M = \sup_{y} \frac{f_Y(y)}{f_V(y)} < \infty$$
6-8
Accept/Reject Algorithm
1. Calculate M = sup_y f_Y(y)/f_V(y).
2. Generate V ∼ fV and independently U ∼ Uniform(0, 1).
3. If
$$U < \frac{f_Y(V)}{M f_V(V)}$$
then set Y = V. Otherwise discard U and V and return to
step 2.
6-9
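A sketch of the three steps, assuming an illustrative Beta(2, 2) target f_Y(y) = 6y(1 − y) on (0, 1) with a Uniform(0, 1) proposal, so that f_V ≡ 1 and M = f_Y(1/2) = 3/2 (none of these choices come from the slides):

```python
import random

random.seed(4)

def f_target(y: float) -> float:
    # Beta(2, 2) density on (0, 1)
    return 6.0 * y * (1.0 - y)

M = 1.5  # sup_y f_Y(y) / f_V(y) = f_Y(1/2) = 3/2, since f_V ≡ 1

def accept_reject() -> float:
    while True:
        v = random.random()        # step 2: V ~ f_V = Uniform(0, 1)
        u = random.random()        # step 2: U ~ Uniform(0, 1), independent
        if u < f_target(v) / M:    # step 3 (f_V(v) = 1 in the denominator)
            return v               # accept: Y = V
        # otherwise discard U and V and try again

sample = [accept_reject() for _ in range(100_000)]
print(sum(sample) / len(sample))  # Beta(2, 2) has mean 1/2
```

On average one candidate in M is accepted, so a proposal giving a small M makes the sampler efficient; here the acceptance rate is 1/M = 2/3.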
Markov Chain Monte Carlo Methods
∗ Many of the methods described so far are not very useful for
generating multivariate random variates.
∗ Markov Chain Monte Carlo methods are now widely used in
these settings.
∗ The methods work on the idea of constructing a Markov
chain which has a stationary distribution equal to the
distribution of interest.
∗ Under certain conditions, the distribution of the elements in
such a chain will converge to this stationary distribution.
6-10
Markov Chain Monte Carlo Methods
∗ These algorithms start with some initial value for the random
variable of interest.
∗ They then run a carefully constructed Markov chain starting
from that initial value for a sufficiently long time.
∗ It is not always easy to know how long the chains should be
run but various diagnostics have been proposed.
∗ Any observations in the chain after this burn-in period may
be considered as (at least approximately) distributed with the
stationary distribution.
6-11
Metropolis–Hastings Algorithm
∗ First introduced in statistical physics by Metropolis
et al. in 1953. Its statistical properties were established by
Hastings in 1970.
∗ It is basically a Markov chain version of the accept/reject
algorithm.
∗ Random variates are generated from some candidate
distribution conditional on the current state of the chain;
the new state is then either accepted, or rejected, in which
case the chain stays where it is.
6-12
Metropolis–Hastings Algorithm
Suppose we wish to sample Y ∼ f_Y.
First initialize the chain with some value Y^(0).
Then for t = 1, 2, . . . we generate Y^(t) by
1. Generate V^(t) ∼ f_{V|Y}(v | Y^(t−1)).
2. Calculate the acceptance probability
$$\rho_t = \min\left( \frac{f_Y(V^{(t)})}{f_Y(Y^{(t-1)})} \times \frac{f_{V|Y}(Y^{(t-1)} \mid V^{(t)})}{f_{V|Y}(V^{(t)} \mid Y^{(t-1)})},\; 1 \right)$$
3. Generate U_t ∼ Uniform(0, 1) and set
$$Y^{(t)} = \begin{cases} V^{(t)} & \text{if } U_t \leqslant \rho_t \\ Y^{(t-1)} & \text{if } U_t > \rho_t \end{cases}$$
6-13
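A sketch of the algorithm for an illustrative Exponential(1) target with a non-symmetric log-normal candidate V^(t) = Y^(t−1)·exp(Z), Z ∼ N(0, 0.5²); none of these choices come from the slides. For this candidate the proposal densities contribute exactly a factor V^(t)/Y^(t−1) to ρ_t:

```python
import math
import random

random.seed(5)

def log_target(y: float) -> float:
    # log f_Y(y) = -y for the Exponential(1) target (up to normalisation)
    return -y

def run_chain(n: int, y0: float = 1.0) -> list[float]:
    chain = [y0]
    for _ in range(n):
        y = chain[-1]
        v = y * math.exp(random.gauss(0.0, 0.5))  # step 1: V ~ f_{V|Y}(. | y)
        # Log-normal candidate densities satisfy
        # f_{V|Y}(y | v) / f_{V|Y}(v | y) = v / y.
        log_rho = min(log_target(v) - log_target(y) + math.log(v / y), 0.0)
        if random.random() < math.exp(log_rho):   # step 3: accept
            chain.append(v)
        else:
            chain.append(y)                       # reject: chain stays put
    return chain

chain = run_chain(200_000)
burned = chain[1_000:]  # discard burn-in
print(sum(burned) / len(burned))  # Exponential(1) has mean 1
```

Working with log densities, as here, avoids numerical underflow when the densities themselves are tiny.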
Independence Metropolis–Hastings Algorithm
∗ It is often convenient to generate V^(t) from the same
distribution at every iteration.
∗ In this case we have f_{V|Y}(v | Y^(t−1)) = f_V(v) and so the
acceptance probability becomes
$$\rho_t = \min\left( \frac{f_Y(V^{(t)})}{f_Y(Y^{(t-1)})} \times \frac{f_V(Y^{(t-1)})}{f_V(V^{(t)})},\; 1 \right) = \min\left( \frac{f_Y(V^{(t)})}{f_V(V^{(t)})} \times \frac{f_V(Y^{(t-1)})}{f_Y(Y^{(t-1)})},\; 1 \right)$$
6-14
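A sketch of the independence sampler for an illustrative Gamma(2, 1) target f_Y(y) ∝ y·e^(−y) with Exponential(1) candidates (neither choice is from the slides); with these densities the acceptance probability reduces to min(V^(t)/Y^(t−1), 1):

```python
import math
import random

random.seed(6)

def run_chain(n: int, y0: float = 1.0) -> list[float]:
    chain = [y0]
    for _ in range(n):
        y = chain[-1]
        v = -math.log(1.0 - random.random())  # V ~ Exponential(1), inverse cdf
        # rho = min( f_Y(v) f_V(y) / (f_Y(y) f_V(v)), 1 )
        #     = min( (v e^-v e^-y) / (y e^-y e^-v), 1 ) = min(v / y, 1)
        rho = min(v / y, 1.0)
        chain.append(v if random.random() < rho else y)
    return chain

chain = run_chain(200_000)[1_000:]  # discard burn-in
print(sum(chain) / len(chain))  # Gamma(2, 1) has mean 2
```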
Random Walk Metropolis–Hastings Algorithm
∗ Another special case is where f_{V|Y}(v | y) = f_Z(v − y) where
f_Z is a distribution symmetric about 0.
∗ We generate Z^(t) ∼ f_Z and set V^(t) = Y^(t−1) + Z^(t).
∗ In stochastic processes this is called a random walk.
∗ The acceptance probability for the Metropolis–Hastings
algorithm then becomes
$$\rho_t = \min\left( \frac{f_Y(V^{(t)})}{f_Y(Y^{(t-1)})},\; 1 \right)$$
6-15
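A sketch of random walk Metropolis–Hastings for an illustrative standard normal target, using N(0, 1) increments as f_Z (both choices are for illustration only):

```python
import math
import random

random.seed(7)

def log_target(y: float) -> float:
    # log of the N(0, 1) density up to an additive constant
    return -0.5 * y * y

def run_chain(n: int, y0: float = 0.0) -> list[float]:
    chain = [y0]
    for _ in range(n):
        y = chain[-1]
        v = y + random.gauss(0.0, 1.0)  # V^(t) = Y^(t-1) + Z^(t), Z symmetric
        log_rho = min(log_target(v) - log_target(y), 0.0)
        chain.append(v if random.random() < math.exp(log_rho) else y)
    return chain

chain = run_chain(200_000)[1_000:]  # discard burn-in
mean = sum(chain) / len(chain)
var = sum((y - mean) ** 2 for y in chain) / (len(chain) - 1)
print(mean, var)  # should be close to 0 and 1
```

The increment scale matters in practice: steps that are too small explore slowly, steps that are too large are rarely accepted.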
Gibbs Sampler
∗ The Gibbs Sampler (Geman & Geman, 1984) is designed to
generate observations from a complex multivariate distribution.
∗ The Markov chain is constructed by considering the
univariate conditional distributions.
∗ Suppose that the random vector of interest is Y = (Y_1, . . . , Y_d)
and that we can generate observations from the full conditional
distributions
$$f_j(y \mid Y_{-j} = y_{-j}) = f_{Y_j \mid Y_{-j}}(y \mid Y_{-j} = y_{-j}), \qquad j = 1, \ldots, d$$
where Y_{−j} = (Y_1, . . . , Y_{j−1}, Y_{j+1}, . . . , Y_d).
6-16
Gibbs Sampler
Initialise the chain to some value $Y^{(0)} = (Y_1^{(0)}, \ldots, Y_d^{(0)})$.
For t = 1, 2, . . .
1. Generate $Y_1^{(t)}$ from $f_1(y_1 \mid Y_2^{(t-1)}, \ldots, Y_d^{(t-1)})$.
2. Generate $Y_2^{(t)}$ from $f_2(y_2 \mid Y_1^{(t)}, Y_3^{(t-1)}, \ldots, Y_d^{(t-1)})$.
...
j. Generate $Y_j^{(t)}$ from $f_j(y_j \mid Y_1^{(t)}, \ldots, Y_{j-1}^{(t)}, Y_{j+1}^{(t-1)}, \ldots, Y_d^{(t-1)})$.
...
d. Generate $Y_d^{(t)}$ from $f_d(y_d \mid Y_1^{(t)}, \ldots, Y_{d-1}^{(t)})$.
Then we set $Y^{(t)} = (Y_1^{(t)}, \ldots, Y_d^{(t)})$.
6-17
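The steps above can be sketched for an illustrative bivariate normal target with standard margins and correlation r = 0.8 (not from the slides), whose full conditionals are Y₁ | Y₂ = y₂ ∼ N(r·y₂, 1 − r²) and symmetrically for Y₂:

```python
import math
import random

random.seed(8)

r = 0.8                       # correlation of the target
sd = math.sqrt(1.0 - r * r)   # conditional standard deviation

y1, y2 = 0.0, 0.0             # Y^(0): arbitrary starting value
draws = []
for _ in range(100_000):
    y1 = random.gauss(r * y2, sd)  # step 1: Y1 from f1(y1 | Y2)
    y2 = random.gauss(r * y1, sd)  # step 2: Y2 from f2(y2 | Y1)
    draws.append((y1, y2))

burned = draws[1_000:]  # discard burn-in
m1 = sum(a for a, _ in burned) / len(burned)
corr_hat = sum(a * b for a, b in burned) / len(burned)  # estimates E[Y1 Y2] = r
print(m1, corr_hat)
```

Note that each update immediately uses the freshest value of the other coordinate, exactly as in steps 1 through d above.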