
Introduction to Bayesian Data Analysis
Study Guide 1

Prepared by:
Asst. Prof. Sherelyn A. Evangelio
Introduction
• Bayesian data analysis has two foundational ideas. The first idea is that Bayesian
inference is reallocation of credibility across possibilities.
• The second foundational idea is that the possibilities, over which we allocate
credibility, are parameter values in meaningful mathematical models.
Bayesian Inference is Reallocation of Credibility Across Possibilities
Suppose we step outside one morning and notice that the sidewalk is wet, and
wonder why. We consider all possible causes of the wetness, including possibilities
such as recent rain, recent garden irrigation, a newly erupted underground spring, a
broken sewage pipe, a passerby who spilled a drink, and so on. If all we know until
this point is that some part of the sidewalk is wet, then all those possibilities will have
some prior credibility based on previous knowledge. For example, recent rain may
have greater prior probability than a spilled drink from a passerby. Continuing on our
outside journey, we look around and collect new observations. If we observe that the
sidewalk is wet for as far as we can see, as are the trees and parked cars, then we
reallocate credibility to the hypothetical cause of recent rain. The other possible causes,
such as a passerby spilling a drink, would not account for the new observations. On
the other hand, if instead we observed that the wetness was localized to a small area,
and there was an empty drink cup a few feet away, then we would re-allocate
credibility to the spilled-drink hypothesis, even though it had relatively low prior
probability.
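The sidewalk example can be sketched as a discrete Bayesian update. The candidate causes, prior credibilities, and likelihoods below are all illustrative assumptions, not values from the text:

```python
import numpy as np

# Hypothetical prior credibilities for each candidate cause of the wet sidewalk.
causes = ["rain", "irrigation", "spring", "sewage", "spilled drink"]
prior = np.array([0.50, 0.30, 0.02, 0.08, 0.10])

# Hypothetical likelihoods: probability of observing "wetness everywhere,
# including trees and parked cars" under each candidate cause.
likelihood = np.array([0.90, 0.05, 0.01, 0.02, 0.001])

# Bayes' rule: posterior credibility is proportional to prior times likelihood.
posterior = prior * likelihood
posterior /= posterior.sum()

for cause, p in zip(causes, posterior):
    print(f"{cause:>13s}: {p:.3f}")
```

With these assumed numbers, nearly all credibility is reallocated to rain once the widespread wetness is observed, even though the other causes retain nonzero prior credibility.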
• The word “credibility” is synonymous with “probability.”

• Another example of Bayesian inference has been immortalized in the words of
Sherlock Holmes to Doctor Watson: “How often have I said to you that when you
have eliminated the impossible, whatever remains, however improbable, must be
the truth?”
• The reallocated distribution of credibility is called the posterior distribution. It
then becomes the prior beliefs for subsequent observations.

• This reallocation of credibility is not only intuitive; it is also what the exact
mathematics of Bayesian inference prescribe.
• The complementary form of reasoning is also Bayesian, and can be called
judicial exoneration.

• Suppose there are several possible culprits for a crime, and that these suspects
are mutually unaffiliated and exhaust all possibilities. If evidence accrues that
one suspect is definitely culpable, then the other suspects are exonerated.
Data are noisy and inferences are probabilistic
• Previous examples assumed that observed data had definitive, deterministic
relations to the candidate causes. For example, Holmes may have found a footprint
at the scene of the crime and identified the size and type of shoe with complete
certainty, thereby completely ruling out a particular candidate suspect.
• In reality, data have only probabilistic relations to their underlying causes, the
measurements are not perfect, and the footprint is only an imperfect representation
of the shoe that produced it.
• The relation between the cause and the measured effect is full of random variation.
• In scientific research, measurements are replete with randomness. Extraneous
influences contaminate the measurements despite tremendous efforts to limit their
intrusion.
• All scientific data have some degree of “noise” in their values. The techniques of
data analysis are designed to infer underlying trends from noisy data.
• We can collect data and only incrementally adjust the
credibility of some possible trends. The beauty of Bayesian
analysis is that the mathematics reveal exactly how much to
reallocate credibility in realistic probabilistic situations.
• Suppose there is a manufacturer of inflated bouncy balls, and
the balls are produced in four discrete sizes, diameters of 1.0,
2.0, 3.0, and 4.0 units. Suppose we submit an order to the
factory for three balls of size 2. We receive three balls with
diameters of 1.77, 2.23, and 2.70.
From those measurements, can we conclude that the factory
correctly sent us three balls of size 2, or did the factory send
size 3 or size 1 by mistake, or even size 4?
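One way to answer this question is to treat the four sizes as discrete hypotheses and apply Bayes' rule. The manufacturing noise level (standard deviation 0.5) is an assumption made for illustration:

```python
import numpy as np
from scipy.stats import norm

# Candidate manufacturing sizes (discrete hypotheses) with equal prior credibility.
sizes = np.array([1.0, 2.0, 3.0, 4.0])
prior = np.full(4, 0.25)

# Observed diameters; assume (hypothetically) normal manufacturing noise, sd = 0.5.
data = np.array([1.77, 2.23, 2.70])
noise_sd = 0.5

# Likelihood of the three measurements under each candidate size.
likelihood = np.array([norm.pdf(data, loc=s, scale=noise_sd).prod() for s in sizes])

posterior = prior * likelihood
posterior /= posterior.sum()
print(dict(zip(sizes, posterior.round(3))))
```

Under these assumptions, size 2 receives by far the most posterior credibility, with size 3 a distant second, matching the intuition that 1.77, 2.23, and 2.70 cluster around 2.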
• Inferring the underlying manufacturing size of the balls from their “noisy”
individual diameters is analogous to data analysis in real-world scientific research
and applications.
• The data are noisy indicators of the underlying generator. We hypothesize a range
of possible underlying generators, and from the data we infer their relative
credibilities.
• Other examples are testing people for illicit drug use and detection of spam in
email.
• Bayesian analysis is the mathematics of reallocating credibility in a logically
coherent and precise way across possibilities. The distribution of credibility initially
reflects prior knowledge about the possibilities. Then new data are observed.
Possibilities that are consistent with the data garner more credibility, while those
that are not consistent lose credibility.
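The drug-testing example can be made concrete with Bayes' rule. The sensitivity, false-positive rate, and base rate below are hypothetical values chosen for illustration:

```python
# Hypothetical drug-screening example: even a fairly accurate test yields many
# false positives when the condition is rare (assumed base rate of 1%).
p_use = 0.01                     # prior: fraction of population using the drug
p_pos_given_use = 0.95           # test sensitivity (assumed)
p_pos_given_clean = 0.05         # false-positive rate (assumed)

# Total probability of a positive result, then Bayes' rule.
p_pos = p_pos_given_use * p_use + p_pos_given_clean * (1 - p_use)
p_use_given_pos = p_pos_given_use * p_use / p_pos
print(f"P(user | positive test) = {p_use_given_pos:.3f}")
```

With these numbers the posterior probability of drug use given a positive test is only about 16%: the data shift credibility toward the "user" hypothesis, but the low base rate keeps most credibility on the "clean" hypothesis.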
Possibilities are Parameter Values in Descriptive Models
• A key step in Bayesian analysis is defining the set of possibilities over which
credibility is allocated. A posterior predictive check can reveal when the data are not
well described by the chosen set of possibilities, prompting us to expand that set.
• Consider the example of a blood-pressure drug, in which blood pressures are
measured in one group that took the drug and in another group that took a placebo.
The magnitude of the difference in blood pressure describes the data, and our goal is
to assess which possible descriptions are more or less credible.
• In general, data analysis begins with a family of candidate descriptions for the data.
These are mathematical formulas that characterize the trends and spreads in the data.
The formulas have parameter values that determine the exact shape of the
mathematical form.
• For example, the normal distribution has two parameters: the mean, which is the
location parameter, and the standard deviation, which is the scale parameter.

• The role of Bayesian inference is to compute the exact relative credibilities of
candidate parameter values.

• In realistic applications, the candidate parameter values can form an infinite
continuum. For the normal distribution, the range of the location parameter is all
real numbers.

• Bayesian inference operates without trouble on infinite continua.
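In practice, a continuum of parameter values can be approximated by a fine grid. The sketch below allocates credibility over a grid of candidate location parameters for a normal model; the data values, known scale, and grid bounds are all assumptions made for illustration:

```python
import numpy as np
from scipy.stats import norm

# Credibility over a continuum: approximate the posterior for the location
# parameter mu of a normal model (sigma assumed known = 1) on a fine grid.
data = np.array([1.2, 0.8, 1.5, 0.9, 1.1])    # hypothetical observations
mu_grid = np.linspace(-3, 5, 2001)

prior = np.ones_like(mu_grid)                  # flat prior across the grid

# Log-likelihood of all observations at each candidate value of mu.
loglik = np.array([norm.logpdf(data, loc=m, scale=1.0).sum() for m in mu_grid])

# Posterior is proportional to prior times likelihood; subtract the max
# log-likelihood before exponentiating for numerical stability.
post = prior * np.exp(loglik - loglik.max())
post /= post.sum()
print(f"most credible mu ~ {mu_grid[post.argmax()]:.2f}")
```

The most credible value lands at the sample mean (1.10 here), as expected for a normal likelihood with a flat prior; a finer grid approximates the continuous posterior more closely.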
• There are two main desiderata for a mathematical description of data. First, the
mathematical form should be comprehensible, with meaningful parameters.

• Second, it should be descriptively adequate: the mathematical form should “look
like” the data.

• It is important to understand that mathematical descriptions of the data are not
necessarily causal explanations of the data.

• The parameters are “meaningful” only in the context of the familiar mathematical
form defined by the distribution; the parameter values have no necessary meaning
with respect to causes in the world.
The Steps of Bayesian Data Analysis
In general, Bayesian analysis of data follows these steps:
1. Identify the data relevant to the research questions. What are the measurement
scales of the data? Which data variables are to be predicted, and which data
variables are supposed to act as predictors?
2. Define a descriptive model for the relevant data. The mathematical form and its
parameters should be meaningful and appropriate to the theoretical purposes of
the analysis.
3. Specify a prior distribution on the parameters. The prior must pass muster with
the audience of the analysis, such as skeptical scientists.
4. Use Bayesian inference to re-allocate credibility across parameter values. Interpret
the posterior distribution with respect to theoretically meaningful issues
(assuming that the model is a reasonable description of the data; see next step).
5. Check that the posterior predictions mimic the data with reasonable accuracy
(i.e., conduct a “posterior predictive check”). If not, then consider a different
descriptive model.

Suppose we are interested in the relationship between weight and height of people.
In particular, we might be interested in predicting a person’s weight based on their
height.

Step 1: Identify the relevant data. Suppose we have been able to collect heights (in)
and weights (lb) from 57 mature adults sampled at random from a population of
interest.
Step 2: Define a descriptive model of the data that is meaningful for our research of
interest. We will describe predicted weight as a multiplier times height plus a
baseline, denoted mathematically as

ŷ = β₁x + β₀

where ŷ is the predicted weight, β₁ indicates how much the predicted weight
increases when the height x goes up by 1 inch, and the baseline β₀ represents the
weight of a person who is 0 inches tall. The above equation is the model of trend and
is often called linear regression.

We also have to describe the random variation of actual weights around the predicted
weights. We assume that the actual weights y are distributed randomly according to a
normal distribution around ŷ with standard deviation σ, symbolically denoted as

y ~ normal(ŷ, σ)

The full model has three parameters: the slope β₁, the intercept β₀, and the standard
deviation of the “noise,” σ.
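This model can be explored with a simple grid approximation over the slope and intercept (holding σ fixed for simplicity). The synthetic heights and weights below are assumed stand-ins, not the actual data from the study guide:

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic stand-in for the 57 height/weight pairs (assumed values):
# weight = beta1 * height + beta0 + normal noise.
heights = rng.uniform(60, 75, size=57)
weights = 4.0 * heights - 100.0 + rng.normal(0, 15, size=57)

# Grid approximation of the posterior over (beta1, beta0), with sigma fixed
# at 15 for simplicity and a flat prior over the grid (as in Step 3).
b1_grid = np.linspace(2, 6, 201)
b0_grid = np.linspace(-200, 0, 201)
B1, B0 = np.meshgrid(b1_grid, b0_grid, indexing="ij")

# Log-likelihood of all 57 data points under each (beta1, beta0) combination.
resid = weights[None, None, :] - (B1[..., None] * heights + B0[..., None])
loglik = (-0.5 * (resid / 15.0) ** 2).sum(axis=-1)

post = np.exp(loglik - loglik.max())
post /= post.sum()

i, j = np.unravel_index(post.argmax(), post.shape)
print(f"most credible slope ~ {b1_grid[i]:.2f}, intercept ~ {b0_grid[j]:.2f}")
```

In real applications the posterior over all three parameters would be explored jointly, typically with MCMC sampling rather than a grid; the grid version simply makes the "reallocation over combinations of parameter values" idea tangible.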
Step 3: Specify a prior distribution on the parameters. We might inform the prior with
previously conducted research. For this example, we will use equal prior credibility
across a vast range of possible values for the slope and intercept, both centered at 0.
For the noise parameter, we will use a uniform distribution. This choice of prior
distribution implies that it has virtually no biasing influence on the resulting posterior
distribution.

Step 4: Interpret the posterior distribution. Bayesian inference has reallocated
credibility across parameter values. The posterior distribution indicates combinations
of β₁, β₀, and σ that together are credible, given the data.
Step 4 (cont.):
The posterior distribution of β₁ indicates that the most credible value of the slope is
about 4.1. It also shows the uncertainty in the estimated slope. One way to summarize
the uncertainty is by marking the span of values that are most credible and cover 95%
of the distribution, called the 95% highest density interval (HDI).

It can be seen that a slope of 0 falls far outside the range of credible values, so we
could decide to “reject” a slope of 0.
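A 95% HDI can be computed from posterior samples as the narrowest interval containing 95% of them. The samples below are hypothetical, generated to be centered near the slope of 4.1 mentioned above:

```python
import numpy as np

def hdi(samples, mass=0.95):
    """Narrowest interval containing `mass` of the sampled values."""
    s = np.sort(np.asarray(samples))
    n_in = int(np.ceil(mass * len(s)))
    # Width of every candidate interval that contains n_in consecutive samples.
    widths = s[n_in - 1:] - s[:len(s) - n_in + 1]
    lo = int(np.argmin(widths))
    return s[lo], s[lo + n_in - 1]

# Hypothetical posterior samples for the slope, centered near 4.1.
rng = np.random.default_rng(1)
slope_samples = rng.normal(4.1, 0.7, size=10_000)

lo, hi = hdi(slope_samples, 0.95)
print(f"95% HDI: [{lo:.2f}, {hi:.2f}]")
```

Because the interval excludes 0 by a wide margin, a slope of 0 is not among the credible values, which is the basis for "rejecting" it.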
Step 5: Check that the model actually mimics the
data reasonably well. This is called the
“posterior predictive check.”
By visual inspection of the graph, we can see
that the actual data appear to be well described
by the predicted data.
If the actual data appear to deviate
systematically from the predicted form, then we
could contemplate alternative descriptive
models.
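A posterior predictive check can be sketched by simulating replicated data sets from the fitted model and comparing a summary statistic with its observed value. This sketch uses point estimates rather than the full posterior, and the heights and weights are synthetic stand-ins:

```python
import numpy as np

rng = np.random.default_rng(2)

# Synthetic data (assumed, for illustration) and a fitted linear model.
x = rng.uniform(60, 75, size=57)
y = 4.0 * x - 100.0 + rng.normal(0, 15, size=57)
beta1, beta0 = np.polyfit(x, y, deg=1)         # point estimates of slope/intercept
sigma = np.std(y - (beta1 * x + beta0))        # residual standard deviation

# Posterior predictive check (sketch): simulate replicated data sets from the
# fitted model and compare a summary statistic with the observed value.
reps = np.array([
    (beta1 * x + beta0 + rng.normal(0, sigma, size=len(x))).std()
    for _ in range(1000)
])
obs_sd = y.std()
p_value = (reps >= obs_sd).mean()   # predictive p-value for the sd of the weights
print(f"observed sd = {obs_sd:.1f}, predictive p ~ {p_value:.2f}")
```

A predictive p-value near 0 or 1 would signal that the model systematically fails to reproduce this aspect of the data; a middling value, as here, is consistent with descriptive adequacy for that statistic.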
Data analysis without parametric models?
• One situation in which it might appear that parameterized models are not used is
with so-called nonparametric models. But they do actually have parameters.

• Suppose we want to describe the weights of dogs, sampled at random from the
entire spectrum of dog breeds. There are probably clusters of weights, one for each
breed, and each cluster has its own parameters. The number of parameters in the
model is inferred and can grow to infinity with infinite data.

• There are a variety of situations in which it might seem at first that no
parameterized model would apply. In the case of disease diagnosis, the parameters
refer to discrete states instead of continuous distributions.

• Finally, there might be some situations in which the analyst is loath to commit to
any parameterized model of the data, even a tremendously flexible, infinitely
parameterized model. One approach that tries to make inferences from data without
using a model is resampling, or bootstrapping.
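Bootstrapping can be sketched in a few lines: resample the observed data with replacement and recompute the statistic of interest to gauge its sampling variability. The data here are hypothetical:

```python
import numpy as np

rng = np.random.default_rng(3)
data = rng.normal(4.1, 0.7, size=57)   # hypothetical sample of 57 measurements

# Bootstrap (sketch): resample with replacement and recompute the statistic
# of interest, here the mean, to estimate its sampling variability.
boot_means = np.array([
    rng.choice(data, size=len(data), replace=True).mean()
    for _ in range(2000)
])
print(f"bootstrap sd of the mean ~ {boot_means.std():.3f}")
```

Note that the bootstrap summarizes sampling variability without positing a parameterized generator, which is precisely why it appeals to analysts reluctant to commit to a model.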
