Thanks to visit codestin.com
Credit goes to www.scribd.com

0% found this document useful (0 votes)
5 views13 pages

Stats & Probability

The document explains the concepts of discrete and continuous random variables, their probability distributions, and how to compute mean, variance, and standard deviation. It also covers sampling distributions, the central limit theorem, and properties of normal distribution. Key formulas and properties are summarized, along with references for further learning.

Uploaded by

alexandrite0423
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
5 views13 pages

Stats & Probability

The document explains the concepts of discrete and continuous random variables, their probability distributions, and how to compute mean, variance, and standard deviation. It also covers sampling distributions, the central limit theorem, and properties of normal distribution. Key formulas and properties are summarized, along with references for further learning.

Uploaded by

alexandrite0423
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 13

Computing the Mean, Variance &

SD of Random Variables
Let us first define:

Discrete Variables - can take on only certain values and it must not be a
fraction, must be a whole number, and must be countable or have a
limit. (Example: Defective chairs produced in a factory )
Continuous Variable - can take on any value within a range of values,
determined by chance, has no limit. (Example: The average amount of
electricity consumed per household per month)
Discrete probability distribution or a Probability Mass Function -
consists of the values a random variable can assume and the
corresponding probabilities of the values.
Histogram - a graphical method for displaying the shape of a
distribution.

A random variable is….

a function that associates a real number to each element in the sample


space.
It is a variable whose values are determined by chance.

It is a discrete random variable IF

its set of possible outcomes is countable.


It is represented by countable data.

It is a continuous random variable IF

it takes on values on a continuous scale. It is not defined at specific


:
values but by an interval of values.

Properties of a Probability Distribution


The probability of each value of the random variable must be between
or equal to 0 and 1.

This sort of means the values are always fractions or decimals, or is the
number 0 or 1.

In symbol form: 0 ≤ P(X) ≤ 1

The sum of the probabilities of all values of the random variable must
be equal to 1.

When you add them together, it totals to 1.

In symbol form: ƩP(X)=1

!! Remember this !!
Number of Possible Combinations: NcR on the calculator
Possible OUTCOMES:
:
The table we will be using:

Possible Outcomes Probability


X•P(X) X-µ (X-µ)2 (X-µ)2 •P(X)
X P(X)

Mean: Variance: Standard


Deviation:

The mean (µ) is the summation of X•P(X). This means that whatever
values you have under the X•P(X) column, add them together to get
your mean.

The variance is the summation of (X-µ)2 •P(X). This means that


whatever values you have under the (X-µ)2 •P(X) column, add them
together to get your variance.

The standard deviation is the variance squared.

Example:
:
Sampling distribution of the Sample Means
:
Sampling distribution is a frequency distribution using the means
computed from all possible random samples of a specific size.

IMPORTANT NOTE:

If we can count, then the given population is finite.


If we cannot count the population, then the given population is infinite.

What does it mean to “take a sample from a distribution”?

It is essentially picking a random number based on the probabilities


described by the curve.

Properties of Sampling Distribution of the


Sample Means

equal to the population means. That is 𝜇𝑥 = 𝜇.


1. The mean of the sampling distribution of the sample means is

2. .The variance of the sampling distribution of the sample means


is given by:

3. The standard deviation of the sampling distribution of the


sample means is just the SQUARE ROOT of its variance.

It is also known as the standard error of the mean.


It measures the degree of accuracy of the sample mean as an estimate
of the population mean
:
Solving with the Short Method:

Central limit theorem “Even if you’re not


normal, the average is normal!”
In this theorem,
Means are normally distributed.
Even if the data is exponentially distributed, the mean is not
exponentially distributed. It is always normal.
As long as the sample size of n is large, you will always get a normally
:
distributed mean.

Essentially, no matter what type of distribution you take data from, the
means are normally distributed.

NOTE: for this to work, you need to be able to calculate a mean from
your sample in the first place.

Normal Distribution

It is also referred to as the “normal” or “Gaussian distribution.


Also referred to as a “bell shaped curve” because it looks like a bell and
is symmetrical.

The y-axis represents the relative probability of observing if something


is most or least likely to occur. (ex. In the image, it is shown how likely
or less likely it is to find someone who is short, tall, or average height.)

The look of the curve is heavily influenced by the standard deviation.


Normal curves are drawn in a way that 95% of the measurements fall
between +2 or -2 standard deviations around the mean.
The average is where the center of the curve goes. The standard
deviation is what tells you how wide the curve goes.
The wider the curve, the shorter. The narrower the curve, the taller.

The 6 Properties of Normal Distribution,


:
Summarized
I use this mnemonic to remember: BADAAS.
Which is essentially:
1. Bell-Shaped. Pretty self explanatory
2. Averages. Mean, median, and mode are all in
the center.
3. Dispersion. The width of a curve is
determined by its standard deviation.
4. Asymptotic. The ends of the curve always
approaches the x axis, but never touches it.
It is asymptotic to the baseline.
5. Area 1. The area under the curve is 1.
:
Dummied down and in summary,
area is ALWAYS positive
:
two numbers, get both z scores and then…
- if same region, subtract
- if different region, add
“to the right, to the left…” = half region is 0.5, and then treat it the same
as if it were two numbers

Percentile
A measure of relative standing.
It is a descriptive measure of the relationship of a measurement to the
rest of the data.
Percentile divides a set of data into 100 equal parts.
Percentiles always refer to quantities below or less than the percentile
rank

NOTE: 0.99 -> 99% and etc.

Looking for percentile: Add 0.5 to the area and round to nearest
hundredths
Looking for Z-score: Minus percentile from 0.5. Nearest value to answer
is Z score
:
SUMMARIZED NOTES

SUMMARIZED formulas
:
VIDEOS YOU CAN REFERENCE FURTHER
The Normal Distribution, Clearly Explained!!!

The Central Limit Theorem, Clearly Explained!!!


:
The Main Ideas behind Probability Distributions

Sampling Distribution of Sample Mean Using Central Limit Theorem |


Statistics and Probability

Central Limit Theorem | Statistics and Probability

Problems Involving Central Limit Theorem | Statistics and Probability

Central limit theorem | Inferential statistics | Probability and Statistics |


Khan Academy

Sampling distribution of the sample mean | Probability and Statistics |


Khan Academy

Sampling distribution of the sample mean 2 | Probability and Statistics |


Khan Academy

Central Limit Theorem - Sampling Distribution of Sample Means - Stats


& Probability

Standard error of the mean | Inferential statistics | Probability and


Statistics | Khan Academy

Sampling distribution example problem | Probability and Statistics |


Khan Academy
:

You might also like