Statistics
Discrete Random Variables
Where We’ve Been
Using probability to make inferences
about populations
Measuring the reliability of the
inferences
McClave, Statistics, 11th ed. Chapter 4: Discrete 2
Random Variables
Where We’re Going
Develop the notion of a random
variable
Numerical data and discrete random
variables
Discrete random variables and their
probabilities
McClave, Statistics, 11th ed. Chapter 4: Discrete 3
Random Variables
4.1: Two Types of Random
Variables
A random variable is a variable hat
assumes numerical values associated
with the random outcome of an
experiment, where one (and only one)
numerical value is assigned to each
sample point.
McClave, Statistics, 11th ed. Chapter 4: 4
Discrete Random Variables
4.1: Two Types of Random
Variables
A discrete random variable can assume a
countable number of values.
Number of steps to the top of the Eiffel Tower*
A continuous random variable can
assume any value along a given interval of
a number line.
The time a tourist stays at the top
once s/he gets there
*Believe it or not, the answer ranges from 1,652 to 1,789. See Great Buildings
McClave, Statistics, 11th ed. Chapter 4: 5
Discrete Random Variables
4.1: Two Types of Random
Variables
Discrete random variables
Number of sales
Number of calls
Shares of stock
People in line
Mistakes per page
Continuous random
variables
Length
Depth
Volume
Time
Weight
McClave, Statistics, 11th ed. Chapter 4: 6
Discrete Random Variables
4.2: Probability Distributions
for Discrete Random Variables
The probability distribution of a
discrete random variable is a graph,
table or formula that specifies the
probability associated with each
possible outcome the random variable
can assume.
p(x) ≥ 0 for all values of x
p(x) = 1
McClave, Statistics, 11th ed. Chapter 4: 7
Discrete Random Variables
4.2: Probability Distributions
for Discrete Random Variables
x P(x)
Say a random variable 1 .30
x follows this pattern: 2 .21
3 .15
p(x) = (.3)(.7)x-1
4 .11
for x > 0. 5 .07
This table gives the 6 .05
probabilities (rounded
7 .04
to two digits) for x
8 .02
between 1 and 10.
9 .02
10 .01
McClave, Statistics, 11th ed. Chapter 4: 8
Discrete Random Variables
4.3: Expected Values of
Discrete Random Variables
The mean, or expected value, of a
discrete random variable is
E( x) xp( x).
McClave, Statistics, 11th ed. Chapter 4: 9
Discrete Random Variables
4.3: Expected Values of
Discrete Random Variables
The variance of a discrete random
variable x is
E[( x ) ] ( x ) p( x).
2 2 2
The standard deviation of a discrete
random variable x is
E[( x ) ]
2 2
(x ) 2
p( x).
McClave, Statistics, 11th ed. Chapter 4: 10
Discrete Random Variables
4.3: Expected Values of
Discrete Random Variables
Chebyshev’s Rule Empirical Rule
P( x ) ≥0 .68
P( 2 x 2 ) ≥ .75 .95
P( 3 x 3 ) ≥ .89 1.00
McClave, Statistics, 11th ed. Chapter 4: 11
Discrete Random Variables
4.3: Expected Values of
Discrete Random Variables
In a roulette wheel in a U.S. casino, a $1 bet on
“even” wins $1 if the ball falls on an even number
(same for “odd,” or “red,” or “black”).
The odds of winning this bet are 47.37%
P( win $1) .4737
P(lose $1) .5263
$1 .4737 $1 .5263 .0526
.9986
On average, bettors lose about a nickel for each dollar they put down on a bet like this.
(These are the best bets for patrons.)
McClave, Statistics, 11th ed. Chapter 4: 12
Discrete Random Variables
4.4: The Binomial Distribution
A Binomial Random Variable
n identical trials
Two outcomes: Success or Failure
P(S) = p; P(F) = q = 1 – p
Trials are independent
x is the number of Successes in n trials
McClave, Statistics, 11th ed. Chapter 4: 13
Discrete Random Variables
4.4: The Binomial Distribution
A Binomial Random
Variable
n identical trials Flip a coin 3 times
Two outcomes: Success Outcomes are Heads or Tails
or Failure
P(S) = p; P(F) = q = 1 – p P(H) = .5; P(F) = 1-.5 = .5
Trials are independent A head on flip i doesn’t
x is the number of S’s in n change P(H) of flip i + 1
trials
McClave, Statistics, 11th ed. Chapter 4: 14
Discrete Random Variables
4.4: The Binomial Distribution
Results of 3 flips Probability Combined Summary
HHH (p)(p)(p) p3 (1)p3q0
HHT (p)(p)(q) p2q
HTH (p)(q)(p) p2q (3)p2q1
THH (q)(p)(p) p2q
HTT (p)(q)(q) pq2
THT (q)(p)(q) pq2 (3)p1q2
TTH (q)(q)(p) pq2
TTT (q)(q)(q) q3 (1)p0q3
McClave, Statistics, 11th ed. Chapter 4: 15
Discrete Random Variables
4.4: The Binomial Distribution
The Binomial Probability Distribution
p = P(S) on a single trial
q=1–p
n = number of trials
x = number of successes
n x n x
P( x) p q
x
McClave, Statistics, 11th ed. Chapter 4: 16
Discrete Random Variables
4.4: The Binomial Distribution
The Binomial Probability Distribution
n x n x
P( x) p q
x
McClave, Statistics, 11th ed. Chapter 4: 17
Discrete Random Variables
4.4: The Binomial Distribution
Say 40% of the n x n x
class is female. P( x) p q
x
What is the 10 6 106
probability that 6 (.4 )(.6 )
of the first 10 6
students walking 210(.004096)(.1296)
in will be female? .1115
McClave, Statistics, 11th ed. Chapter 4: 18
Discrete Random Variables
4.4: The Binomial Distribution
A Binomial Random Variable has
Mean np
Variance npq
2
Standard Deviation npq
McClave, Statistics, 11th ed. Chapter 4: 19
Discrete Random Variables
4.4: The Binomial Distribution
For 1,000 coin flips,
np 1000 .5 500
npq 1000 .5 .5 250
2
npq 250 16
The actual probability of getting exactly 500 heads out of 1000 flips is
just over 2.5%, but the probability of getting between 484 and 516 heads
(that is, within one standard deviation of the mean) is about 68%.
McClave, Statistics, 11th ed. Chapter 4: 20
Discrete Random Variables
4.5: The Poisson Distribution
Evaluates the probability of a (usually
small) number of occurrences out of many
opportunities in a …
Period of time
Area
Volume
Weight
Distance
Other units of measurement
McClave, Statistics, 11th ed. Chapter 4: 21
Discrete Random Variables
4.5: The Poisson Distribution
x
e
P( x)
x!
= mean number of occurrences in the
given unit of time, area, volume, etc.
e = 2.71828….
µ=
2 =
McClave, Statistics, 11th ed. Chapter 4: 22
Discrete Random Variables
4.5: The Poisson Distribution
Say in a given stream there are an average
of 3 striped trout per 100 yards. What is the
probability of seeing 5 striped trout in the
next 100 yards, assuming a Poisson
distribution?
x
e 3e 5 3
P( x 5) .1008
x! 5!
McClave, Statistics, 11th ed. Chapter 4: 23
Discrete Random Variables
4.5: The Poisson Distribution
How about in the next 50 yards, assuming a
Poisson distribution?
Since the distance is only half as long, is only
half as large.
x
e 1.5 e 5 1.5
P( x 5) .0141
x! 5!
McClave, Statistics, 11th ed. Chapter 4: 24
Discrete Random Variables
4.6: The Hypergeometric
Distribution
In the binomial situation, each trial was
independent.
Drawing cards from a deck and replacing
the drawn card each time
If the card is not replaced, each trial
depends on the previous trial(s).
The hypergeometric distribution can be
used in this case.
McClave, Statistics, 11th ed. Chapter 4: 25
Discrete Random Variables
4.6: The Hypergeometric
Distribution
Randomly draw n elements from a set
of N elements, without replacement.
Assume there are r successes and N-r
failures in the N elements.
The hypergeometric random variable
is the number of successes, x, drawn
from the r available in the n selections.
McClave, Statistics, 11th ed. Chapter 4: 26
Discrete Random Variables
4.6: The Hypergeometric
Distribution
r N r
x n x
P( x)
N
n
where
N = the total number of elements
r = number of successes in the N elements
n = number of elements drawn
X = the number of successes in the n elements
McClave, Statistics, 11th ed. Chapter 4: 27
Discrete Random Variables
4.6: The Hypergeometric
Distribution
r N r
x n x
P( x)
N
n
nr
N
r ( N r ) n( N n)
2
N ( N 1)
2
McClave, Statistics, 11th ed. Chapter 4: 28
Discrete Random Variables
4.6: The Hypergeometric
Distribution
Suppose a customer at a pet store wants to buy two hamsters
for his daughter, but he wants two males or two females (i.e.,
he wants only two hamsters in a few months)
If there are ten hamsters, five male and five female, what is the
probability of drawing two of the same sex? (With hamsters,
it’s virtually a random selection.)
5 10 5
2 2 2 (10)(1)
P( M 2) P( F 2) .22
10 45
2
P( M 2 or F 2) P( M 2) P( F 2) 2 .22 .44
McClave, Statistics, 11th ed. Chapter 4: 29
Discrete Random Variables