Thanks to visit codestin.com
Credit goes to www.scribd.com

0% found this document useful (0 votes)
15 views20 pages

Module 1 Topic 3

The document provides an overview of the Poisson Distribution, a discrete probability distribution used to model the number of events occurring in a fixed interval of time or space, under certain assumptions. It includes the probability mass function, mean, variance, and applications in fields such as AI and data science, along with examples and problems for practical understanding. The document also explains the relationship between the Poisson and Binomial distributions and provides detailed derivations for mean and variance.

Uploaded by

aayushrasaily04
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
15 views20 pages

Module 1 Topic 3

The document provides an overview of the Poisson Distribution, a discrete probability distribution used to model the number of events occurring in a fixed interval of time or space, under certain assumptions. It includes the probability mass function, mean, variance, and applications in fields such as AI and data science, along with examples and problems for practical understanding. The document also explains the relationship between the Poisson and Binomial distributions and provides detailed derivations for mean and variance.

Uploaded by

aayushrasaily04
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 20

Module I: Probability Distributions

Topic 3: Poisson Distribution

Dr. P. Rajendra, Professor, Department of Mathematics,


CMRIT, Bengaluru.
Introduction to Poisson Distribution

▶ The Poisson Distribution is a discrete probability


distribution that expresses the probability of a given number
of events occurring in a fixed interval of time or space.
▶ The key assumptions are:
▶ Events occur independently of each other.
▶ The average rate (λ) at which events occur is constant.
▶ Two events cannot occur at exactly the same time.
▶ The Poisson Distribution is particularly useful for modeling
the number of times an event occurs within a specific time
period or spatial area.

Example
The Poisson Distribution can model the number of user clicks on a
webpage per minute or the number of requests to a server per hour.
Poisson Distribution (Continued..)
▶ The probability mass function (PMF) of the Poisson
Distribution is given by:

λk e −λ
P(X = k) =
k!
▶ Here:
▶ X is the random variable representing the number of events.
▶ k is the actual number of events that occur.
▶ λ is the average rate (mean number of events in the interval).
▶ e is the base of the natural logarithm (approximately 2.71828).
▶ The PMF gives the probability of observing exactly k events
in a fixed interval.

Example
Suppose a server receives an average of 5 requests per minute.
The Poisson Distribution can be used to find the probability that
the server receives exactly 8 requests in a given minute.
Poisson Distribution (Continued..)

Let X be the discrete random variable for any real value λ, such
that the probability mass function of the Poisson distribution can
be defined as:
( −λ x
e λ
P(X = x) = P(x) = x! , x ≥ 0
0, otherwise
where λ is called the parameter.
i) P(X = x) = P(x) ≥ 0
P∞ P∞ e −λ λx
ii) x=0 P(x) = x=0 x! =1
iii) Mean, µ = np = λ

iv) Variance, σ 2 = λ, S.D = λ
Relationship to Other Distributions

▶ The Poisson Distribution can be derived as a limiting case of


the Binomial Distribution when the number of trials n
becomes large and the probability of success p becomes small,
such that np = λ remains constant.
▶ For large values of λ, the Poisson Distribution approaches a
Normal Distribution due to the Central Limit Theorem.

Example
If a machine learning model is evaluated over a large dataset with
rare events (such as detecting rare failures), the distribution of the
number of detected failures might be approximated by a Poisson
Distribution.
Applications in AI and Data Science

▶ Event Prediction: The Poisson Distribution is used to


predict the number of events (e.g., clicks, purchases, or
system failures) within a specific time frame.
▶ Anomaly Detection: In network security, the Poisson
Distribution can model the normal number of requests to a
server. Significant deviations might indicate anomalies.
▶ Natural Language Processing (NLP): Used to model the
number of times a rare word occurs in a large text corpus.
▶ Queueing Theory: The Poisson Distribution models the
number of arrivals of jobs or tasks in systems like customer
service queues or server request handling.
Derivation for Mean and Variance of a Poisson Distribution
Let X be a random variable following a Poisson distribution with
parameter λ. The probability mass function (pmf) of X is given by:

e −λ λx
P(X = x) = , x = 0, 1, 2, . . .
x!
1. Mean of a Poisson Distribution
The mean (expected value) E (X ) of a Poisson distribution can be
derived as follows:
∞ ∞
X X e −λ λx
E (X ) = x · P(X = x) = x·
x!
x=0 x=0
λx
Rewriting x as x = x · x(x−1)! , we have:

X λx
E (X ) = e −λ
(x − 1)!
x=1
Let y = x − 1, then x = y + 1 and as x goes from 1 to ∞, y goes
from 0 to ∞:

X λy
E (X ) = e −λ λ
y!
y =0
P∞ λy
Using the fact that y =0 y ! = e λ , we get:

E (X ) = e −λ λe λ = λ
Therefore, the mean of a Poisson distribution is:

E (X ) = λ
2. Variance of a Poisson Distribution
The variance Var(X ) of a Poisson distribution is given by
E (X 2 ) − [E (X )]2 .
First, we find E (X 2 ):
∞ ∞
X X e −λ λx
E (X 2 ) = x 2 · P(X = x) = x2 ·
x!
x=0 x=0
Rewriting x 2 as x(x − 1) + x, we have:
∞ ∞
−λ
X λx X λx−1
2
E (X ) = e + e −λ λ
(x − 2)! (x − 1)!
x=2 x=1

Letting y = x − 2 and z = x − 1, we get:


∞ ∞
X λy X λz
E (X 2 ) = e −λ λ2 + e −λ λ
y! z!
y =0 z=0

E (X 2 ) = λ2 + λ
Then, the variance is:

Var(X ) = E (X 2 ) − [E (X )]2 = (λ2 + λ) − λ2 = λ


Thus, the variance of a Poisson distribution is:

Var(X ) = λ
Problem 1

The number of accidents in a year for taxi drivers in a city follows


a Poisson distribution with a mean of 3. Out of 1000 taxi drivers,
find approximately the number of drivers with:
(i) No accident in a year.
(ii) More than 3 accidents in a year.

Solution:
Given that X follows a Poisson distribution with mean λ = 3.

e −λ λx
P(X = x) =
x!
where λ = 3.
(i) Number of drivers with no accident in a year:

e −3 · 30
P(X = 0) = = e −3 ≈ 0.0498
0!
The expected number of drivers with no accidents is:

1000 × P(X = 0) = 1000 × 0.0498 = 49.8 ≈ 50


So, approximately 50 drivers are expected to have no accidents in a
year.

(ii). Number of drivers with more than 3 accidents in a year:

We need to calculate P(X > 3), which is:

P(X > 3) = 1 − P(X ≤ 3)


First, calculate P(X ≤ 3):

P(X ≤ 3) = P(X = 0) + P(X = 1) + P(X = 2) + P(X = 3)


30
P(X = 0) = e −3 = e −3 ≈ 0.0498
0!
31
P(X = 1) = e −3 = 3e −3 ≈ 0.1494
1!
3 2 9e −3
P(X = 2) = e −3 = ≈ 0.2240
2! 2
33 27e −3
P(X = 3) = e −3 = ≈ 0.2240
3! 6
P(X ≤ 3) ≈ 0.0498 + 0.1494 + 0.2240 + 0.2240 = 0.6472
Thus,

P(X > 3) = 1 − 0.6472 = 0.3528


The expected number of drivers with more than 3 accidents is:

1000 × P(X > 3) = 1000 × 0.3528 = 352.8 ≈ 353


So, approximately 353 drivers are expected to have more than 3
accidents in a year.
Problem 2
In a certain factory turning out razor blades, there is a small
1
probability of 500 for any blade to be defective. The blades are
supplied in packets of 10. Use the Poisson distribution to calculate
the approximate number of packets containing:
(i) No defective blades.
(ii) 2 defective blades.
(iii) 3 defective blades.
in the consignment of 10,000 packets.

Solution:
Let X denote the number of defective blades in a packet. The
1
probability of a blade being defective is p = 500 . The mean
number of defective blades per packet, λ, can be calculated as:
1 10
λ = np = 10 × = = 0.02
500 500
The Poisson probability mass function is given by:

e −λ λx
P(X = x) =
x!
where λ = 0.02.

(i). Number of packets with no defective blades:

e −0.02 · 0.020
P(X = 0) = = e −0.02 ≈ 0.9802
0!
The expected number of packets with no defective blades is:

10000 × P(X = 0) = 10000 × 0.9802 = 9802


So, approximately 9802 packets are expected to have no defective
blades.
(ii). Number of packets with 2 defective blades:

e −0.02 · 0.022
P(X = 2) =
2!
e −0.02 · 0.0004 0.9802 × 0.0004
= ≈ = 0.000196
2 2
The expected number of packets with 2 defective blades is:

10000 × P(X = 2) = 10000 × 0.000196 = 1.96


So, approximately 2 packets are expected to have 2 defective
blades.
(iii). Number of packets with 3 defective blades:

e −0.02 · 0.023
P(X = 3) =
3!
e −0.02 · 0.000008 0.9802 × 0.000008
= ≈ = 0.00000131
6 6
The expected number of packets with 3 defective blades is:

10000 × P(X = 3) = 10000 × 0.00000131 = 0.0131


So, approximately 0 packets are expected to have 3 defective
blades (very unlikely).
Problem 3

If the probability of a bad reaction from a certain injection is


0.001, determine the probability that out of 2000 individuals, more
than 2 will get a bad reaction.

Solution:
Let X denote the number of individuals who get a bad reaction.
The probability of a bad reaction for an individual is p = 0.001.
For 2000 individuals, we have n = 2000.
The mean number of bad reactions, λ, is given by:

λ = np = 2000 × 0.001 = 2
Since n is large and p is small, we can use the Poisson
approximation to the binomial distribution, where X ∼ Poisson(λ).
The probability mass function of a Poisson distribution is:

e −λ λx
P(X = x) =
x!
where λ = 2.
To find the probability that more than 2 individuals will get a bad
reaction, we need to calculate P(X > 2):

P(X > 2) = 1 − P(X ≤ 2)


First, we calculate P(X ≤ 2) using the cumulative distribution
function:

P(X ≤ 2) = P(X = 0) + P(X = 1) + P(X = 2)


Calculating each term:

e −2 · 20
P(X = 0) = = e −2 ≈ 0.1353
0!
e −2 · 21
P(X = 1) = = 2e −2 ≈ 0.2707
1!

e −2 · 22
P(X = 2) = = 2e −2 ≈ 0.2707
2!
Summing these probabilities:

P(X ≤ 2) = 0.1353 + 0.2707 + 0.2707 = 0.6767


Now, calculate P(X > 2):

P(X > 2) = 1 − 0.6767 = 0.3233


Therefore, the probability that more than 2 individuals will get a
bad reaction is 0.3233.
Assignment Problems
1.The probability that a news reader commits no mistakes in
reading the news is e13 . Find the probability that on a particular
news broadcast he commits:
(i) Only 2 mistakes [Ans: 0.2240]
(ii) More than 3 mistakes [Ans: 0.3500]
(iii) At most 3 mistakes [Ans: 0.6500]
2.Suppose 300 misprints are randomly distributed throughout a
book of 500 pages. Find the probability that a given page contains:
(i) Exactly 3 misprints [Ans: 0.0197]
(ii) Less than 3 misprints [Ans: 0.9768]
(iii) 4 or more misprints [Ans: 0.0033]
3. A certain screw-making machine produces an average of 2
defective screws out of 100 and packs them in boxes of 500. Find
the probability that the box contains:
(i) 3 defective screws [Ans: 0.0075]
(ii) At least 1 defective screw [Ans: 0.9999]
(iii) Between 2 and 4 defective screws [Ans: 0.0287]

You might also like