0% found this document useful (0 votes)

156 views60 pages

Probability for Data Science Students

The document outlines the principles of probability theory and its applications in data science, emphasizing key concepts such as random phenomena, sample spaces, events, and set operations. It discusses various approaches to defining probability, including classical and frequency methods, and introduces axiomatic definitions with associated theorems. The text serves as a foundational resource for understanding probability in the context of data science, supported by recommended textbooks and references.

Uploaded by

Somasekhar Lalam

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

156 views60 pages

Probability for Data Science Students

Uploaded by

Somasekhar Lalam

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 60

Probability Theory for Data

Science

ISHAPATHIK DAS
IIT TIRUPATI
TIRUPATI, AP, INDIA

Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
To introduce the core principles
of probability theory and
fundamental statistical
techniques, and to demonstrate
methods for solving practical
probability problems and
statistical applications.

Textbook(s): Ross S, A First course in Probability, Prentice Hall of India (2009).

Reference(s):
1. Chung K L, Elementary Probability Theory with Stochastic Process, Springer
Verlag (1974).
2. Drake A, Fundamentals of Applied Probability Theory, McGraw-Hill (1967).
3. Kreyszig E, Advanced Engineering Mathematics, John Wiley & Sons (2010).
4. Hsu H P, Schaum's outline of theory and problems of probability, random
variables, and random processes, McGraw-Hill (1997).
5. Gupta S C, Kapoor V K, Fundamentals of mathematical statistics, Sultan Chand
& Sons (2020).

Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
Probability: Probability models and
axioms, conditioning and Bayes'
rule, independence discrete random
variables; probability mass
functions; expectations, examples,
multiple discrete random variables:
joint PMFs, expectations,
conditioning, independence,
continuous random variables,
probability density functions,
expectations, examples, multiple
continuous random variables,
transformation of random variables,
covariance and correlation, iterated
expectations, convolution; notion of
convergence, weak law of large
numbers, central limit theorem.

Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
• A phenomenon refers to a fact, occurrence, or
circumstance that can be observed or is
observable.

• For example, natural phenomena include weather

patterns, fog, thunder, tornadoes, biological
processes, and decomposition.

• In scientific terms, a phenomenon encompasses

any observable event, often involving the use of
instruments to observe, record, or collect data.

Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
Non-
Deterministic
Phenomena
Deterministic

Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
There is a mathematical model that enables the
"perfect" prediction of a phenomenon’s outcome.

Numerous examples of this can be found in the

exact sciences, such as Physics and Chemistry.

Consider predicting the amount of money in a

bank account.

If you know the initial deposit and the interest rate,

you can accurately determine the account
balance after one year.

Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
• There is no mathematical model that enables
"perfect" prediction of a phenomenon’s
outcome.

• These phenomena can be divided into two

groups:
• Random phenomena: While individual
outcomes cannot be predicted, the
long-term outcomes exhibit statistical
regularity.
• Haphazard phenomena: Outcomes are
unpredictable, and there is no long-
term statistical regularity.

Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
Random
Non-
Deterministic
Phenomena Haphazard
Deterministic

Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
• While individual outcomes cannot be predicted, the
long-term results exhibit statistical regularity.

• For example, when rolling a die, the possible

outcomes are S = {1, 2, 3, 4, 5, 6}.

• Although the outcome of a single roll is

unpredictable, over many rolls, each number will
appear approximately 1/6 of the time.

• This regularity is due to the symmetry of a fair die,

where each side is equally likely to occur

Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
In this case,
outcomes are
unpredictable and do
not exhibit statistical
regularity over the
long run.

For example, • It is impossible to predict which number they might choose at any given
time.
consider a scenario • We cannot determine the probability of observing any specific value
from 1 to 6.
where someone is • We don't know if the person has a favorite number that they choose more
choosing numbers frequently.
• We have no insight into the process by which the person is selecting the
from 1 to 6. numbers.

Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
The set of all possible outcomes

Probability Theory for Data

•
of a random phenomena is called
the sample space S.

Science
• Examples:

1. Random Experiment: Tossing a coin. All

Dr. Ishapathik Das, IIT Tirupati

possible outcomes: 𝑺𝟏 ={Head, Tail}.

2. Random Experiment: Rolling a die. All

possible outcomes: 𝑺𝟐 = {𝟏, 𝟐, 𝟑, 𝟒, 𝟓, 𝟔}
An event is a subset of the sample space S.

Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
• Random experiment: rolling a die.

• Sample space 𝑺𝟐 = {𝟏, 𝟐, 𝟑, 𝟒, 𝟓, 𝟔}.

• An event E={2, 4, 6}, representing

the outcome of rolling an even
number.

Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
Data Science
Probability Theory for

Let S be a non-empty set. A class 𝑪 of subsets of S is called a

field if it contains S itself and is closed under the formation of
complements and finite unions:
1. 𝑆 ∈ 𝑪;
2. 𝐴 ∈ 𝑪 implies 𝐴𝑐 ∈ 𝑪;
3. 𝐴, 𝐵 ∈ 𝑪 implies 𝐴 ∪ 𝐵 ∈ 𝑪.
Dr. Ishapathik Das, IIT
Tirupati
Let S be a non-empty set. A class 𝑪 of subsets of S is called a
field if it contains S itself and is closed under the formation of
complements and countable unions:
• 𝑆 ∈ 𝑪;
• 𝐴 ∈ 𝑪 implies 𝐴𝑐 ∈ 𝑪;
• 𝐴1 , 𝐴2 , … ∈ 𝑪 implies A1 ∪ 𝐴2 ∪ ⋯ ∈ 𝑪.

Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
• The null event (empty event, impossible event) is denoted by Φ.
• Φ represents the event that contains no outcomes.
• The entire event (certain event) is denoted by S.
• S represents the event that contains all possible outcomes.

Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
Union of Sets
The union of two sets A and B (A ∪ B) is the set of elements that are in either A, B, or both. This represents the
combined collection of all elements from both sets.

Definition Example
A ∪ B = {x | x ∈ A or x ∈ B} If A = {1, 2, 3} and B = {3, 4, 5}, then A ∪ B = {1, 2,
3, 4, 5}.

A B
Intersection of Sets
The intersection of two sets A and B (A ∩ B) is the set of elements that are common to both A and B.
This represents the elements that belong to both sets simultaneously.

Definition Example
A ∩ B = {x | x ∈ A and x ∈ B} If A = {1, 2, 3} and B = {3, 4, 5}, then A ∩ B = {3}.

A B

𝐴∩𝐵
Difference of Sets
The difference of two sets A and B (A - B) is the set of elements that are in A but not in B. This represents the
elements that belong to A but not to B.

Definition Example
A - B = {x | x ∈ A and x ∉ B} If A = {1, 2, 3} and B = {3, 4, 5}, then A - B = {1, 2}.

𝐴−𝑩 B
Complement of a Set
The complement of a set A (A') is the set of elements that are not in A. This represents the elements that belong
to the universal set but not to A.

Definition Example
A' = {x | x ∈ U and x ∉ A} If the universal set U = {1, 2, 3, 4, 5} and A = {1, 2,
3}, then A' = {4, 5}.

𝐴′

A
Properties of Set Operations
Set operations exhibit several fundamental properties, including commutativity, associativity, and distributivity,
which are crucial for understanding and manipulating set relationships.

Distributive
Commutative A ∪ (B ∩ C) = (A ∪ B) ∩ (A ∪ C) and A ∩ (B ∪ C) =
A ∪ B = B ∪ A and A ∩ B = B ∩ A (A ∩ B) ∪ (A ∩ C)

1 2 3

Associative
(A ∪ B) ∪ C = A ∪ (B ∪ C) and (A ∩ B) ∩ C = A ∩ (B
∩ C)
Two events A and B are said to be mutually exclusive
events if 𝐴 ∩ 𝐵 = Φ.

A B

Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
Dr. Ishapathik Das, IIT Tirupati

Probability Theory for Data Science

There are two important approaches for defining the probability of an
event.

1. Classical approach: If an event can happen in ℎ different ways out of

ℎ
a total of 𝑛 equally likely possible ways, the probability of the event is 𝑛.

2. Frequency approach: After conducting an experiment 𝑛 times (where

𝑛 is very large) and observing the event occur in ℎ of those trials, the
ℎ
probability of the event is given by 𝑛. This is also known as the empirical
probability of the event.

Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
Dr. Ishapathik Das, IIT Tirupati

Both the classical and frequency approaches

have significant drawbacks.

The phrase "equally likely" is ambiguous.

The term "large number" is ambiguous.

This has led mathematicians to adopt an

axiomatic approach for defining probability.

Probability Theory for Data Science

• Let S be a sample space and 𝐶 be a sigma field.

Probability Theory for Data Science

Dr. Ishapathik Das, IIT Tirupati

• To each event 𝐴 ∈ 𝐶 in the class 𝐶 of events, we associate a real

number P(A).

• The P is called a probability function, and P(A) is the probability of

the event A, if the following axioms are satisfied.
• Axiom 1: For any 𝐴 ∈ 𝐶, 𝑃 𝐴 ≥ 0.

• Axiom 2: For certain event 𝑆 ∈ 𝐶, P(S)=1.

• Axiom 3: If 𝐴1 , 𝐴2 , … , 𝐴𝑛 , … are countable collections of pairwise mutually events in the

class 𝐶,
𝑃 𝐴1 ∪ 𝐴2 ∪ ⋯ = 𝑃 𝐴1 + 𝑃 𝐴2 + ⋯ .

• In particular, if 𝐴1 and 𝐴2 are two mutually exclusive events in 𝐶,

𝑃 𝐴1 ∪ 𝐴2 = 𝑃 𝐴1 + 𝑃 𝐴2 .

Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
Theorem 1.1: If 𝐴1 ⊂ 𝐴2 , then 𝑃 𝐴2 − 𝐴1 = 𝑃 𝐴2 − 𝑃 𝐴1 , and 𝑃 𝐴1 ≤ 𝑃 𝐴2 .

Theorem 1.2: 𝑃 𝐴 ∈ 0,1 , for any event 𝐴 ∈ 𝐶.

Theorem 1.3: 𝑃 Φ = 0, where Φ is the impossible event.

Theorem 1.4: 𝑃 𝐴′ = 1 − 𝑃 𝐴 , where 𝐴′ is the complement of 𝐴.

Theorem 1.5: Let 𝐴1 , 𝐴2 , … , 𝐴𝑛 be pairwise mutually exclusive events. Then

𝑃 𝐴1 ∪ 𝐴2 ∪ ⋯ 𝐴𝑛 = 𝑃 𝐴1 + 𝑃 𝐴2 + ⋯ 𝑃 𝐴𝑛 .

Theorem 1.6: For any two events 𝐴 and 𝐵,

𝑃 𝐴∪𝐵 =𝑃 𝐴 +𝑃 𝐵 −𝑃 𝐴∩𝐵 .

Theorem 1.7: For any two events 𝐴 and 𝐵,

𝑃 𝐴 = 𝑃 𝐴 ∩ 𝐵 + 𝑃 𝐴 ∩ 𝐵′ .

Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
Probability Theory for Data Science
Dr. Ishapathik Das, IIT Tirupati

• Let 𝐴 be the event that Chennai is among the final 5, and 𝐵 be

the event that Mumbai is among the final 5.

• Given 𝑃(𝐴)=0.2, 𝑃(𝐵)=0.35, and 𝑃(𝐴∩𝐵)=0.08, we need to find

𝑃(𝐴∪𝐵).

• Using Theorem 1.6, we have: 𝑃(𝐴∪𝐵)=𝑃(𝐴)+𝑃(𝐵)−𝑃(𝐴∩𝐵).

• Substituting the given probabilities:

𝑃(𝐴∪𝐵)=0.20+0.35−0.08=0.47.
Probability Theory for Data
Science
Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
• Imagine you're provided with
information about the potential
outcome of a random experiment before
it occurs.

• How should this information influence

your prediction of the outcome?

• Specifically, how should probabilities be

modified to incorporate this
information?

• Typically, this information is presented

as follows: You're informed that the
outcome falls within a particular event
(i.e., you're notified that a certain event
has taken place).

Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
• Three prisoners, A, B, and C, are confined in jail.
One of them faces execution, while the
remaining two will be released. Prisoner A
inquires of the guard: "One of my fellow
inmates, either B or C, will be granted freedom.
Could you please inform me which one among
them will be set free?”

• After pondering for a moment, the guard

conveyed to A: "If I refrain from informing you,
your probability of facing death stands at 1/3.
However, if I disclose the information, leaving
only two individuals, you become one of the
candidates for execution, thereby increasing
your chance of death to 1/2. Are you truly
inclined to raise your risk of demise?"

Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
Data Science
Probability Theory for

• Let's say we're interested in finding the probability of event 𝐴,

and we've been informed that event 𝐵 has happened.

• In this case, the conditional probability of 𝐴 given 𝐵 is

defined as:

𝑃 𝐴∩𝐵
• 𝑃 𝐴𝐵 = , 𝑖𝑓 𝑃 𝐵 ≠ 0.
𝑃 𝐵
Dr. Ishapathik Das, IIT
Tirupati
Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
If we're informed that event B has taken place, then the sample space is confined
to B. The probability within B must be normalized.

This is accomplished by dividing by P(B). Event A can now only transpire if the
outcome falls within AB. Therefore, the updated probability of A is:

𝑃 𝐴∩𝐵
𝑃 𝐴𝐵 = , 𝑖𝑓 𝑃 𝐵 ≠ 0.
𝑃(𝐵)

A B
𝐴∩𝐵

Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
Roll a fair die once and note the
number facing upward.

Let E denote the event where a 1

appears on the top face.

Let F represent the event where the

number on the top face is odd.

▪ Find P(E).
▪ What is the probability of event E if we are
informed that the number on the top face
is odd, meaning we know that event F has
occurred?
Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
Central concept: The initial sample
space is no longer applicable.

The updated or diminished sample

space is S={1, 3, 5}.
• Observe that the revised sample space
comprises solely of the outcomes in F.

1
P(E occurs given that F occurs) = .
3

Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
Dr. Ishapathik Das, IIT Tirupati

Probability Theory for Data Science

Dr. Ishapathik Das, IIT Tirupati

Two events A and B are said to be independent if

𝑃 𝐴 ∩ 𝐵 = 𝑃 𝐴 𝑃(𝐵)

Probability Theory for Data

Science
Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
If A and B are two independent events and, 𝑃 𝐴 ≠ 0 ≠ 𝑃 𝐵 , then

𝑃 𝐴∩𝐵 𝑃 𝐴 𝑃 𝐵
𝑃 𝐴𝐵 = = =𝑃 𝐴 ,
𝑃(𝐵) 𝑃(𝐵)
and
𝑃 𝐴∩𝐵 𝑃 𝐴 𝑃 𝐵
𝑃 𝐵𝐴 = = =𝑃 𝐵 .
𝑃(𝐴) 𝑃(𝐴)

Therefore, in the scenario of independence, the conditional probability of

an event remains unaffected by the knowledge of another event.

Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
Two events that are mutually Mutually exclusive events
exclusive are only independent exhibit strong dependence
in the specific scenario where otherwise. A and B cannot
either the probability of event A happen simultaneously. If one
equals zero or the probability of event occurs, the other event
event B equals zero. does not.

A B

Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
Dr. Ishapathik Das, IIT Tirupati

Probability Theory for Data Science

Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati
Probability Theory for Data Science Dr. Ishapathik Das, IIT Tirupati

2025 NLP Lecture 02 Mathematical Foundation (New)
No ratings yet
2025 NLP Lecture 02 Mathematical Foundation (New)
100 pages
Probability - and - Statistics 1
No ratings yet
Probability - and - Statistics 1
33 pages
Basic Probability
No ratings yet
Basic Probability
57 pages
EEE 6545 - Lecture 2 Notes - Complete - F2024
No ratings yet
EEE 6545 - Lecture 2 Notes - Complete - F2024
38 pages
Sem 7 - DS - Unit 3
No ratings yet
Sem 7 - DS - Unit 3
95 pages
Introduction To Discrete Probability
No ratings yet
Introduction To Discrete Probability
40 pages
Unit II - Session 1
No ratings yet
Unit II - Session 1
48 pages
Probability Theory
No ratings yet
Probability Theory
61 pages
Lecture 25
100% (1)
Lecture 25
44 pages
Chap3 Probability STAT320
No ratings yet
Chap3 Probability STAT320
25 pages
Lecure-2 Probability-1
No ratings yet
Lecure-2 Probability-1
44 pages
Probability & Queueing Theory Guide
No ratings yet
Probability & Queueing Theory Guide
12 pages
Probability & Statistics For Scientist and Engineers: Dr. M. M. Bhatti
No ratings yet
Probability & Statistics For Scientist and Engineers: Dr. M. M. Bhatti
25 pages
Lec2 Probability
No ratings yet
Lec2 Probability
23 pages
Introduction To Prob. and Counting
No ratings yet
Introduction To Prob. and Counting
38 pages
Leon-Garcia-IPPR - Chapters 1-6
No ratings yet
Leon-Garcia-IPPR - Chapters 1-6
180 pages
Decision Analysis Test Bank
100% (3)
Decision Analysis Test Bank
42 pages
F21 Lecture1
No ratings yet
F21 Lecture1
5 pages
Module 01 PPT Class Final 02-03-2023
No ratings yet
Module 01 PPT Class Final 02-03-2023
67 pages
Week - 4 - Probabilistic Reasoning - Nihad
No ratings yet
Week - 4 - Probabilistic Reasoning - Nihad
52 pages
Introduction To Probability
No ratings yet
Introduction To Probability
61 pages
Machine Learning Probability Basics
No ratings yet
Machine Learning Probability Basics
74 pages
lecture-1-SMMD-11092024-080101pm (2) (Autosaved)
No ratings yet
lecture-1-SMMD-11092024-080101pm (2) (Autosaved)
39 pages
01 Lec RVSP
No ratings yet
01 Lec RVSP
75 pages
QT - Bba - Module Iv - Juraz
No ratings yet
QT - Bba - Module Iv - Juraz
6 pages
Probability
No ratings yet
Probability
28 pages
Unit3 L2
No ratings yet
Unit3 L2
53 pages
Lecture 2
No ratings yet
Lecture 2
40 pages
PA Lec 2 2024
No ratings yet
PA Lec 2 2024
78 pages
Probability and Statistics Syllabus
No ratings yet
Probability and Statistics Syllabus
184 pages
L1 Probability Theory
No ratings yet
L1 Probability Theory
13 pages
Probability
No ratings yet
Probability
33 pages
Cours Chapter1
No ratings yet
Cours Chapter1
12 pages
1.bayes Theorem
No ratings yet
1.bayes Theorem
9 pages
Lec 1
No ratings yet
Lec 1
30 pages
Decision Science MCQS
No ratings yet
Decision Science MCQS
100 pages
Lecture 2
No ratings yet
Lecture 2
9 pages
Unit 4 Part 1
No ratings yet
Unit 4 Part 1
62 pages
Prob 1
No ratings yet
Prob 1
4 pages
Introduction To Probability - Part II: Axioms and A Few Probability Rules
No ratings yet
Introduction To Probability - Part II: Axioms and A Few Probability Rules
13 pages
Chapter 03
No ratings yet
Chapter 03
18 pages
Probability Theory
No ratings yet
Probability Theory
13 pages
Probability
No ratings yet
Probability
138 pages
Probability Concepts and Random Variable - SMTA1402: Unit - I
No ratings yet
Probability Concepts and Random Variable - SMTA1402: Unit - I
105 pages
BS Unit-4
No ratings yet
BS Unit-4
14 pages
01 Lec RVSP 1
No ratings yet
01 Lec RVSP 1
76 pages
STAE Lecture Notes - LU4
No ratings yet
STAE Lecture Notes - LU4
16 pages
BSAD - Lecture PPTs
No ratings yet
BSAD - Lecture PPTs
50 pages
Qa Pastpapers
No ratings yet
Qa Pastpapers
146 pages
NLP - PPT - CH 3
No ratings yet
NLP - PPT - CH 3
69 pages
Probability Basics for Students
No ratings yet
Probability Basics for Students
15 pages
Stat I Chapter 4
No ratings yet
Stat I Chapter 4
29 pages
Chapter 2
No ratings yet
Chapter 2
53 pages
NLP - PPT - CH 4
No ratings yet
NLP - PPT - CH 4
78 pages
Elementary Probability and Statistics
No ratings yet
Elementary Probability and Statistics
25 pages
Probability Theory
No ratings yet
Probability Theory
13 pages
Math Pre Board
No ratings yet
Math Pre Board
5 pages
3 Probability
No ratings yet
3 Probability
33 pages
(International Centre For Mechanical Sciences 140) T. Kailath (Auth.) - Lectures On Wiener and Kalman Filtering-Springer-Verlag Wien (1981) PDF
No ratings yet
(International Centre For Mechanical Sciences 140) T. Kailath (Auth.) - Lectures On Wiener and Kalman Filtering-Springer-Verlag Wien (1981) PDF
189 pages
Probability Theory For Data Science Week 8
No ratings yet
Probability Theory For Data Science Week 8
68 pages
Intro to Random Variables
No ratings yet
Intro to Random Variables
2 pages
Chapter 2 Probability Concepts and Applications
100% (1)
Chapter 2 Probability Concepts and Applications
112 pages
Business Economics PDF
No ratings yet
Business Economics PDF
395 pages
Moment Generating Function Explained - Towards Data Science
No ratings yet
Moment Generating Function Explained - Towards Data Science
8 pages
I Semester Complementary Statistics - Course I Basic Statistics
No ratings yet
I Semester Complementary Statistics - Course I Basic Statistics
4 pages
BSC Statistics
100% (1)
BSC Statistics
40 pages
ProbabilityStatistics Probability
No ratings yet
ProbabilityStatistics Probability
10 pages
Report: Expected Value 100%: Ques On 15
No ratings yet
Report: Expected Value 100%: Ques On 15
4 pages
DL Questions
No ratings yet
DL Questions
2 pages
Lecture - 3 Probability Theory
No ratings yet
Lecture - 3 Probability Theory
25 pages
NLP - PPT - CH 5
No ratings yet
NLP - PPT - CH 5
29 pages
T2 Decision Theory
No ratings yet
T2 Decision Theory
19 pages
Probability Theory Foundations
No ratings yet
Probability Theory Foundations
9 pages
ECE Probability & Random Processes
No ratings yet
ECE Probability & Random Processes
125 pages
Conditional Expectation Lecture
No ratings yet
Conditional Expectation Lecture
17 pages
Advanced Signal Analysis
No ratings yet
Advanced Signal Analysis
24 pages
Suresh Kumar 1-4 Chap Pns Notes
No ratings yet
Suresh Kumar 1-4 Chap Pns Notes
19 pages
JKNJKN
No ratings yet
JKNJKN
72 pages
15ma207 - PQT - Unit I - II - Cycle Test Ix
No ratings yet
15ma207 - PQT - Unit I - II - Cycle Test Ix
30 pages
Decision
No ratings yet
Decision
8 pages
Measurement Uncertainty Guide
No ratings yet
Measurement Uncertainty Guide
50 pages
NLP - PPT - CH 1
No ratings yet
NLP - PPT - CH 1
24 pages
Discrete Random Variable
No ratings yet
Discrete Random Variable
53 pages
Discrete Random Variables Guide
No ratings yet
Discrete Random Variables Guide
17 pages
Probability & Statistics Exam Review
No ratings yet
Probability & Statistics Exam Review
12 pages
SP Module 2 - 3Q - Mean of Discrete Random Variables
No ratings yet
SP Module 2 - 3Q - Mean of Discrete Random Variables
10 pages
CM412 - DL - Model Paper
No ratings yet
CM412 - DL - Model Paper
5 pages
Introduction to Probability Concepts
100% (1)
Introduction to Probability Concepts
281 pages
Chapter 6-8 Sampling and Estimation
No ratings yet
Chapter 6-8 Sampling and Estimation
48 pages
2025-02-04 14-13-20
No ratings yet
2025-02-04 14-13-20
4 pages
0-Cheatsheet Capstone Part 1
No ratings yet
0-Cheatsheet Capstone Part 1
4 pages
Problem Set 7
No ratings yet
Problem Set 7
2 pages
Course Outline Title Probability and Statistics Code MT-205 Credit Hours
No ratings yet
Course Outline Title Probability and Statistics Code MT-205 Credit Hours
7 pages

Probability for Data Science Students

Uploaded by

Probability for Data Science Students

Uploaded by

Probability Theory for Data

Textbook(s): Ross S, A First course in Probability, Prentice Hall of India (2009).

• For example, natural phenomena include weather

• In scientific terms, a phenomenon encompasses

Numerous examples of this can be found in the

Consider predicting the amount of money in a

If you know the initial deposit and the interest rate,

• These phenomena can be divided into two

• For example, when rolling a die, the possible

• Although the outcome of a single roll is

• This regularity is due to the symmetry of a fair die,

Probability Theory for Data

1. Random Experiment: Tossing a coin. All

Dr. Ishapathik Das, IIT Tirupati

2. Random Experiment: Rolling a die. All

• Sample space 𝑺𝟐 = {𝟏, 𝟐, 𝟑, 𝟒, 𝟓, 𝟔}.

• An event E={2, 4, 6}, representing

Let S be a non-empty set. A class 𝑪 of subsets of S is called a

Probability Theory for Data Science

1. Classical approach: If an event can happen in ℎ different ways out of

2. Frequency approach: After conducting an experiment 𝑛 times (where

Both the classical and frequency approaches

The phrase "equally likely" is ambiguous.

The term "large number" is ambiguous.

This has led mathematicians to adopt an

Probability Theory for Data Science

Probability Theory for Data Science

• To each event 𝐴 ∈ 𝐶 in the class 𝐶 of events, we associate a real

• The P is called a probability function, and P(A) is the probability of

• Axiom 2: For certain event 𝑆 ∈ 𝐶, P(S)=1.

• Axiom 3: If 𝐴1 , 𝐴2 , … , 𝐴𝑛 , … are countable collections of pairwise mutually events in the

• In particular, if 𝐴1 and 𝐴2 are two mutually exclusive events in 𝐶,

Theorem 1.2: 𝑃 𝐴 ∈ 0,1 , for any event 𝐴 ∈ 𝐶.

Theorem 1.3: 𝑃 Φ = 0, where Φ is the impossible event.

Theorem 1.4: 𝑃 𝐴′ = 1 − 𝑃 𝐴 , where 𝐴′ is the complement of 𝐴.

Theorem 1.5: Let 𝐴1 , 𝐴2 , … , 𝐴𝑛 be pairwise mutually exclusive events. Then

Theorem 1.6: For any two events 𝐴 and 𝐵,

Theorem 1.7: For any two events 𝐴 and 𝐵,

• Let 𝐴 be the event that Chennai is among the final 5, and 𝐵 be

• Given 𝑃(𝐴)=0.2, 𝑃(𝐵)=0.35, and 𝑃(𝐴∩𝐵)=0.08, we need to find

• Using Theorem 1.6, we have: 𝑃(𝐴∪𝐵)=𝑃(𝐴)+𝑃(𝐵)−𝑃(𝐴∩𝐵).

• Substituting the given probabilities:

• How should this information influence

• Specifically, how should probabilities be

• Typically, this information is presented

• After pondering for a moment, the guard

• Let's say we're interested in finding the probability of event 𝐴,

• In this case, the conditional probability of 𝐴 given 𝐵 is

Let E denote the event where a 1

Let F represent the event where the

The updated or diminished sample

Probability Theory for Data Science

Two events A and B are said to be independent if

Probability Theory for Data

Therefore, in the scenario of independence, the conditional probability of

Probability Theory for Data Science

You might also like