Statistics for Data Science - 2
Week 3 Notes
Expected value
• Expected value of a random variable
Definition: Suppose X is a discrete random variable with range T_X and PMF f_X. The
expected value of X, denoted E[X], is defined as

E[X] = Σ_{t ∈ T_X} t P(X = t),

assuming the above sum exists.
The expected value represents the “center” of a random variable.
1. A constant c can be viewed as a random variable X with P(X = c) = 1, so that
E[c] = c × 1 = c.
2. If X takes only non-negative values, i.e. P(X ≥ 0) = 1, then E[X] ≥ 0.
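As a quick numerical illustration, the defining sum can be evaluated directly from a PMF. A minimal sketch in Python, using a made-up fair-die PMF (the names pmf and expected_value are illustrative, not from the course):

```python
# Minimal sketch: computing E[X] directly from its PMF. The fair-die
# PMF below is a made-up example, not from the notes.

def expected_value(pmf):
    """E[X] = sum over t in T_X of t * P(X = t)."""
    return sum(t * p for t, p in pmf.items())

pmf = {t: 1 / 6 for t in range(1, 7)}   # P(X = t) = 1/6 for t = 1..6

print(expected_value(pmf))         # 3.5, the "center" of X
print(expected_value({4: 1.0}))    # a constant c = 4: E[c] = 4.0
```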
• Expected value of a function of random variables
Suppose X_1, . . . , X_n have joint PMF f_{X_1...X_n}, with the range of X_i denoted T_{X_i}. Let

g : T_{X_1} × . . . × T_{X_n} → R

be a function, and let Y = g(X_1, . . . , X_n) have range T_Y and PMF f_Y. Then,

E[g(X_1, . . . , X_n)] = Σ_{t ∈ T_Y} t f_Y(t) = Σ_{t_i ∈ T_{X_i}} g(t_1, . . . , t_n) f_{X_1...X_n}(t_1, . . . , t_n)
• Linearity of Expected value:
1. E[cX] = cE[X] for a random variable X and a constant c.
2. E[X + Y ] = E[X] + E[Y ] for any two random variables X, Y .
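The double sum above means E[g(X_1, . . . , X_n)] can be computed from the joint PMF without first deriving the PMF of Y. A minimal sketch, assuming a made-up joint PMF on {0, 1} × {0, 1}; it also verifies linearity, E[X + Y] = E[X] + E[Y], even though this X and Y are dependent:

```python
# Sketch: E[g(X, Y)] computed from a hypothetical joint PMF, without
# deriving the PMF of g(X, Y) first. The joint PMF below is made up.

joint_pmf = {  # (x, y) -> P(X = x, Y = y); X and Y are dependent here
    (0, 0): 0.4, (0, 1): 0.1,
    (1, 0): 0.2, (1, 1): 0.3,
}

def expectation(g, joint):
    """E[g(X, Y)] = sum over (x, y) of g(x, y) * f_XY(x, y)."""
    return sum(g(x, y) * p for (x, y), p in joint.items())

E_X = expectation(lambda x, y: x, joint_pmf)
E_Y = expectation(lambda x, y: y, joint_pmf)
E_sum = expectation(lambda x, y: x + y, joint_pmf)

# Linearity holds even though X and Y are dependent:
print(E_sum, E_X + E_Y)   # both 0.9
```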
• Zero-mean random variable:
A random variable X with E[X] = 0 is said to be a zero-mean random variable.
• Variance and Standard deviation:
Definition: The variance of a random variable X, denoted by Var(X), is defined as
Var(X) = E[(X − E[X])2 ]
Variance measures the spread about the expected value.
The variance of X is also given by Var(X) = E[X²] − (E[X])².
The standard deviation of X, denoted by SD(X), is defined as

SD(X) = +√Var(X)

The units of SD(X) are the same as the units of X.
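Both expressions for the variance give the same number. A minimal sketch checking this on a made-up PMF (the helper E is illustrative):

```python
# Sketch: Var(X) computed two ways on a hypothetical PMF; both agree.

pmf = {0: 0.5, 1: 0.3, 2: 0.2}   # made-up PMF with E[X] = 0.7

def E(g, pmf):
    """E[g(X)] for a discrete X with the given PMF."""
    return sum(g(t) * p for t, p in pmf.items())

mu = E(lambda t: t, pmf)
var_def = E(lambda t: (t - mu) ** 2, pmf)       # E[(X - E[X])^2]
var_alt = E(lambda t: t ** 2, pmf) - mu ** 2    # E[X^2] - (E[X])^2
sd = var_def ** 0.5                             # same units as X

print(mu, var_def, var_alt, sd)   # var_def == var_alt == 0.61
```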
• Properties: Scaling and translation
Let X be a random variable. Let a be a constant real number.
1. Var(aX) = a2 Var(X)
2. SD(aX) = |a| SD(X)
3. Var(X + a) = Var(X)
4. SD(X + a) = SD(X)
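These identities can be verified by transforming the support of X and recomputing. A minimal sketch on the same kind of made-up PMF:

```python
# Sketch: checking Var(aX) = a^2 Var(X) and Var(X + a) = Var(X)
# on a hypothetical PMF, by transforming the support of X.

pmf = {0: 0.5, 1: 0.3, 2: 0.2}   # made-up PMF
a = -3.0

def var(pmf):
    mu = sum(t * p for t, p in pmf.items())
    return sum((t - mu) ** 2 * p for t, p in pmf.items())

scaled = {a * t: p for t, p in pmf.items()}     # PMF of aX
shifted = {t + a: p for t, p in pmf.items()}    # PMF of X + a

print(var(scaled), a ** 2 * var(pmf))           # equal
print(var(shifted), var(pmf))                   # equal: shifts don't spread
print(var(scaled) ** 0.5, abs(a) * var(pmf) ** 0.5)   # SD(aX) = |a| SD(X)
```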
• Sum and product of independent random variables
1. For any two random variables X and Y (independent or dependent), E[X + Y] = E[X] + E[Y].
2. If X and Y are independent random variables,
(a) E[XY ] = E[X]E[Y ]
(b) Var(X + Y ) = Var(X) + Var(Y )
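For independent X and Y the joint PMF factorizes into the product of the marginals, which is what drives both identities. A minimal sketch with made-up marginal PMFs:

```python
# Sketch: for independent X and Y (joint PMF = product of marginals),
# E[XY] = E[X]E[Y] and Var(X + Y) = Var(X) + Var(Y). Marginals are made up.

pmf_X = {0: 0.5, 1: 0.5}
pmf_Y = {1: 0.2, 2: 0.8}

# Build the joint PMF of an independent pair.
joint = {(x, y): px * py for x, px in pmf_X.items() for y, py in pmf_Y.items()}

def E(g, joint):
    return sum(g(x, y) * p for (x, y), p in joint.items())

E_X, E_Y = E(lambda x, y: x, joint), E(lambda x, y: y, joint)
print(E(lambda x, y: x * y, joint), E_X * E_Y)   # equal: 0.9 and 0.9

var_sum = E(lambda x, y: (x + y) ** 2, joint) - E(lambda x, y: x + y, joint) ** 2
var_X = E(lambda x, y: x ** 2, joint) - E_X ** 2
var_Y = E(lambda x, y: y ** 2, joint) - E_Y ** 2
print(var_sum, var_X + var_Y)                    # equal: 0.41 and 0.41
```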
• Standardised random variables:
1. Definition: A random variable X is said to be standardised if E[X] = 0 and Var(X) = 1.
2. Let X be a random variable with SD(X) > 0. Then, Y = (X − E[X]) / SD(X) is a standardised random variable.
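Standardising is a translation followed by a scaling, so the earlier properties give E[Y] = 0 and Var(Y) = 1. A minimal sketch on a made-up PMF:

```python
# Sketch: standardising a hypothetical X. Y = (X - E[X]) / SD(X)
# has E[Y] = 0 and Var(Y) = 1 (up to floating-point rounding).

pmf = {0: 0.5, 1: 0.3, 2: 0.2}   # made-up PMF

mu = sum(t * p for t, p in pmf.items())
sd = sum((t - mu) ** 2 * p for t, p in pmf.items()) ** 0.5

pmf_Y = {(t - mu) / sd: p for t, p in pmf.items()}   # PMF of standardised Y

mu_Y = sum(t * p for t, p in pmf_Y.items())
var_Y = sum((t - mu_Y) ** 2 * p for t, p in pmf_Y.items())
print(mu_Y, var_Y)   # approximately 0 and 1
```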
• Covariance:
Definition: Suppose X and Y are random variables on the same probability space. The
covariance of X and Y , denoted as Cov(X, Y ), is defined as
Cov(X, Y ) = E[(X − E[X])(Y − E[Y ])]
It summarizes the relationship between two random variables.
Properties:
1. Cov(X, X) = Var(X)
2. Cov(X, Y ) = E[XY ] − E[X]E[Y ]
3. Covariance is symmetric: Cov(X, Y) = Cov(Y, X)
4. Covariance is “linear” in each argument.
(a) Cov(X, aY + bZ) = aCov(X, Y ) + bCov(X, Z)
(b) Cov(aX + bY, Z) = aCov(X, Z) + bCov(Y, Z)
5. Independence: If X and Y are independent, then X and Y are uncorrelated, i.e.
Cov(X, Y ) = 0
6. The converse does not hold: if X and Y are uncorrelated, they may still be dependent.
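Properties 2 and 6 can both be seen on the classic example X uniform on {−1, 0, 1} with Y = X²: the shortcut formula gives Cov(X, Y) = 0, yet Y is a function of X. A minimal sketch:

```python
# Sketch: covariance from a joint PMF, plus the classic example of
# uncorrelated-but-dependent variables: X uniform on {-1, 0, 1}, Y = X^2.

joint = {(x, x * x): 1 / 3 for x in (-1, 0, 1)}   # hypothetical joint PMF

def E(g, joint):
    return sum(g(x, y) * p for (x, y), p in joint.items())

E_X, E_Y = E(lambda x, y: x, joint), E(lambda x, y: y, joint)
cov = E(lambda x, y: (x - E_X) * (y - E_Y), joint)    # definition
cov_alt = E(lambda x, y: x * y, joint) - E_X * E_Y    # shortcut formula

print(cov, cov_alt)   # both 0: X and Y are uncorrelated...
# ...yet clearly dependent: Y is a deterministic function of X,
# e.g. P(Y = 1 | X = 1) = 1 while P(Y = 1) = 2/3.
```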
• Correlation coefficient:
Definition: The correlation coefficient or correlation of two random variables X and Y, denoted by ρ(X, Y), is defined as

ρ(X, Y) = Cov(X, Y) / (SD(X) SD(Y))
1. −1 ≤ ρ(X, Y ) ≤ 1.
2. ρ(X, Y ) summarizes the trend between random variables.
3. ρ(X, Y ) is a dimensionless quantity.
4. If ρ(X, Y ) is close to zero, there is no clear linear trend between X and Y .
5. If ρ(X, Y ) = 1 or ρ(X, Y ) = −1, Y is a linear function of X.
6. If |ρ(X, Y)| is close to one, X and Y are strongly correlated.
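A minimal sketch computing ρ(X, Y) from a made-up joint PMF, together with the extreme case where Y is a linear function of X (the helper rho is illustrative):

```python
# Sketch: correlation coefficient from a hypothetical joint PMF, and the
# extreme case rho = -1 when Y is a decreasing linear function of X.

def rho(joint):
    """Correlation coefficient from a joint PMF {(x, y): probability}."""
    def E(g):
        return sum(g(x, y) * p for (x, y), p in joint.items())
    E_X, E_Y = E(lambda x, y: x), E(lambda x, y: y)
    cov = E(lambda x, y: x * y) - E_X * E_Y
    sd_X = (E(lambda x, y: x * x) - E_X ** 2) ** 0.5
    sd_Y = (E(lambda x, y: y * y) - E_Y ** 2) ** 0.5
    return cov / (sd_X * sd_Y)

# Made-up dependent pair with a mild positive trend.
joint = {(0, 0): 0.4, (0, 1): 0.1, (1, 0): 0.2, (1, 1): 0.3}
print(rho(joint))            # ~0.41, a dimensionless quantity

# Y = -2X + 5 is a decreasing linear function of X, so rho = -1.
linear = {(x, -2 * x + 5): 0.25 for x in range(4)}
print(rho(linear))           # -1.0 (up to rounding)
```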
• Bounds on probabilities using mean and variance
1. Markov’s inequality: Let X be a discrete random variable taking non-negative
values with a finite mean µ. Then, for any c > 0,

P(X ≥ c) ≤ µ / c

Through Markov’s inequality, the mean µ bounds the probability that a non-negative
random variable takes values much larger than its mean; a numerical check of both
inequalities appears after this list.
2. Chebyshev’s inequality: Let X be a discrete random variable with a finite mean
µ and a finite variance σ². Then, for any k > 0,

P(|X − µ| ≥ kσ) ≤ 1/k²

Other forms:
(a) P(|X − µ| ≥ c) ≤ σ²/c², and P((X − µ)² > k²σ²) ≤ 1/k²
(b) P(µ − kσ < X < µ + kσ) ≥ 1 − 1/k²

Through Chebyshev’s inequality, the mean µ and standard deviation σ bound the
probability that X deviates from µ by kσ or more.
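A minimal numerical check of both inequalities on a made-up non-negative PMF; the bounds hold but are typically loose:

```python
# Sketch: checking Markov's and Chebyshev's inequalities on a hypothetical
# non-negative PMF. The bounds hold, though they are usually not tight.

pmf = {0: 0.4, 1: 0.3, 2: 0.2, 10: 0.1}   # made-up, non-negative support

mu = sum(t * p for t, p in pmf.items())
sigma = sum((t - mu) ** 2 * p for t, p in pmf.items()) ** 0.5

# Markov: P(X >= c) <= mu / c for c > 0.
c = 5.0
p_tail = sum(p for t, p in pmf.items() if t >= c)
print(p_tail, mu / c)        # 0.1 <= 0.34

# Chebyshev: P(|X - mu| >= k*sigma) <= 1/k^2.
k = 2.0
p_dev = sum(p for t, p in pmf.items() if abs(t - mu) >= k * sigma)
print(p_dev, 1 / k ** 2)     # 0.1 <= 0.25
```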