22.08.2025
Statistical Methods in AI (CS7.403)
Lecture-6: Regression
Ravi Kiran S (CVIT)
Sai Kiran B (SPCRC)
IIIT Hyderabad
Figure: taxonomy of learning paradigms – Supervised Learning (Classification, Regression) and Reinforcement Learning.
Regression model
• Regression model
– Explanatory variables: independent variables
– Variables to be explained: dependent variables
Examples
• Independent variable: Price of crude oil
• Dependent variable: Retail price of petrol
• Independent variables: hours of work, education, occupation, sex, age, years of experience etc.
• Dependent variable: Employment income
• Independent variables: Area of house, Population Density
• Dependent variable: Rent or Price of house
• Price of a product and quantity produced or sold:
– Quantity sold affected by price: dependent variable is quantity of product sold; independent variable is price.
– Price affected by quantity offered for sale: dependent variable is price; independent variable is quantity sold.
Figure: Crude Oil price index (1997=100, left axis) and regular gasoline prices, Regina (cents per litre, right axis), monthly, 1981–2008.
Source: CANSIM II Database (Vectors v1576530 and v735048 respectively)
Linear Regression Model
1. Relationship between variables is a linear function

   Yᵢ = β₀ + β₁Xᵢ + εᵢ

   β₀ = Y-intercept, β₁ = slope, εᵢ = random error
   Yᵢ = dependent (response) variable (e.g. salary)
   Xᵢ = independent (explanatory) variable (e.g. years of experience)
Linear Regression Model
   Observed value: Yᵢ = β₀ + β₁Xᵢ + εᵢ
   Regression line: E(Y) = β₀ + β₁Xᵢ
   εᵢ = error (vertical distance of the observed value from the line)

   Figure: scatter of observed values around the line E(Y) = β₀ + β₁Xᵢ in the X–Y plane.
Estimating Parameters: Least Squares Method
Least Squares
1. 'Best fit' means the differences between the actual Y values and the predicted Y values are a minimum. But positive differences offset negative ones, so square the errors!

   ∑ᵢ₌₁ⁿ (Yᵢ − Ŷᵢ)² = ∑ᵢ₌₁ⁿ ε̂ᵢ²

2. LS minimizes the sum of the squared differences (errors), i.e. the SSE.

   Figure: observed points Yᵢ = β₀ + β₁Xᵢ + εᵢ scattered around the line E(Y) = β₀ + β₁Xᵢ, with residuals ε̂₁, ε̂₂, ε̂₃, ε̂₄ marked.
Least Squares Graphically
   LS minimizes ∑ᵢ₌₁ⁿ ε̂ᵢ² = ε̂₁² + ε̂₂² + ε̂₃² + ε̂₄²

   e.g. for the second point: Y₂ = β̂₀ + β̂₁X₂ + ε̂₂

   Figure: the four residuals ε̂₁, ε̂₂, ε̂₃, ε̂₄ shown as vertical distances from the data points to the fitted line, plotted against X.
Derivation of Parameters (1)
Least Squares (L-S): minimize the squared error

   ∑ᵢ₌₁ⁿ εᵢ² = ∑ᵢ₌₁ⁿ (yᵢ − β₀ − β₁xᵢ)²

Setting the partial derivative with respect to β₀ to zero:

   ∂/∂β₀ ∑ εᵢ² = ∂/∂β₀ ∑ (yᵢ − β₀ − β₁xᵢ)² = 0
   −2(nȳ − nβ₀ − nβ₁x̄) = 0
   β̂₀ = ȳ − β̂₁x̄
Derivation of Parameters (2)
Least Squares (L-S): minimize the squared error. Setting the partial derivative with respect to β₁ to zero:

   ∂/∂β₁ ∑ εᵢ² = ∂/∂β₁ ∑ (yᵢ − β₀ − β₁xᵢ)² = 0
   −2 ∑ xᵢ(yᵢ − β₀ − β₁xᵢ) = 0
   −2 ∑ xᵢ(yᵢ − ȳ + β₁x̄ − β₁xᵢ) = 0        (substituting β₀ = ȳ − β₁x̄)
   β₁ ∑ xᵢ(xᵢ − x̄) = ∑ xᵢ(yᵢ − ȳ)
   β₁ ∑ (xᵢ − x̄)(xᵢ − x̄) = ∑ (xᵢ − x̄)(yᵢ − ȳ)
   β̂₁ = SS_xy / SS_xx
Coefficient Equations
Prediction equation:
   ŷᵢ = β̂₀ + β̂₁xᵢ
Sample slope:
   β̂₁ = SS_xy / SS_xx = ∑(xᵢ − x̄)(yᵢ − ȳ) / ∑(xᵢ − x̄)²
Sample Y-intercept:
   β̂₀ = ȳ − β̂₁x̄
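A minimal NumPy sketch of these coefficient equations (the data below are made-up illustrative values, not from the lecture):

```python
import numpy as np

# Illustrative data: years of experience (x) vs. salary (y)
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
y = np.array([3.1, 4.0, 5.2, 5.9, 7.1, 7.8])

x_bar, y_bar = x.mean(), y.mean()

# SS_xy = sum (x_i - x_bar)(y_i - y_bar),  SS_xx = sum (x_i - x_bar)^2
ss_xy = np.sum((x - x_bar) * (y - y_bar))
ss_xx = np.sum((x - x_bar) ** 2)

beta1_hat = ss_xy / ss_xx               # sample slope
beta0_hat = y_bar - beta1_hat * x_bar   # sample Y-intercept

y_pred = beta0_hat + beta1_hat * x      # prediction equation
print(beta0_hat, beta1_hat)
```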
Regression – Error measures
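The figure for this slide is not reproduced in the text. As a hedged sketch, the standard error measures built on the SSE defined earlier can be computed as follows (the function name is my own):

```python
import numpy as np

def regression_errors(y_true, y_pred):
    """Common error measures built on the residuals (a sketch; the
    lecture's exact list of measures is not reproduced in the text)."""
    residuals = y_true - y_pred
    sse = np.sum(residuals ** 2)   # sum of squared errors (SSE)
    mse = sse / len(y_true)        # mean squared error
    rmse = np.sqrt(mse)            # root mean squared error
    return {"SSE": sse, "MSE": mse, "RMSE": rmse}
```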
Linear Regression – Matrix Form
Geometric Interpretation
• Ŷ = Xβ̂ lies in the subspace spanned by the columns of X, i.e. in C(X)
• r = Y − Ŷ = Y − Xβ̂ is orthogonal to C(X), i.e. Xᵀr = 0
• Colab Notebook: https://colab.research.google.com/drive/193OYEJ_-wh_p9Mv8idRruV3ZIbb6EkiP?usp=sharing
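A short NumPy sketch (separate from the linked Colab notebook) of the matrix form: solve the normal equations XᵀXβ̂ = Xᵀy on synthetic data and verify the orthogonality condition Xᵀr = 0:

```python
import numpy as np

rng = np.random.default_rng(0)
n, p = 100, 3
X = np.column_stack([np.ones(n), rng.normal(size=(n, p))])  # intercept column + features
beta_true = np.array([2.0, 1.0, -0.5, 0.3])
y = X @ beta_true + 0.1 * rng.normal(size=n)

# Normal equations: (X^T X) beta_hat = X^T y
beta_hat = np.linalg.solve(X.T @ X, X.T @ y)
# Equivalent and numerically safer: beta_hat, *_ = np.linalg.lstsq(X, y, rcond=None)

y_hat = X @ beta_hat                          # lies in the column space C(X)
r = y - y_hat                                 # residual
print(np.allclose(X.T @ r, 0, atol=1e-8))     # orthogonality: X^T r ~ 0
```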
Linear Regression – Matrix Form – Issues
• N samples, p-dimensional features (what if p > N?)
• Complexity of matrix inversion (what if N is very large?)
Gradient Descent
1. Initialize the parameters β to some random values.
2. Update the parameters using the gradient descent rule:
   β(t+1) = β(t) − η ∇β L(β(t))
3. Repeat 2 until |∇β L(β(t))| is close to 0.
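A minimal sketch of this update rule for a squared-error loss; the 1/n scaling, step size η, and stopping tolerance are implementation assumptions, not from the slides:

```python
import numpy as np

def gradient_descent(X, y, eta=0.1, tol=1e-6, max_iters=10_000):
    """Gradient descent for L(beta) = ||y - X beta||^2 / n.
    The 1/n scaling is a choice that keeps the step size eta stable."""
    n = len(y)
    beta = np.zeros(X.shape[1])                  # 1. initialize (zeros here; random also works)
    for _ in range(max_iters):
        grad = -2.0 / n * X.T @ (y - X @ beta)   # gradient of L at beta(t)
        beta = beta - eta * grad                 # 2. beta(t+1) = beta(t) - eta * grad
        if np.linalg.norm(grad) < tol:           # 3. repeat until |grad| is close to 0
            break
    return beta
```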
Linear Regression
• Linear regression is linear in the coefficients, NOT in the variables (see the sketch below).
Careful: X may not be causing y!
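For instance, a quadratic model y = β₀ + β₁x + β₂x² is non-linear in x but still linear in the coefficients, so ordinary least squares applies directly (illustrative data):

```python
import numpy as np

x = np.linspace(-3, 3, 50)
y = 1.0 + 2.0 * x - 0.5 * x**2 + np.random.default_rng(1).normal(scale=0.3, size=x.size)

# The design matrix [1, x, x^2] makes the model linear in the coefficients.
X = np.column_stack([np.ones_like(x), x, x**2])
beta_hat, *_ = np.linalg.lstsq(X, y, rcond=None)
print(beta_hat)   # approximately [1.0, 2.0, -0.5]
```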
Linear Regression – Outliers
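The slide's figure is not reproduced here; as a small hedged illustration of the issue, a single outlier can noticeably pull the least-squares line (made-up data):

```python
import numpy as np

x = np.arange(10, dtype=float)
y = 2.0 * x + 1.0                       # points exactly on a line
y_out = y.copy()
y_out[-1] += 30.0                       # one large outlier

def fit_line(x, y):
    X = np.column_stack([np.ones_like(x), x])
    return np.linalg.lstsq(X, y, rcond=None)[0]

print(fit_line(x, y))      # ~ [1.0, 2.0]
print(fit_line(x, y_out))  # intercept and slope both shift because of the outlier
```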
Linear Regression is problematic in many other cases
Piecewise Linear Regression
See also: Multivariate Adaptive Regression Splines (MARS)
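One simple way to obtain a piecewise linear fit, in the spirit of MARS hinge basis functions, is to add features max(0, x − knot) and run ordinary least squares; the knot location below is an arbitrary assumption:

```python
import numpy as np

def hinge_design(x, knots):
    """Design matrix [1, x, max(0, x - k) for each knot] for a piecewise linear fit."""
    cols = [np.ones_like(x), x] + [np.maximum(0.0, x - k) for k in knots]
    return np.column_stack(cols)

x = np.linspace(0, 10, 200)
y = np.where(x < 4, 1.0 + 0.5 * x, 3.0 - 1.0 * (x - 4))   # piecewise linear ground truth
y = y + np.random.default_rng(2).normal(scale=0.2, size=x.size)

knots = [4.0]                                    # assumed knot location
X = hinge_design(x, knots)
beta_hat, *_ = np.linalg.lstsq(X, y, rcond=None)
y_hat = X @ beta_hat                             # piecewise linear prediction
```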
Bivariate and multivariate models
• Bivariate or simple regression model:
   (Education) x → y (Income)
• Multivariate or multiple regression model:
   (Education) x₁, (Gender) x₂, (Experience) x₃, (Age) x₄ → y (Income)
• Model with a simultaneous relationship:
   Price of wheat ⇄ Quantity of wheat produced
Types of Regression Models
• 1 explanatory variable → Simple regression (Linear or Non-Linear)
• 2+ explanatory variables → Multiple regression (Linear or Non-Linear)
Test Time
Overfitting
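A hedged sketch of the overfitting phenomenon referred to above: as the polynomial degree grows, training error keeps shrinking while test-time error eventually rises (data, noise level, and degrees are illustrative):

```python
import numpy as np

rng = np.random.default_rng(3)
x_train, x_test = rng.uniform(-1, 1, 20), rng.uniform(-1, 1, 200)
f = lambda x: np.sin(3 * x)
y_train = f(x_train) + rng.normal(scale=0.2, size=x_train.size)
y_test = f(x_test) + rng.normal(scale=0.2, size=x_test.size)

for degree in [1, 3, 9, 15]:
    coeffs = np.polyfit(x_train, y_train, degree)               # least-squares polynomial fit
    mse_train = np.mean((np.polyval(coeffs, x_train) - y_train) ** 2)
    mse_test = np.mean((np.polyval(coeffs, x_test) - y_test) ** 2)
    print(degree, round(mse_train, 3), round(mse_test, 3))
# Training MSE keeps decreasing with degree; test MSE typically rises at high degree (overfitting).
```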