Chapter 5
Regularized regression

EEE 485/585 Statistical Learning and Data Analytics

Cem Tekin
Bilkent University

Cannot be distributed outside this class without the permission of the instructor.

Outline: Regularization, Ridge regression, Lasso
Regularization

Properties of the least squares estimate:
- When the relation between Y and X = [X_1, ..., X_p]^T is almost linear, the least squares estimate has low bias.
- But it can have high variance, e.g., when p ≈ n or p > n.
- Shrinking the regression coefficients can result in a better fit.

Regularization: reducing the complexity of linear regression
- Ridge regression
- Lasso
Two methods for regularization

Ordinary least squares:

RSS(\beta) = \sum_{i=1}^n \Big( y_i - \beta_0 - \sum_{j=1}^p \beta_j x_{ij} \Big)^2

Ridge regression (penalizes the squares of the parameters):

Loss_R(\beta, \lambda) = \sum_{i=1}^n \Big( y_i - \beta_0 - \sum_{j=1}^p \beta_j x_{ij} \Big)^2 + \lambda \sum_{j=1}^p \beta_j^2 = RSS(\beta) + \lambda \sum_{j=1}^p \beta_j^2

Lasso (penalizes the absolute values of the parameters):

Loss_L(\beta, \lambda) = \sum_{i=1}^n \Big( y_i - \beta_0 - \sum_{j=1}^p \beta_j x_{ij} \Big)^2 + \lambda \sum_{j=1}^p |\beta_j| = RSS(\beta) + \lambda \sum_{j=1}^p |\beta_j|
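As a concrete illustration, here is a minimal NumPy sketch of these three objectives. The names X, y, beta0, beta, and lam are hypothetical placeholders, not from the slides:

```python
import numpy as np

def rss(X, y, beta0, beta):
    """Residual sum of squares for the linear model y ~ beta0 + X @ beta."""
    residuals = y - beta0 - X @ beta
    return np.sum(residuals ** 2)

def ridge_loss(X, y, beta0, beta, lam):
    """RSS plus the ridge (squared) penalty; note beta0 is not penalized."""
    return rss(X, y, beta0, beta) + lam * np.sum(beta ** 2)

def lasso_loss(X, y, beta0, beta, lam):
    """RSS plus the lasso (absolute value) penalty; beta0 is not penalized."""
    return rss(X, y, beta0, beta) + lam * np.sum(np.abs(beta))
```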
Ridge regression

Loss_R(\beta, \lambda) = \sum_{i=1}^n \Big( y_i - \beta_0 - \sum_{j=1}^p \beta_j x_{ij} \Big)^2 + \underbrace{\lambda}_{\text{tuning parameter}} \underbrace{\sum_{j=1}^p \beta_j^2}_{\text{penalty}}

\hat{\beta}^R = \arg\min_\beta Loss_R(\beta, \lambda)

What happens when
- \lambda \to 0: the penalty vanishes and ridge reduces to ordinary least squares.
- \lambda \to \infty: the penalty dominates and the coefficients shrink toward zero.

How to select \lambda? Use CV to select \lambda. Note that \beta_0 is not penalized.
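One common way to select λ by cross-validation, sketched with scikit-learn's RidgeCV. The data and the alpha grid below are arbitrary illustrations (scikit-learn calls the tuning parameter alpha):

```python
import numpy as np
from sklearn.linear_model import RidgeCV

# Hypothetical data, for illustration only.
rng = np.random.default_rng(0)
X = rng.standard_normal((100, 5))
y = X @ np.array([3.0, 0.0, -2.0, 0.0, 1.0]) + rng.standard_normal(100)

# RidgeCV fits ridge regression for each candidate penalty value and
# keeps the one with the best cross-validated error.
model = RidgeCV(alphas=np.logspace(-3, 3, 13), cv=5).fit(X, y)
print("selected penalty:", model.alpha_)
print("coefficients:", model.coef_)
```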
Example - Credit card balance prediction

Y = card balance
X = (income, limit, rating, student, ...)

Lines show the estimated regression coefficients \hat{\beta}^R_\lambda obtained by ridge regression.

[Figure: standardized ridge coefficients for income, limit, rating, and student, plotted against \lambda (left) and against \|\hat{\beta}^R_\lambda\|_2 / \|\hat{\beta}\|_2 (right).]

Figure from "An Introduction to Statistical Learning" by James et al.
Scale invariance

Least squares linear regression is scale invariant.
Is ridge regression scale invariant? No: rescaling a predictor (e.g., income measured in thousands, x_i^{new} = 0.001 x_i^{old}) changes the fitted ridge coefficients, because the penalty depends on the scale of each coefficient.

Making ridge regression fair: standardize the predictors.

\tilde{x}_{ij} = \frac{x_{ij} - \bar{x}_j}{\sqrt{\frac{1}{n} \sum_{i=1}^n (x_{ij} - \bar{x}_j)^2}}

where \bar{x}_j = \frac{1}{n} \sum_{i=1}^n x_{ij} is the average value of feature j.

Properties of standardized predictors:
- \frac{1}{n} \sum_{i=1}^n \tilde{x}_{ij} = 0 (zero mean)
- \frac{1}{n} \sum_{i=1}^n \tilde{x}_{ij}^2 = 1 (unit variance)

Also center the response: \tilde{y}_i = y_i - \bar{y}, where \bar{y} = \frac{1}{n} \sum_{i=1}^n y_i.
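A minimal NumPy sketch of this preprocessing step (variable names are illustrative):

```python
import numpy as np

def standardize(X, y):
    """Center y and standardize each column of X to zero mean, unit variance.

    Uses the 1/n (population) standard deviation, matching the formula above.
    """
    x_bar = X.mean(axis=0)
    x_std = X.std(axis=0)          # np.std uses the 1/n convention by default
    X_tilde = (X - x_bar) / x_std
    y_tilde = y - y.mean()
    return X_tilde, y_tilde
```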
Bias-variance tradeoff

In general, the expected prediction error decomposes into squared bias plus variance (plus irreducible noise); the tuning parameter \lambda trades one off against the other.

[Figure: bias (black), variance (green), and MSE (red) for ridge regression, plotted against \lambda (left) and against \|\hat{\beta}^R_\lambda\|_2 / \|\hat{\beta}\|_2 (right); small \lambda overfits (high variance), large \lambda underfits (high bias).]

MSE := \frac{1}{n} \sum_{i=1}^n \big( y_i - \hat{f}(x_i) \big)^2

Figure from "An Introduction to Statistical Learning" by James et al.
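To make the tradeoff concrete, here is a toy simulation sketch, assuming an entirely synthetic linear model; it uses the closed-form ridge solution derived on the next slides and estimates squared bias and variance at one test point by refitting on fresh training sets:

```python
import numpy as np

rng = np.random.default_rng(1)
n, p = 50, 20
beta_true = np.zeros(p)
beta_true[:5] = [2.0, -1.0, 1.5, 0.5, -2.0]

def ridge_fit(X, y, lam):
    # Closed-form ridge solution (derived on the next slides).
    return np.linalg.solve(X.T @ X + lam * np.eye(p), X.T @ y)

lams = np.logspace(-2, 3, 6)
x0 = rng.standard_normal(p)               # one fixed test input
preds = {lam: [] for lam in lams}
for _ in range(200):                      # fresh training set each round
    X = rng.standard_normal((n, p))
    y = X @ beta_true + rng.standard_normal(n)
    for lam in lams:
        preds[lam].append(x0 @ ridge_fit(X, y, lam))

f0 = x0 @ beta_true                       # true mean response at x0
for lam in lams:
    ph = np.array(preds[lam])
    print(f"lam={lam:8.2f}  bias^2={(ph.mean() - f0)**2:.3f}  var={ph.var():.3f}")
```

As λ grows, the printed variance shrinks while the squared bias grows, reproducing the shape of the curves in the figure.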
How to solve ridge regression?

Loss_R(\beta, \lambda) = \sum_{i=1}^n \Big( y_i - \beta_0 - \sum_{j=1}^p \beta_j x_{ij} \Big)^2 + \lambda \sum_{j=1}^p \beta_j^2

\hat{\beta}^R = \arg\min_\beta Loss_R(\beta, \lambda)

- Center the predictors and the response (centering makes the intercept \hat{\beta}_0^R = 0).
- Standardize the predictors.

In matrix form (with y and X centered):

Loss_R(\beta, \lambda) = (y - X\beta)^T (y - X\beta) + \lambda \beta^T \beta = y^T y - 2 y^T X \beta + \beta^T X^T X \beta + \lambda \beta^T \beta

Setting the gradient with respect to \beta to zero:

-2 X^T y + 2 X^T X \beta + 2 \lambda \beta = 0 \;\Rightarrow\; (X^T X + \lambda I) \beta = X^T y
How to solve ridge regression?

Some notation (y and X centered):

y = \begin{bmatrix} y_1 \\ y_2 \\ \vdots \\ y_n \end{bmatrix}, \quad \beta = \begin{bmatrix} \beta_1 \\ \beta_2 \\ \vdots \\ \beta_p \end{bmatrix}, \quad X = \begin{bmatrix} x_{11} & x_{12} & \cdots & x_{1p} \\ x_{21} & x_{22} & \cdots & x_{2p} \\ \vdots & \vdots & \ddots & \vdots \\ x_{n1} & x_{n2} & \cdots & x_{np} \end{bmatrix}

Linear algebra and matrix calculus give:

\hat{\beta}^R = (X^T X + \lambda I)^{-1} X^T y

Hence, given a new (centered and scaled) input x, the (centered) prediction is \hat{y} = x^T \hat{\beta}^R.

Compare with the least squares solution:

\hat{\beta}^{RSS} = (X^T X)^{-1} X^T y
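A minimal NumPy sketch of this closed-form solution (illustrative names; np.linalg.solve is used rather than an explicit matrix inverse for numerical stability):

```python
import numpy as np

def ridge_closed_form(X, y, lam):
    """Solve (X^T X + lam*I) beta = X^T y for centered, standardized X and y."""
    p = X.shape[1]
    return np.linalg.solve(X.T @ X + lam * np.eye(p), X.T @ y)

def predict(x_new, beta_hat):
    """Centered prediction for a new (centered and scaled) input."""
    return x_new @ beta_hat
```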
Advantage of ridge regression

- Reduces variance.
- X^T X + \lambda I, \lambda > 0, is invertible even when X^T X is not invertible (e.g., when p > n or the predictors are collinear).

Figure from http://stats.stackexchange.com
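A quick sketch demonstrating this point on synthetic p > n data, where X^T X is singular but the ridge system still solves:

```python
import numpy as np

rng = np.random.default_rng(0)
n, p = 10, 20                      # p > n, so the p x p matrix X^T X has rank <= n < p
X = rng.standard_normal((n, p))
y = rng.standard_normal(n)

print(np.linalg.matrix_rank(X.T @ X))   # at most 10 < 20: singular
lam = 1.0
beta = np.linalg.solve(X.T @ X + lam * np.eye(p), X.T @ y)  # still solvable
print(beta.shape)
```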
Disadvantage of ridge regression

Coefficients will be small, but almost all of them will still be nonzero: ridge shrinks coefficients toward zero without setting them exactly to zero, so it does not perform variable selection.
Lasso (least absolute shrinkage and selection operator)

Loss_L(\beta, \lambda) = \sum_{i=1}^n \Big( y_i - \beta_0 - \sum_{j=1}^p \beta_j x_{ij} \Big)^2 + \lambda \sum_{j=1}^p |\beta_j|

\hat{\beta}^L = \arg\min_\beta Loss_L(\beta, \lambda)

No closed-form solution (in general).

What happens when
- \lambda \to 0: lasso reduces to ordinary least squares.
- \lambda \to \infty: all coefficients are driven to zero (\hat{\beta}^L \to 0).
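Since there is no closed form, the lasso is solved numerically; scikit-learn's Lasso uses coordinate descent. A hedged sketch on made-up data (note that scikit-learn scales its objective as (1/(2n))·RSS + alpha·||β||_1, so its alpha is a rescaled version of the λ above):

```python
import numpy as np
from sklearn.linear_model import Lasso

# Hypothetical data: only 2 of 10 predictors truly matter.
rng = np.random.default_rng(0)
X = rng.standard_normal((200, 10))
y = 3 * X[:, 0] - 2 * X[:, 1] + rng.standard_normal(200)

model = Lasso(alpha=0.1).fit(X, y)   # solved internally by coordinate descent
print(model.coef_)                    # most entries come out exactly zero
```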
Example - Credit card balance prediction

Y = card balance
X = (income, limit, rating, student, ...)

Lines show the estimated regression coefficients \hat{\beta}^L_\lambda obtained by lasso.
Lasso performs variable selection (results in a sparse model).

[Figure: standardized lasso coefficients for income, limit, rating, and student, plotted against \lambda (left) and against \|\hat{\beta}^L_\lambda\|_1 / \|\hat{\beta}\|_1 (right).]

Figure from "An Introduction to Statistical Learning" by James et al.
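A quick way to see the selection effect in code, using the same hypothetical setup as the previous sketch: compare nonzero coefficient counts for ridge and lasso fits (the penalty values are arbitrary illustrations):

```python
import numpy as np
from sklearn.linear_model import Lasso, Ridge

rng = np.random.default_rng(0)
X = rng.standard_normal((200, 10))
y = 3 * X[:, 0] - 2 * X[:, 1] + rng.standard_normal(200)

ridge = Ridge(alpha=10.0).fit(X, y)
lasso = Lasso(alpha=0.1).fit(X, y)
print("ridge nonzeros:", np.sum(ridge.coef_ != 0))   # typically all 10
print("lasso nonzeros:", np.sum(lasso.coef_ != 0))   # typically just a few
```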
Ridge regression and lasso as constrained minimization problems

Ridge:

\text{minimize}_\beta \; \sum_{i=1}^n \Big( y_i - \beta_0 - \sum_{j=1}^p \beta_j x_{ij} \Big)^2 \quad \text{subject to} \quad \sum_{j=1}^p \beta_j^2 \le s

Lasso:

\text{minimize}_\beta \; \sum_{i=1}^n \Big( y_i - \beta_0 - \sum_{j=1}^p \beta_j x_{ij} \Big)^2 \quad \text{subject to} \quad \sum_{j=1}^p |\beta_j| \le s

For each s in the constrained minimization problem there is a corresponding \lambda in the equivalent unconstrained minimization problem. What differs between the two methods is the geometry of the constraint region.
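To illustrate the constrained form directly, here is a sketch that solves the s-constrained lasso problem with scipy.optimize.minimize (SLSQP supports inequality constraints). The data and the choice of s are illustrative, and SLSQP handles the nonsmooth absolute-value constraint only approximately, so this is a didactic sketch rather than a production solver:

```python
import numpy as np
from scipy.optimize import minimize

rng = np.random.default_rng(0)
X = rng.standard_normal((100, 5))
y = X @ np.array([2.0, -1.0, 0.0, 0.0, 0.5]) + rng.standard_normal(100)

def rss(beta):
    return np.sum((y - X @ beta) ** 2)

s = 2.0  # budget on the L1 norm of the coefficients
cons = {"type": "ineq", "fun": lambda beta: s - np.sum(np.abs(beta))}

res = minimize(rss, x0=np.zeros(5), method="SLSQP", constraints=cons)
print(res.x)  # constrained solution; compare with an unconstrained fit
```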
Geometric interpretation

[Figure: RSS contours together with the lasso (diamond) and ridge (disk) constraint regions in the (\beta_1, \beta_2) plane.]

- Red lines: error contours for RSS (same error for all \beta values on the same contour).
- \hat{\beta}: least squares solution.
- Blue areas: constraint regions, |\beta_1| + |\beta_2| \le s for lasso or \beta_1^2 + \beta_2^2 \le s for ridge. The lasso region has corners on the axes, which is why the constrained solution often has some coefficients exactly zero.

Figure from "An Introduction to Statistical Learning" by James et al.