Data Mining
Principal Component Analysis
Linear Regression
CS 584 :: Fall 2024
Ziwei Zhu
Department of Computer Science
George Mason University
Part of these slides is adapted from Dr. Theodora Chaspari.
• HW1 is due next Monday 09/23!
• For the PCA part, be careful about whether 𝑿 ∈ ℝ^(N×D) or 𝑿 ∈ ℝ^(D×N).
• We will have the second quiz next week!
Outline
• Linear Regression definition
• Optimization: closed form solution via ordinary least squares
• Optimization: numerical solution via Gradient Descent
• Non-linear basis function for regression
• Overfitting
Example: Rent Price Prediction
Source: apartments.com
The price is modeled as a linear combination of features:
RentPrice = w0 + w1 × Size + w2 × DistanceFromGMU + …
In the simplest case, with a single feature:
RentPrice = w0 + w1 × Size
More generally, with more features,
RentPrice = w0 + w1 × Size + w2 × DistanceFromGMU + …
with weights w0, w1, w2, … corresponding to the features.
[Figure: rent price as a function of Size and DistanceFromGMU]
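To make this concrete, here is a minimal Python sketch of the prediction step; the weight and feature values are hypothetical, chosen only for illustration:

```python
import numpy as np

# Hypothetical weights, for illustration only: intercept ($),
# $ per square foot, and $ per mile from GMU.
w = np.array([500.0, 1.2, -30.0])

# One apartment: [1 (bias feature), Size = 900 sq ft, DistanceFromGMU = 2 miles].
x = np.array([1.0, 900.0, 2.0])

# RentPrice = w0 + w1 * Size + w2 * DistanceFromGMU, i.e., a dot product.
print(f"Predicted rent: ${w @ x:.2f}")  # 500 + 1080 - 60 = $1520.00
```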
Linear Regression: Definition
Given a feature vector x ∈ ℝ^D and weights w, the model predicts ŷ = wᵀx (with a constant feature 1 absorbing the intercept w0).
How do we determine what a good w is?
Linear Regression: Evaluation
A good w minimizes the difference between the predicted and actual labels (i.e., the prediction error).
Linear Regression: Objective Function
We quantify the prediction error with the Residual Sum of Squares (the objective/loss function). Our goal is to find the solution w* that minimizes the objective/loss function.
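Written out, this is the standard RSS form, with x_i the i-th feature vector and y_i its label:

```latex
% Residual Sum of Squares: squared prediction error summed over N samples
\mathrm{RSS}(\mathbf{w})
  = \sum_{i=1}^{N} \bigl( y_i - \mathbf{w}^{\top}\mathbf{x}_i \bigr)^2
  = \lVert \mathbf{y} - \mathbf{X}\mathbf{w} \rVert_2^2
```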
Next question: how do we solve this optimization problem?
Outline
• Linear Regression definition
• Optimization: closed form solution via ordinary least squares
• Optimization: numerical solution via Gradient Descent
• Non-linear basis function for regression
• Overfitting
Linear Regression: Optimization
[Figure: a convex function vs. a non-convex function]
Where the first-order derivative is 0, we find a local minimum (the global minimum in the convex case). The RSS objective is convex, so setting its gradient with respect to w to zero yields the global minimum. Solving that equation gives the closed-form solution
w* = (XᵀX)⁻¹ Xᵀ y
known as Ordinary Least Squares (OLS).
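A minimal NumPy sketch of this closed-form solution on synthetic data (the data and variable names are mine, not from the slides):

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic data: N samples, D features; the first column is the constant 1.
N, D = 100, 3
X = np.column_stack([np.ones(N), rng.normal(size=(N, D - 1))])
true_w = np.array([2.0, -1.0, 0.5])
y = X @ true_w + 0.1 * rng.normal(size=N)

# OLS: solve the normal equations (X^T X) w = X^T y.
# np.linalg.solve is more stable than explicitly inverting X^T X.
w_star = np.linalg.solve(X.T @ X, X.T @ y)
print(w_star)  # close to [2.0, -1.0, 0.5]
```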
Computational Complexity
For X ∈ ℝ^(N×D), forming XᵀX costs O(ND²) and solving the resulting D×D system costs O(D³), so the closed form becomes expensive for high-dimensional data. This motivates a numerical alternative: gradient descent.
Outline
• Linear Regression definition
• Optimization: closed form solution via ordinary least squares
• Optimization: numerical solution via Gradient Descent
• Non-linear basis function for regression
• Overfitting
Gradient Descent
Start from an initial guess w⁽⁰⁾ and repeatedly step in the direction of the negative gradient, moving downhill on the loss surface toward the global loss minimum.
[Figure: gradient descent steps descending a convex loss curve to the global loss minimum]
Gradient Descent: Algorithm Outline
• Initialize w⁽⁰⁾.
• Repeat: w⁽ᵗ⁺¹⁾ = w⁽ᵗ⁾ − 𝜂 ∇L(w⁽ᵗ⁾).
• Stop when a stopping rule is met.
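A short Python sketch of this outline; the quadratic example function is my own choice for illustration:

```python
import numpy as np

def gradient_descent(grad, w0, lr=0.1, eps=1e-6, max_iters=10_000):
    """Repeat w <- w - lr * grad(w) until the gradient is tiny."""
    w = np.asarray(w0, dtype=float)
    for _ in range(max_iters):
        g = grad(w)
        if np.linalg.norm(g) < eps:  # one possible stopping rule (see below)
            break
        w = w - lr * g
    return w

# Example: minimize f(w) = (w - 3)^2, whose gradient is 2(w - 3).
print(gradient_descent(lambda w: 2 * (w - 3), w0=[0.0]))  # ~[3.0]
```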
Gradient Descent: Two Hyperparameters
Gradient descent has two hyperparameters: the step size (learning rate) 𝜂 and the stopping rule.
Gradient Descent: Step Size/Learning Rate
If 𝜂 is too small, convergence is slow; if 𝜂 is too large, the updates can overshoot the minimum and oscillate or diverge.
Gradient Descent: Stopping Rule
• A common rule: stop when the decrease in the loss (or the norm of the gradient) falls below a small threshold.
• In practice, we can also directly set the number of training epochs as the stopping rule and tune it as a hyper-parameter.
• Or, we can evaluate the model on a validation set after each epoch and stop training once the validation performance no longer improves (sketched below).
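A schematic sketch of that validation-based rule; `train_one_epoch` and `validation_loss` are hypothetical caller-supplied callables, not names from the slides:

```python
def train_with_early_stopping(train_one_epoch, validation_loss,
                              max_epochs=500, patience=10):
    """Stop when validation performance has not improved for `patience` epochs."""
    best, best_epoch = float("inf"), 0
    for epoch in range(max_epochs):
        train_one_epoch()         # one pass over the training data
        loss = validation_loss()  # e.g., RSS on held-out data
        if loss < best:
            best, best_epoch = loss, epoch
        elif epoch - best_epoch >= patience:
            break                 # no improvement for `patience` epochs
    return best
```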
Gradient Descent in Linear Regression
For RSS, the gradient is ∇L(w) = 2 Xᵀ(Xw − y), which sums over all N training samples, so every update touches the whole dataset: intensive computation when N is large. Instead, we can estimate the gradient from a small random subset (a mini-batch) of the data at each step.
In practice, in each epoch we randomly split the whole dataset into mini-batches and iterate over all of them, performing one update per mini-batch (one iteration).
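A minimal NumPy sketch of mini-batch gradient descent for linear regression on synthetic data (the hyperparameter values are illustrative):

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic regression data (first column of X is the constant 1).
N, D = 1000, 3
X = np.column_stack([np.ones(N), rng.normal(size=(N, D - 1))])
y = X @ np.array([2.0, -1.0, 0.5]) + 0.1 * rng.normal(size=N)

w = np.zeros(D)
lr, batch_size, n_epochs = 0.05, 32, 50

for epoch in range(n_epochs):
    perm = rng.permutation(N)              # new random split each epoch
    for start in range(0, N, batch_size):  # one iteration per mini-batch
        idx = perm[start:start + batch_size]
        Xb, yb = X[idx], y[idx]
        grad = 2 * Xb.T @ (Xb @ w - yb) / len(idx)  # RSS gradient on the batch
        w -= lr * grad

print(w)  # approximately [2.0, -1.0, 0.5]
```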
Outline
• Linear Regression definition
• Optimization: closed form solution via ordinary least squares
• Optimization: numerical solution via Gradient Descent
• Non-linear basis function for regression
• Overfitting
Non-Linear Regression
What if the relationship between the features and the label is not linear? A straight line cannot fit such data well, but we can keep the linear regression machinery by transforming the features.
Non-Linear Basis Function
Map each input x to a vector of basis functions
𝜙(x) = [1, x, x², x³, …, x^M]
and fit the weights with OLS on the transformed features. The model wᵀ𝜙(x) is non-linear in x but still linear in w, so the same OLS solution applies.
[Figure: polynomial fits with 𝜙(x) = [1], 𝜙(x) = [1, x], 𝜙(x) = [1, x, x², x³], and 𝜙(x) = [1, x, x², x³, …, x⁹]]
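A minimal NumPy sketch of fitting a polynomial basis with OLS; the sine-shaped target is my own illustrative choice:

```python
import numpy as np

def poly_features(x, M):
    """phi(x) = [1, x, x^2, ..., x^M] for each scalar input in x."""
    return np.vander(np.asarray(x, dtype=float), M + 1, increasing=True)

rng = np.random.default_rng(0)
x = rng.uniform(0, 1, size=30)
y = np.sin(2 * np.pi * x) + 0.1 * rng.normal(size=30)  # a non-linear target

# OLS on the transformed features; lstsq is numerically safer than
# solving the normal equations for high-degree polynomials.
Phi = poly_features(x, M=3)
w, *_ = np.linalg.lstsq(Phi, y, rcond=None)
print(w)
```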
Non-Linear Basis Function
[Figure: the low-degree fits underfit, while the degree-9 fit overfits]
Outline
• Linear Regression definition
• Optimization: closed form solution via ordinary least squares
• Optimization: numerical solution via Gradient Descent
• Non-linear basis function for regression
• Overfitting
Overfitting
Overfitting: the model fits the training data (almost) perfectly but cannot work well for unseen data.
How to Avoid Overfitting?
A widely used remedy, developed next, is regularization: discourage overly complex models by penalizing large weights.
Regularization
Add a penalty on the size of the weights to the objective function, controlled by a coefficient 𝜆; larger 𝜆 pushes the weights toward zero and yields a smoother fit.
[Figure: degree-9 polynomial fits with 𝜆 = 0, 𝜆 = e⁻¹⁸, and 𝜆 = 1]
Tune 𝝀 as a hyper-parameter.
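The slides do not show the exact penalty, so this sketch assumes the common L2 (ridge) form RSS(w) + 𝜆‖w‖², which has a closed-form solution; note how the weight norm shrinks as 𝜆 grows:

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.uniform(0, 1, size=30)
y = np.sin(2 * np.pi * x) + 0.1 * rng.normal(size=30)
Phi = np.vander(x, 10, increasing=True)  # degree-9 polynomial basis

def ridge_fit(X, y, lam):
    """Closed form for RSS(w) + lam * ||w||^2:
    w* = (X^T X + lam * I)^{-1} X^T y."""
    return np.linalg.solve(X.T @ X + lam * np.eye(X.shape[1]), X.T @ y)

for lam in (0.0, np.exp(-18), 1.0):  # the lambda values shown on the slide
    w = ridge_fit(Phi, y, lam)
    print(f"lambda = {lam:.0e}: ||w|| = {np.linalg.norm(w):.2f}")
```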
Moreover …
The number of training steps is another important factor influencing overfitting, so the stopping condition must be chosen carefully.
[Figure: too few training steps underfit, too many overfit]
What have we learned so far
• Linear regression: definition and the RSS objective
• Optimization: the OLS closed-form solution and gradient descent
• Non-linear basis functions, underfitting and overfitting (regularization)