Regression & Regularization
Botao Jiao
Review
• Last week:
• Binary classification applications
• Evaluating classification models
• Artificial neurons
Binary Classification
Distinguish 2 classes
For example, we might want to classify data into categories like yes/no, true/false, or
positive/negative. The key idea here is that there are only two possible outcomes, and our
task is to decide which one the data belongs to.
Evaluation Methods: Confusion Matrix
[Figure: 2x2 confusion matrix — Predicted (Spam, Trusted) vs. Actual (Spam, Trusted)]
• TP (True Positive): the model correctly predicts the positive class. For example, it correctly marks a spam email as spam.
• TN (True Negative): the model correctly predicts the negative class, like marking a non-spam email as not spam.
• FP (False Positive): the model wrongly predicts the positive class, like marking a non-spam email as spam.
• FN (False Negative): the model wrongly predicts the negative class, like marking a spam email as not spam.
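The four counts above can be sketched directly from paired labels; the example emails and labels below are made up (1 = spam/positive, 0 = trusted/negative).

```python
# Hypothetical labels: 1 = spam (positive class), 0 = trusted (negative class)
y_true = [1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 1, 0]

tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)  # spam marked spam
tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)  # trusted marked trusted
fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)  # trusted marked spam
fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)  # spam marked trusted

print(tp, tn, fp, fn)  # 2 2 1 1
```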
Artificial Neuron
Python Machine Learning; Raschka & Mirjalili
Perceptron: Model (Linear Threshold Unit)
By connecting several neurons, we create a network that can process more information and make more
accurate decisions. This is how we build powerful models that can solve difficult problems, like recognizing
images or understanding speech.
Frank Rosenblatt, The perceptron, a perceiving and recognizing automaton Project Para. Cornell Aeronautical Laboratory, 1957
Today’s Topics
• Regression applications
• Evaluating regression models
• Background: notation
• Linear regression
• Polynomial regression
• Regularization (Ridge regression and Lasso regression)
Today’s Focus: Regression
Predict continuous value
Predict Price to Charge for Your Home
Predict Future Stock Price
Predict Credit Score for Loan Lenders
Demo: https://www.youtube.com/watch?time_continue=6&v=0bEJO4Twgu4&feature=emb_logo
https://emerj.com/ai-sector-overviews/artificial-intelligence-applications-lending-loan-management/
What Else to Predict?
• Insurance cost
• Public opinion
• Popularity of social media posts
• Factory analysis
• Call center complaints
• Class ratings
• Weather
• Animal behavior
Classification vs. Regression
Today’s Topics
• Regression applications
• Evaluating regression models
• Background: notation
• Linear regression
• Polynomial regression
• Regularization (Ridge regression and Lasso regression)
Goal: Design Models that Generalize Well to
New, Previously Unseen Examples
Example:
Cost: $1,045,864 $918,000 $450,900 $725,000
1. Split data into a “training set” and a “test set”
Training Data Test Data
Example:
Cost: $1,045,864 $918,000 $450,900 $725,000
2. Train model on “training set” to try to minimize prediction error on it
Training Data
Example:
Cost: $1,045,864 $918,000 $450,900
3. Apply trained model on the “test set” to measure generalization error
Test Data
Example:
Prediction Model
Cost: $725,000
Predicted Cost: ?
Regression Evaluation Metrics
• Mean absolute error: the average of |predicted − actual| over all test samples
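Mean absolute error can be sketched directly from its definition; the true and predicted home prices below are hypothetical.

```python
# Hypothetical actual and predicted home prices
y_true = [1_045_864, 918_000, 450_900]
y_pred = [1_000_000, 950_000, 400_000]

# Mean absolute error: average of |actual - predicted|
mae = sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)
print(round(mae, 2))  # average miss in dollars
```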
Today’s Topics
• Regression applications
• Evaluating regression models
• Background: notation
• Linear regression
• Polynomial regression
• Regularization (Ridge regression and Lasso regression)
Matrices and Vectors
• X : each feature is in its own column and each sample is in its own row
• y : each row is the target value for the sample
Feature 1 Feature 2 Feature M Label
Sample 1: 0.7 100 0.81 0.8
Sample N: 0.5 121 0.3 0.1
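The X/y layout on the slide can be sketched in NumPy, using the two sample rows shown above.

```python
import numpy as np

# Rows are samples, columns are features (layout from the slide's table)
X = np.array([[0.7, 100, 0.81],   # sample 1
              [0.5, 121, 0.30]])  # sample N
y = np.array([0.8, 0.1])          # one target value per row of X

print(X.shape)  # (2, 3): 2 samples, 3 features
```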
Today’s Topics
• Regression applications
• Evaluating regression models
• Background: notation
• Linear regression
• Polynomial regression
• Regularization (Ridge regression and Lasso regression)
Linear Regression Model
• General formula: ŷ = w[0]·x[0] + w[1]·x[1] + … + w[p]·x[p] + b
Feature vector: x = x[0], x[1], …, x[p]
• How many features are there?
• p+1
Parameter vector to learn: w = w[0], w[1], …, w[p], plus bias b
• How many parameters are there?
• p+2 (p+1 weights plus the bias b)
Predicted value: ŷ
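The general formula can be evaluated as a weighted sum plus a bias; the feature values, weights, and bias below are made-up numbers for illustration.

```python
# ŷ = w[0]*x[0] + ... + w[p]*x[p] + b, with p+1 = 3 features here
x = [0.7, 100.0, 0.81]   # hypothetical feature vector
w = [2.0, 0.01, -1.0]    # hypothetical weights (p+1 of them)
b = 0.5                  # bias, giving p+2 parameters in total

y_hat = sum(wi * xi for wi, xi in zip(w, x)) + b
print(y_hat)  # 2*0.7 + 0.01*100 - 1*0.81 + 0.5 = 2.09
```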
“Simple” Linear Regression Model
• Formula (line): ŷ = w·x + b
Feature vector: a single feature x; target: y
• How many features are there?
• 1
Parameter vector to learn: w and b
• How many parameters are there?
• 2 (slope w and intercept b)
[Figure: predicted value ŷ plotted against feature x]
Figure Credit: http://sli.ics.uci.edu/Classes/2015W-273a?action=download&upname=04-linRegress.pdf
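A least-squares fit of this one-feature line can be sketched with NumPy's degree-1 `polyfit`; the data points below are hypothetical.

```python
import numpy as np

# Hypothetical points lying roughly on y = 2x + 1
x = np.array([1.0, 2.0, 3.0, 4.0])
y = np.array([3.1, 4.9, 7.2, 8.8])

# Degree-1 polynomial fit returns [slope, intercept] for ŷ = w*x + b
w, b = np.polyfit(x, y, deg=1)
print(w, b)  # slope 1.94, intercept 1.15
```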
“Multiple” Linear Regression Model
• Formula (plane): ŷ = w[0]·x[0] + w[1]·x[1] + b
Feature vector: x = x[0], x[1]
• How many features are there?
• 2
Parameter vector to learn: w = w[0], w[1], plus bias b
• How many parameters are there?
• 3
[Figure: predicted value ŷ as a plane over features x[0] and x[1]]
Figure Credit: http://sli.ics.uci.edu/Classes/2015W-273a?action=download&upname=04-linRegress.pdf
Linear Regression Model: What to Learn?
• Weight coefficients:
• Indicates how much the predicted value will vary when that feature varies
while holding all the other features constant
Linear Regression Model: Learning Parameters
• Great interactive demo:
https://www.nctm.org/Classroom-Resources/Illuminations/Interactives/Line-of-Best-Fit/
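The parameter learning the demo shows interactively can be sketched as gradient descent on mean squared error; the toy data, learning rate, and iteration count below are arbitrary choices.

```python
import numpy as np

# Toy data generated exactly from y = 2x + 1, so the fit should recover w=2, b=1
x = np.array([0.0, 1.0, 2.0, 3.0])
y = 2.0 * x + 1.0

w, b = 0.0, 0.0
lr = 0.05
for _ in range(5000):
    y_hat = w * x + b
    grad_w = 2 * np.mean((y_hat - y) * x)  # d(MSE)/dw
    grad_b = 2 * np.mean(y_hat - y)        # d(MSE)/db
    w -= lr * grad_w
    b -= lr * grad_b

print(round(w, 3), round(b, 3))  # approaches 2.0 and 1.0
```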
Today’s Topics
• Regression applications
• Evaluating regression models
• Background: notation
• Linear regression
• Polynomial regression
• Regularization (Ridge regression and Lasso regression)
Linear Models: When They Are Not Good
Enough, Increase Representational Capacity
[Figure: fits of increasing capacity — linear equations (lowest capacity), polynomial equations (higher capacity), polynomial equations (highest capacity)]
Polynomial Regression: Transform Features
to Model Non-Linear Relationships
• e.g., (Recall) Formula: ŷ = w·x + b
• e.g., New Formula: ŷ = w[0]·x + w[1]·x² + b
• Still a linear model (linear in the parameters w and b)!
• But can now model more complex relationships!!
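The transformation can be sketched as least squares on expanded features; the quadratic data below is hypothetical, and the transform x → [x, x²] lets a linear solver capture it.

```python
import numpy as np

# Hypothetical data with an exact quadratic relationship y = x^2
x = np.array([-2.0, -1.0, 0.0, 1.0, 2.0])
y = x**2

# Transform features, then solve an ordinary linear least-squares problem
A = np.column_stack([np.ones_like(x), x, x**2])  # columns: intercept, x, x^2
coef, *_ = np.linalg.lstsq(A, y, rcond=None)
print(coef)  # intercept ~0, weight on x ~0, weight on x^2 ~1
```

The model is still linear in `coef`; only the features were made non-linear.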
Polynomial Regression Model:
What Feature Transformation to Use?
• Why does train error shrink and test error grow?
• The higher the polynomial order, the more the model “overfits” the training data, since it can model noise. Models that capture noise generalize poorly to new test data.
• What polynomial order should you use?
Let’s watch a video on the Polynomial Regression
Model to help us understand it better.
Polynomial Regression for Machine Learning
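The train-error-shrinks effect described above can be sketched numerically; the sine data, noise level, and polynomial degrees below are arbitrary choices for illustration.

```python
import numpy as np

# Noisy samples of a smooth function; train and test points are disjoint
rng = np.random.default_rng(0)
x_train = np.linspace(0, 1, 10)
x_test = np.linspace(0.05, 0.95, 10)
f = lambda x: np.sin(2 * np.pi * x)
y_train = f(x_train) + rng.normal(0, 0.2, x_train.size)
y_test = f(x_test) + rng.normal(0, 0.2, x_test.size)

def errors(deg):
    """Train/test mean squared error for a degree-`deg` polynomial fit."""
    coefs = np.polyfit(x_train, y_train, deg)
    train_err = np.mean((np.polyval(coefs, x_train) - y_train) ** 2)
    test_err = np.mean((np.polyval(coefs, x_test) - y_test) ** 2)
    return train_err, test_err

low = errors(3)   # moderate capacity
high = errors(9)  # degree 9 can interpolate all 10 training points
print(low, high)  # training error shrinks as degree grows; test error can grow
```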