About Me
Email: [email protected] Office Hours: Wednesday 19.00-20.00
Agenda
• Supervised Learning (Regression, Classification)
• Unsupervised Learning (Clustering)
• Cost Function
• Learning Rate
• Gradient Descent
• Batch Gradient Descent
What is Machine Learning?
Machine Learning Algorithms
Supervised learning
Unsupervised learning
Recommender systems
Reinforcement learning
Supervised Learning
Regression · Classification
Supervised Learning
[Diagram: labeled training data (circles, triangles, squares) is fed to a model; the trained model then predicts the label of unseen test data, e.g. "square" or "circle".]
Regression: Housing Price Prediction
[Plot: price vs. house size (feet^2), with a linear fit through the data.]
Classification: Cancer Detection
[Plot: tumor size x (cm) on the horizontal axis; each example is labeled benign or malignant.]
Classification: Cancer Detection
[Plot: tumor size x (cm); three classes: benign, malignant type 1, malignant type 2.]
Two or More Inputs
[Plot: two input features, x_1 = tumor size and x_2 = age; a boundary separates the benign examples from the malignant ones.]
Q&A
Unsupervised Learning
Clustering · Anomaly Detection · Dimensionality Reduction
[Plots: the same data (x_1 = tumor size, x_2 = age) shown first without labels, then with the clusters an unsupervised algorithm discovers.]
Supervised learning: learn from data labeled with the ‘right answer’.
Unsupervised learning: find something interesting in unlabeled data, e.g. clustering.
• What is unsupervised learning, and how does it differ from supervised
learning?
• A) Unsupervised learning involves training a model with labeled data, while
supervised learning uses unlabeled data.
• B) Unsupervised learning is used for classification tasks, whereas
supervised learning is used for clustering.
• C) Unsupervised learning deals with unlabeled data and seeks to find
patterns or structures within the data without explicit target labels.
• D) Unsupervised learning requires more computational resources
compared to supervised learning.
• Which of the following is NOT a common application of unsupervised
learning?
• A) Customer segmentation in marketing
• B) Handwriting recognition
• C) Anomaly detection in cybersecurity
• D) Image compression
Q&A
Linear Regression Model
Linear regression
[Plot: price vs. house size (feet^2), with a fitted regression line through the training points.]
Regression model predicts numbers
Classification model predicts categories
Terminology
      Size x (feet^2)   Price y
(1)        2104           460
(2)        1416           232
(3)        1534           315
(4)         852           178
 …           …             …
(47)       3210           870

Notation:
x = ‘input’ variable (feature)
y = ‘output’ variable
m = number of training examples
(x, y) = single training example
(x^{(i)}, y^{(i)}) = i-th training example
Training set → Learning Algorithm → f
x → f → ŷ (prediction)

How to represent f?

f_{w,b}(x) = wx + b

Linear regression with one variable: univariate linear regression.
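As a quick illustration, here is a minimal Python sketch of this model; the parameter values and the input below are invented for the example, not fitted to anything:

```python
def f(x, w, b):
    """Univariate linear model: f_{w,b}(x) = w*x + b."""
    return w * x + b

# Hypothetical parameters: predict a price for a 1500 ft^2 house.
print(f(1500, w=0.2, b=50))  # 350.0
```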
Cost Function

f_{w,b}(x) = wx + b

[Plots: three candidate lines for f on the same axes]
f(x) = 0*x + 1.5    (w = 0,   b = 1.5)
f(x) = 0.5*x        (w = 0.5, b = 0)
f(x) = 0.5*x + 1    (w = 0.5, b = 1)
f_{w,b}(x) = wx + b

J(w,b) = \frac{1}{2m} \sum_{i=1}^{m} \left( \hat{y}^{(i)} - y^{(i)} \right)^2

Example with f(x) = 0.5*x + 1 on the three training points shown in the plot:

J(w,b) = \frac{1}{2 \cdot 3} \left( (1.5-1)^2 + (2-4)^2 + (2.5-2)^2 \right) = 0.75
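A short Python check of this arithmetic. The three training points (1, 1), (2, 4), (3, 2) are read off the slide's plot, so treat them as an assumption:

```python
def cost(w, b, xs, ys):
    """Squared-error cost J(w,b) = (1/(2m)) * sum of (f(x_i) - y_i)^2."""
    m = len(xs)
    return sum((w * x + b - y) ** 2 for x, y in zip(xs, ys)) / (2 * m)

xs, ys = [1, 2, 3], [1, 4, 2]   # assumed training points from the plot
print(cost(0.5, 1, xs, ys))     # 0.75, matching the computation above
```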
Visual of Cost Function
Gradient Descent

OUTLINE:
1. Initialization: We start with some initial parameter values.
2. Calculate Gradient: We calculate the gradient of the loss function with respect to each parameter. This gradient points in the direction of the steepest increase in the loss.
3. Update Parameters: We adjust the parameters by a small amount in the opposite direction of the gradient. This helps us move closer to the parameter values that minimize the loss.
4. Repeat: We repeat steps 2 and 3 iteratively, each time moving a bit closer to the minimum of the loss function.

[Plots: J(w) as a curve over w, and J(w,b) as a surface over w and b, with gradient descent stepping downhill toward the minimum.]
w_j = w_j - \alpha \frac{\partial}{\partial w_j} J(w,b)

b = b - \alpha \frac{\partial}{\partial b} J(w,b)

\alpha is the learning rate; \frac{\partial}{\partial w_j} J(w,b) is the derivative.

Repeat until convergence.
Correct: simultaneous update

temp_w = w - \alpha \frac{\partial}{\partial w} J(w,b)
temp_b = b - \alpha \frac{\partial}{\partial b} J(w,b)
w = temp_w
b = temp_b

Incorrect:

temp_w = w - \alpha \frac{\partial}{\partial w} J(w,b)
w = temp_w
temp_b = b - \alpha \frac{\partial}{\partial b} J(w,b)   (this derivative is evaluated with the already-updated w)
b = temp_b
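Putting the update rules together, a minimal sketch of gradient descent for univariate linear regression; the data, learning rate, and iteration count are illustrative choices, not values from the slides:

```python
def gradient_descent(xs, ys, alpha=0.01, iters=1000):
    """Minimize J(w,b) for f(x) = w*x + b using simultaneous updates."""
    w, b = 0.0, 0.0
    m = len(xs)
    for _ in range(iters):
        errs = [w * x + b - y for x, y in zip(xs, ys)]    # f(x^(i)) - y^(i)
        dJ_dw = sum(e * x for e, x in zip(errs, xs)) / m  # dJ/dw
        dJ_db = sum(errs) / m                             # dJ/db
        # Tuple assignment updates w and b simultaneously: both gradients
        # above were computed from the old (w, b), as in the 'Correct' column.
        w, b = w - alpha * dJ_dw, b - alpha * dJ_db
    return w, b

print(gradient_descent([1, 2, 3], [1, 4, 2]))
```

Python's tuple assignment plays the role of temp_w and temp_b: both right-hand sides are evaluated before either parameter changes.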
[Plot: J(w) at a point with positive slope]
With a positive slope, \frac{\partial}{\partial w_j} J(w,b) > 0, so the update w_j = w_j - \alpha \frac{\partial}{\partial w_j} J(w,b) decreases w_j, moving it toward the minimum.

[Plot: J(w) at a point with negative slope]
With a negative slope, the derivative is negative, so the same update increases w_j, again moving it toward the minimum.
Learning Rate

w_j = w_j - \alpha \frac{\partial}{\partial w_j} J(w,b)

[Plots: with a proper learning rate, the steps move steadily toward the minimum; with a big learning rate, the steps overshoot the minimum and can diverge.]
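The effect is easy to see on a toy objective. A sketch on J(w) = w^2, whose derivative is 2w; the two learning rates are arbitrary picks on either side of the stability threshold:

```python
def descend(alpha, w=1.0, steps=5):
    """Run gradient descent on J(w) = w^2, so dJ/dw = 2w."""
    for _ in range(steps):
        w = w - alpha * 2 * w
        print(round(w, 4), end=" ")
    print()

descend(alpha=0.1)  # proper rate: 0.8 0.64 0.512 ... shrinks toward 0
descend(alpha=1.1)  # big rate: -1.2 1.44 -1.728 ... overshoots and diverges
```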
[Plot: J(w) with a local minimum, where the slope = 0]

At a local minimum the derivative is zero:
\frac{\partial}{\partial w} J(w,b) = 0
so the update becomes
w = w - \alpha \cdot 0
w = w
Gradient descent can reach a local minimum with a fixed learning rate: once the slope is zero, the parameters stop changing.
[Plot: J(w) with successive gradient descent steps labeled large, not as large, smaller]

w = w - \alpha \frac{\partial}{\partial w} J(w,b)

Near a local minimum:
• the derivative becomes smaller
• the update steps become smaller
The Curse of Local Minima: How to Escape and Find the Global Minimum
• Adding noise
• Momentum (see the sketch below)
• Learning rate adjustment
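Of the three, momentum is the easiest to sketch in code. A minimal version of the momentum update; the coefficient beta = 0.9 is a common default, not a value from the slides:

```python
def momentum_descent(grad, w=1.0, alpha=0.01, beta=0.9, iters=200):
    """Gradient descent with momentum: the velocity v accumulates past
    gradients, which can carry w through flat regions and shallow dips."""
    v = 0.0
    for _ in range(iters):
        v = beta * v - alpha * grad(w)  # blend old velocity with new gradient
        w = w + v
    return w

# Example on J(w) = w^2 (derivative 2w): w oscillates but settles near 0.
print(momentum_descent(lambda w: 2 * w))
```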
Batch Gradient Descent
• Each update uses the entire training set.

Stochastic Gradient Descent (SGD)
• Each update uses a single training example.

Mini-batch Gradient Descent
• Each update uses a small batch of training examples.
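A rough sketch of how one update step differs across the three variants; the helper function, the toy data, and the batch size are all illustrative:

```python
import random

def grad_step(w, b, batch, alpha=0.01):
    """One gradient descent update of (w, b) computed on `batch` only."""
    m = len(batch)
    errs = [(w * x + b - y, x) for x, y in batch]
    dJ_dw = sum(e * x for e, x in errs) / m
    dJ_db = sum(e for e, _ in errs) / m
    return w - alpha * dJ_dw, b - alpha * dJ_db

data = [(1, 1), (2, 4), (3, 2)]  # toy training set
w, b = 0.0, 0.0
w, b = grad_step(w, b, data)                    # batch: all m examples
w, b = grad_step(w, b, [random.choice(data)])   # SGD: one random example
w, b = grad_step(w, b, random.sample(data, 2))  # mini-batch: a small subset
```

In practice the SGD and mini-batch steps are repeated over shuffled passes (epochs) through the training set.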
SUMMARY
Q&A
• Create account
• Create repository
https://www.youtube.com/watch?v=HW29067qVWk
https://www.youtube.com/watch?v=iv8rSLsi1xo