Foundations of Machine Learning
DSA 5102 • Lecture 9
Li Qianxiao
Department of Mathematics
Last Time
Until now, we have focused on supervised learning
• Datasets come as input-label pairs
• Goal is to learn their relationship for prediction
For the rest of the course, we are going to look at a variety of
unsupervised learning methodologies.
As always, we start with the simplest linear cases and proceed
from there.
Unsupervised Learning Overview
Supervised Learning
Supervised learning is about learning to make predictions
[Figure: an oracle labels input images as "Cat" or "Dog"; a predictive model learns to produce the same labels]
Our goal: using data, learn a predictive model that approximates the oracle
Unsupervised Learning
Unsupervised learning is where we do not have label information
[Figure: the same input images, but the oracle's labels ("Cat", "Dog") are not observed]
Example goal: learn some task-agnostic patterns from the input data
Examples of Unsupervised Learning
Tasks: Dimensionality Reduction
https://media.geeksforgeeks.org/wp-content/uploads/Dimensionality_Reduction_1.jpg
Examples of Unsupervised Learning
Tasks: Clustering
https://upload.wikimedia.org/wikipedia/commons/thumb/c/c8/Cluster-2.svg/1200px-Cluster-2.svg.png
Examples of Unsupervised Learning
Tasks: Density Estimation
By طاها- Own work, CC BY-SA 3.0, https://commons.wikimedia.org/w/index.php?curid=24309466
Examples of Unsupervised Learning
Tasks: Generative Models
http://www.lherranz.org/wp-content/uploads/2018/07/blog_generativesampling.png
Why unsupervised learning?
• Labelled data is expensive to collect
• In some settings, labels are impossible to obtain
• Some application scenarios (e.g. clustering, density estimation, generation) are not prediction problems at all
Principal Component Analysis
Review: Eigenvalues and Eigenvectors
• For a square matrix $A$, an eigenvector $v \neq 0$ with associated eigenvalue $\lambda$ satisfies $A v = \lambda v$
• We say $A$ is diagonalizable if there exists a diagonal $\Lambda$ (matrix of eigenvalues) and an invertible $P$ (columns = eigenvectors) such that $A = P \Lambda P^{-1}$
• $A$ is symmetric if $A = A^T$. $U$ is orthogonal if $U^T U = U U^T = I$
• Well-known result: if $A$ is symmetric then it is diagonalizable by orthogonal
matrices, i.e. $A = U \Lambda U^T$
Columns of $U$ are orthonormal: $u_i^T u_j = \delta_{ij}$. In fact, $\{u_1, \dots, u_d\}$ is an orthonormal basis for $\mathbb{R}^d$. Moreover, the eigenvalues are real.
Watch this! https://www.youtube.com/watch?v=PFDu9oVAE-g&t=453s
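To make the review concrete, here is a minimal NumPy sketch (not from the slides) that verifies the spectral decomposition $A = U \Lambda U^T$ for a symmetric matrix:

```python
import numpy as np

# Build a random symmetric matrix A = B + B^T
rng = np.random.default_rng(0)
B = rng.standard_normal((4, 4))
A = B + B.T

# eigh is specialized for symmetric matrices: real eigenvalues, orthonormal eigenvectors
eigvals, U = np.linalg.eigh(A)
Lam = np.diag(eigvals)

# Check A = U Λ U^T and U^T U = I
print(np.allclose(A, U @ Lam @ U.T))    # True
print(np.allclose(U.T @ U, np.eye(4)))  # True
```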
Review: Eigenvalues and Eigenvectors
• A symmetric matrix $A$ is
• Positive semi-definite if $x^T A x \geq 0$ for all $x$
• Positive definite if $x^T A x > 0$ for all $x \neq 0$
• Suppose $A$ is symmetric positive definite. Then, WLOG we will
order its eigenvalues $\lambda_1 \geq \lambda_2 \geq \dots \geq \lambda_d > 0$,
and $u_1, u_2, \dots, u_d$ are the corresponding orthonormal eigenvectors.
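A small sketch (again not from the slides) showing how to reorder the output of np.linalg.eigh into the descending convention $\lambda_1 \geq \dots \geq \lambda_d$ used above, applied to a positive semi-definite matrix such as a sample covariance:

```python
import numpy as np

rng = np.random.default_rng(1)
X = rng.standard_normal((100, 3))
Xc = X - X.mean(axis=0)
C = Xc.T @ Xc / len(Xc)            # sample covariance: symmetric positive semi-definite

eigvals, U = np.linalg.eigh(C)     # eigh returns eigenvalues in ascending order
order = np.argsort(eigvals)[::-1]  # reorder to the descending convention of the slide
eigvals, U = eigvals[order], U[:, order]

print(eigvals)                     # all >= 0 since C is positive semi-definite
```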
Motivating PCA: Shoe Sizes
Capturing the Variation?
Although there are two dimensions to the data, there is really one
effective dimension! How do we uncover this dimension?
A Dynamic Visualization
Two formulations:
• Find the direction that captures the most variance
• Find the direction that minimizes projection error
Derivation of PCA (Maximize Variance)
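The slide's own steps are not reproduced in this text; the following is the standard variance-maximization argument, reconstructed with the notation of the review slides. For centered data with sample covariance $C$,

$$\max_{\|w\|=1} \ \frac{1}{N} \sum_{i=1}^{N} \big( w^T (x^{(i)} - \bar{x}) \big)^2 \;=\; \max_{\|w\|=1} \ w^T C w, \qquad C = \frac{1}{N} \sum_{i=1}^{N} (x^{(i)} - \bar{x})(x^{(i)} - \bar{x})^T.$$

Writing $w = \sum_j a_j u_j$ in the orthonormal eigenbasis of $C$ gives $w^T C w = \sum_j \lambda_j a_j^2 \leq \lambda_1$ (since $\sum_j a_j^2 = 1$), with equality at $w = u_1$. So the direction of maximum variance is the leading eigenvector of the sample covariance.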
Derivation of PCA (Minimize Error)
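Again reconstructing the standard argument (the slide's own steps are not in this text): projecting each centered point $\tilde{x}^{(i)} = x^{(i)} - \bar{x}$ onto a unit direction $w$ gives

$$\frac{1}{N}\sum_{i=1}^{N} \big\| \tilde{x}^{(i)} - (w^T \tilde{x}^{(i)})\, w \big\|^2 \;=\; \frac{1}{N}\sum_{i=1}^{N} \|\tilde{x}^{(i)}\|^2 \;-\; w^T C w.$$

The first term does not depend on $w$, so minimizing the projection error is equivalent to maximizing the variance $w^T C w$: the two formulations yield the same principal direction.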
The PCA Algorithm
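The algorithm box from the slide is not in this text; below is a minimal NumPy sketch of the usual steps (center, eigendecompose the sample covariance, project), with function and variable names chosen here for illustration:

```python
import numpy as np

def pca(X, m):
    """Return the top-m principal directions, the component scores, and the eigenvalues.
    X : (N, d) data matrix, rows are samples.  (Names and conventions are illustrative.)
    """
    Xc = X - X.mean(axis=0)                # 1. center the data
    C = Xc.T @ Xc / len(Xc)                # 2. sample covariance (d, d)
    eigvals, U = np.linalg.eigh(C)         # 3. eigendecomposition (ascending order)
    order = np.argsort(eigvals)[::-1]      # 4. sort eigenvalues in descending order
    eigvals, U = eigvals[order], U[:, order]
    U_m = U[:, :m]                         # 5. keep the top-m eigenvectors
    Z_m = Xc @ U_m                         # 6. principal component scores Z_m = X U_m
    return U_m, Z_m, eigvals

# Example usage
X = np.random.default_rng(2).standard_normal((200, 5))
U_m, Z_m, eigvals = pca(X, m=2)
print(Z_m.shape)   # (200, 2)
```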
Simple Example
Choosing The Embedding Dimension
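The slide's selection criterion is not in the text; a common choice (assumed here) is to pick the smallest $m$ whose cumulative explained variance ratio exceeds a threshold:

```python
import numpy as np

def choose_m(eigvals, threshold=0.95):
    """Smallest m such that the top-m eigenvalues explain >= threshold of total variance.
    (This rule and the 95% threshold are illustrative, not necessarily the slide's choice.)"""
    ratios = np.cumsum(eigvals) / np.sum(eigvals)    # eigvals assumed sorted descending
    return int(np.searchsorted(ratios, threshold) + 1)
```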
PCA in Feature Space (Example)
PCA in Feature Space
We define a vector of feature maps $\phi(x) = (\phi_1(x), \dots, \phi_k(x))^T$
Form the design matrix $\Phi$ with rows $\phi(x^{(i)})^T$, $i = 1, \dots, N$
Perform PCA on the transformed dataset!
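A small sketch of this idea with a hypothetical quadratic feature map (the slide's own example follows on the next slides; the features below are just an illustration):

```python
import numpy as np
from sklearn.decomposition import PCA

def feature_map(x):
    # Hypothetical feature map phi(x) = (x1, x2, x1^2, x2^2, x1*x2) for 2-D inputs
    x1, x2 = x
    return np.array([x1, x2, x1**2, x2**2, x1 * x2])

X = np.random.default_rng(3).standard_normal((100, 2))
Phi = np.stack([feature_map(x) for x in X])    # design matrix of the transformed data

Z_m = PCA(n_components=2).fit_transform(Phi)   # ordinary PCA applied in feature space
print(Z_m.shape)                               # (100, 2)
```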
PCA in Feature Space
PCA in Feature Space (Example)
Define Feature Maps
PCA as a Form of Whitening
Recall: the principal component scores are given by $Z_m = X U_m$ (with $X$ centered)
Define the transformation $\tilde{X} = X U \Lambda^{-1/2}$
Then, the sample covariance of $\tilde{X}$ is the identity!
In other words, $\tilde{X}$ has uncorrelated, unit-variance features. This is known as a
PCA whitening transform.
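A minimal NumPy sketch of the whitening transform above (the exact normalization, $1/N$ here, is my assumption):

```python
import numpy as np

rng = np.random.default_rng(4)
X = rng.standard_normal((500, 3)) @ np.array([[2.0, 0.5, 0.0],
                                              [0.0, 1.0, 0.3],
                                              [0.0, 0.0, 0.2]])   # correlated features
Xc = X - X.mean(axis=0)
C = Xc.T @ Xc / len(Xc)

eigvals, U = np.linalg.eigh(C)
X_white = Xc @ U @ np.diag(1.0 / np.sqrt(eigvals))   # PCA whitening: X U Λ^{-1/2}

# The whitened sample covariance is (numerically) the identity
print(np.round(X_white.T @ X_white / len(X_white), 3))
```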
Example: Iris Dataset
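The slide's Iris figure is not in this text; a sketch of what such an example might look like with scikit-learn (the tooling and settings are my assumption):

```python
from sklearn.datasets import load_iris
from sklearn.decomposition import PCA

X, y = load_iris(return_X_y=True)            # 150 samples, 4 features, 3 species
Z = PCA(n_components=2, whiten=True).fit_transform(X)

# Each row of Z is a whitened 2-D principal component score; plotting Z coloured
# by y typically shows setosa well separated from the other two species.
print(Z.shape)                               # (150, 2)
```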
Autoencoders
PCA as Compression Algorithm
Encoder: $Z_m = X U_m$   (latent code $Z_m$)
Decoder: $X' = Z_m U_m^T$
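A short sketch of this encode/decode view, reusing scikit-learn's PCA (names and sizes are illustrative):

```python
import numpy as np
from sklearn.decomposition import PCA

X = np.random.default_rng(5).standard_normal((200, 10))
pca = PCA(n_components=3).fit(X)

Z_m = pca.transform(X)               # encoder: project (centered) X onto the top-m directions
X_rec = pca.inverse_transform(Z_m)   # decoder: map the 3-D latent code back to 10-D input space

print(np.mean((X - X_rec) ** 2))     # reconstruction error of the compression
```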
Autoencoders
In this sense, the autoencoder is a nonlinear counterpart of PCA-based compression!
PCA:  Encoder $Z_m = X U_m$   →   Latent $Z_m$   →   Decoder $X' = Z_m U_m^T$
AE:   Encoder $Z_m = T_{\mathrm{enc}}(X; \theta)$   →   Latent $Z_m$   →   Decoder $X' = T_{\mathrm{dec}}(Z_m; \phi)$
Neural Network Autoencoders
How do we pick the encoding $T_{\mathrm{enc}}$ and decoding $T_{\mathrm{dec}}$?
One choice: use universal approximators, e.g. neural networks!
Here $T_{\mathrm{enc}}(\cdot\,; \theta)$ and $T_{\mathrm{dec}}(\cdot\,; \phi)$ are neural networks, where $\theta$ and $\phi$ are their trainable parameters.
Neural Network Autoencoders
Given a dataset $\{x^{(i)}\}_{i=1}^{N}$, we solve the empirical risk minimization problem
$$\min_{\theta, \phi} \ \frac{1}{N} \sum_{i=1}^{N} \big\| x^{(i)} - T_{\mathrm{dec}}\big(T_{\mathrm{enc}}(x^{(i)}; \theta); \phi\big) \big\|^2$$
to minimize the distance between $X$ and the reconstruction $X'$.
The empirical risk minimization uses the inputs themselves as labels!
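A minimal PyTorch sketch of this objective (the architecture, sizes, and optimizer settings here are my own choices, not the lecture's):

```python
import torch
import torch.nn as nn

d, m = 10, 2                                   # input and latent dimensions (illustrative)
encoder = nn.Sequential(nn.Linear(d, 32), nn.ReLU(), nn.Linear(32, m))  # T_enc(.; theta)
decoder = nn.Sequential(nn.Linear(m, 32), nn.ReLU(), nn.Linear(32, d))  # T_dec(.; phi)

X = torch.randn(500, d)                        # stand-in dataset
opt = torch.optim.Adam(list(encoder.parameters()) + list(decoder.parameters()), lr=1e-3)

for epoch in range(200):
    X_rec = decoder(encoder(X))                # X' = T_dec(T_enc(X))
    loss = ((X - X_rec) ** 2).mean()           # ERM with the inputs as their own labels
    opt.zero_grad()
    loss.backward()
    opt.step()
```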
Demo: PCA and Autoencoders
Summary
PCA fits an ellipsoid to data. Two interpretations:
• Maximize variance
• Minimize error
PCA is useful for:
• Dimensionality reduction
• Feature extraction / clustering
• Data whitening
Viewed as reconstruction algorithms, autoencoders are a nonlinear
analogue of PCA