
Unsupervised Learning

AIcademy Summer Camp – Day 2


Learning Outcomes

01. Unsupervised Learning
02. Supervised vs Unsupervised Learning
03. Case Study
04. Benefits and Challenges
05. K-Means Clustering
06. Elbow Method
07. Principal Component Analysis
Supervised vs Unsupervised Learning

Supervised Learning:
• Labeled training data to teach the model
• Desired output values to define correct answers
• Learning algorithm to map inputs to outputs
• Labeled validation data to test model accuracy

Unsupervised Learning:
• Uses input data without predefined outputs
• No labeled data or training context provided
• Employs algorithms to discover inherent data patterns

Reinforcement Learning:
• The model learns by interacting with an environment
• Desired behaviors are reinforced through rewards
• Undesired behaviors may result in penalties
• Develops a policy that maps states to actions
Unsupervised Learning
• Algorithms that learn from unlabeled data
• Used for exploratory analysis, image processing, and identifying key data structures
• Applications: object recognition, medical imaging, anomaly detection, recommendations
• Key Methods:
  • Clustering (e.g., k-means)
  • Dimensionality Reduction (e.g., Principal Component Analysis)
Case Study
Imagine you have a large farm where animals of various species end up mixed together in one pen, and you need an automated sorter. In this scenario, you can use unsupervised learning techniques to group similar animals together automatically.
Benefits and Challenges

Benefits:
• Reduced manual data preparation - no need for labeled data
• Ability to discover unknown patterns in data
• Simpler algorithms

Challenges:
• High computational complexity, especially with large datasets
• Increased risk of inaccurate results
• Potential need for human intervention to validate groupings
K-Means Clustering

Imagine you had some data that you could plot on a line, and you knew you needed to put it into 3 clusters.
K-Means Clustering

[Figure: data on a line forming three clusters: Cluster 1, Cluster 2, Cluster 3]

In this case the data make three relatively obvious clusters. But rather than rely on our eye, let's see if we can get a computer to identify the same 3 clusters.
K-Means Clustering

Step I: Select the number of clusters you want to identify in your data. This is the "K" in "K-means clustering".

In this case, we will select K=3. That is to say, we want to identify 3 clusters.
K-Means Clustering

Step II: Randomly select three distinct data points. These are the initial clusters.
K-Means Clustering

Step III: Measure the distance between the 1st point and each of the three initial clusters: the distance to the blue cluster, to the green cluster, and to the orange cluster.
K-Means Clustering

Step IV: Assign the 1st point to the nearest cluster. In this case the nearest cluster is the blue cluster.
K-Means Clustering

Now we do the same thing for the next point: measure the distances, then assign the point to the nearest cluster (the green one).
K-Means Clustering

Now figure out which cluster the 3rd point belongs to: measure the distances, then assign the point to the nearest cluster (the orange one).
K-Means Clustering

The rest of these points are closest to the orange cluster.
K-Means Clustering

Step V: Calculate the mean of each cluster.

Then we repeat what we just did: measure and re-cluster, now using each cluster's mean as its center.
K-Means Clustering

[Figure: the K-means clusters vs. the clusters we picked by eye: Cluster 1, Cluster 2, Cluster 3]

These K-means clusters are terrible compared to what we did by eye.
K-Means Clustering

[Figure: the total variation within Cluster 1, Cluster 2, and Cluster 3]

Since K-means clustering can't "see" the best clustering, its only option is to keep track of these clusters and their total variation, and do the whole thing over again with different starting points.
K-Means Clustering

• Pick three initial random clusters
• Cluster all the remaining points based on the closest cluster
• Calculate the mean of each cluster, then re-cluster based on the new means
• Repeat until the clusters no longer change
K-Means Clustering

[Figure: the total variation within Cluster 1, Cluster 2, and Cluster 3]

At this point, K-means clustering knows that the 2nd clustering is the best clustering so far. But it doesn't know if it's the best overall, so it will try a few more restarts (it does as many as you tell it to) and then return this clustering if it is still the best. The sketch below puts the whole procedure together.
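A minimal sketch of this whole procedure in Python with NumPy, on toy 1-D data like the examples above (the function name and the data values are illustrative, not from the slides):

import numpy as np

def kmeans_1d(points, k, n_restarts=10, max_iters=100, seed=0):
    # Toy K-means for 1-D data with random restarts; keeps the run
    # with the lowest total variation within the clusters.
    rng = np.random.default_rng(seed)
    best = (None, None, np.inf)  # (centers, labels, total variation)
    for _ in range(n_restarts):
        # Step II: randomly pick k distinct data points as initial clusters
        centers = points[rng.choice(len(points), size=k, replace=False)]
        for _ in range(max_iters):
            # Steps III-IV: assign every point to its nearest cluster
            labels = np.abs(points[:, None] - centers[None, :]).argmin(axis=1)
            # Step V: recompute each cluster's mean (keep old center if a cluster is empty)
            new_centers = np.array([points[labels == j].mean() if np.any(labels == j)
                                    else centers[j] for j in range(k)])
            if np.allclose(new_centers, centers):  # clusters no longer change
                break
            centers = new_centers
        # total variation within the clusters, used to rank the restarts
        total = sum(((points[labels == j] - centers[j]) ** 2).sum() for j in range(k))
        if total < best[2]:
            best = (centers, labels, total)
    return best

data = np.array([1.0, 1.2, 1.5, 5.0, 5.3, 5.1, 9.0, 9.4, 9.2])
centers, labels, total_variation = kmeans_1d(data, k=3)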
K-Means Clustering

What if our data is plotted in 2 dimensions? Just like before, we pick three random points, and we use the Euclidean distance:

$d = \sqrt{x^2 + y^2}$

where x and y are the horizontal and vertical distances between a point and a cluster center.
K-Means Clustering

[Figure: 2-D data grouped into Cluster 1, Cluster 2, and Cluster 3]
K-Means Clustering

• Groups the data into 'K' groups based on similarities (or distance) between the features of the items in the data
• Finds K cluster centers that best split the data
• Minimizes the variance within each cluster
• When testing a new item, it's placed in the group it's most similar to
Let's group this point together

Given the point S with coordinates S(0.4, 0.4) and two clusters, X and O: to which cluster should this new point be assigned? Justify your answer!

Euclidean Distance Reminder: $d(P, Q) = \sqrt{(x_P - x_Q)^2 + (y_P - y_Q)^2}$

Let's group this point together

Given the point S with coordinates S(0.4, 0.4) and the two cluster centers X(0.25, 0.72) and O(0.7, 0.31):

$d(S, X) = \sqrt{(0.4 - 0.25)^2 + (0.4 - 0.72)^2} = \sqrt{0.1249} \approx 0.353$

$d(S, O) = \sqrt{(0.4 - 0.7)^2 + (0.4 - 0.31)^2} = \sqrt{0.0981} \approx 0.313$

$d(S, O) < d(S, X)$

Our new point S belongs to cluster O.
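A quick check of this arithmetic in plain Python (the helper name dist is just illustrative):

import math

S, X, O = (0.4, 0.4), (0.25, 0.72), (0.7, 0.31)

def dist(p, q):
    # Euclidean distance between two 2-D points
    return math.sqrt((p[0] - q[0]) ** 2 + (p[1] - q[1]) ** 2)

print(dist(S, X))  # ~0.353
print(dist(S, O))  # ~0.313, so S is assigned to cluster O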


How do we specify the number of clusters K?

Choosing the right number of groups (k) can be tricky! What happens if we pick k=2? k=4? k=5?

There are many ways to solve this:
• Elbow method (we will learn this)
• Silhouette method
• Gap statistics
Elbow Method

1. We try k-means for different values of k (like k=1, 2, ..., 10).
2. For each k, we calculate how far each point is from the center of its group.
3. We plot these distances against k.
4. The "elbow" point, where the plot bends, shows the best number of groups (see the sketch below).
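A minimal sketch with scikit-learn (the data array is a random placeholder; inertia_ is scikit-learn's name for the total within-cluster sum of squared distances):

import numpy as np
import matplotlib.pyplot as plt
from sklearn.cluster import KMeans

data = np.random.rand(200, 2)  # placeholder dataset

ks = range(1, 11)
inertias = []
for k in ks:
    km = KMeans(n_clusters=k, n_init=10, random_state=0).fit(data)
    inertias.append(km.inertia_)  # total within-cluster squared distance

plt.plot(list(ks), inertias, marker="o")
plt.xlabel("k (number of clusters)")
plt.ylabel("total within-cluster distance")
plt.show()  # the 'elbow' where the curve bends suggests the best k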
Test Your Knowledge

True or False: In k-means clustering, the clusters are defined by boundaries.
Test Your Knowledge

False. Clusters are defined by their centroids.
Principal Component Analysis
Taking a Picture of a Teapot

How do you take a picture that captures the most information about the teapot?
Which angle is the best?

[Figure: four candidate angles: A, B, C, and D]
Best position for a teapot snapshot?

Why this position? Because it provides the most visual information.

How do we find this position? Rotate the teapot according to the PCA algorithm.
Finding the longest axis
Finding the second-longest axis while keeping the first axis fixed
How does PCA work?

Rotate the object around its center to find the best orientation:
• First, find the axis along which the object has the largest average extent.
• Then rotate the object around the first axis to find a second axis, perpendicular to the first, along which the object has the largest average extent.

The two axes found are the first and second principal components. The average extents along these axes are called the eigenvalues.
PCA

• PCA is a technique that allows the extraction of the most important trends in the data
• It helps reveal the underlying trends by constructing a new coordinate system by rotating the axes
• The first direction is the one along which the data varies the most, the second is the one along which it varies second most, and so on
• In summary, it learns a few principal components that are representative of the whole dataset, from which any element of the dataset can be reconstructed (a minimal numerical sketch follows)
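A minimal sketch of this idea in NumPy, treating the principal axes as eigenvectors of the covariance matrix (a simplified view; production libraries typically use the SVD, and the dataset here is a random placeholder):

import numpy as np

X = np.random.rand(100, 3)      # placeholder dataset (samples x features)
Xc = X - X.mean(axis=0)         # center the data at the origin

cov = np.cov(Xc, rowvar=False)  # feature covariance matrix
eigvals, eigvecs = np.linalg.eigh(cov)

# sort by decreasing eigenvalue: PC1 is the direction of largest variance
order = np.argsort(eigvals)[::-1]
eigvals, eigvecs = eigvals[order], eigvecs[:, order]

scores = Xc @ eigvecs[:, :2]    # project the data onto the first two PCs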
Popular Applications

• Visualization of high-dimensional data
• Finding essential attributes and variables
• Dictionary Learning
• Dimensionality Reduction
• Filtering of data
Partner & a Business Idea

Capital required = $4M
Expected contribution per partner = $4M / 4 = $1M

Who is more important?

Partner                1      2      3      4
Actual contribution    $1.8M  $1.2M  $0.6M  $0.4M
Proportion             45%    30%    15%    10%
Cumulative             45%    75%    90%    100%

(The cumulative row adds up the proportions: 45 + 30 = 75, and so on.)
Conclusion

• Principal Components are the partners (Eigenvectors)
• Each has its own contribution (Eigenvalues)
• Keeping only 90% of the contributions results in removing partner 4 from the equation
• Maybe the company is better off with 3 partners: the top 3 principal components!
• Note: data should be free of outliers and should be on the same scale
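The partner table maps directly onto PCA: here is a sketch of the same bookkeeping in NumPy, with the contributions standing in for eigenvalues:

import numpy as np

contributions = np.array([1.8, 1.2, 0.6, 0.4])    # the partners' contributions ($M)
proportion = contributions / contributions.sum()   # 45%, 30%, 15%, 10%
cumulative = np.cumsum(proportion)                 # 45%, 75%, 90%, 100%

# smallest number of partners (PCs) covering 90% of the total
# (the tiny tolerance guards against floating-point rounding)
n_keep = int(np.argmax(cumulative >= 0.90 - 1e-9) + 1)
print(n_keep)  # 3 -> partner 4 is dropped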
Test Your Knowledge

True or False: We can say that PCA is a compression technique.

Test Your Knowledge

True. Keeping only the top principal components stores an approximation of the data far more compactly, at the cost of some reconstruction error.
Eigenfaces using PCA

• When PCA is applied on face images, the Eigenvectors extracted are called Eigenfaces
• Each person's face has unique features that distinguish them from others
• Is it possible to identify some facial features that can represent all the faces in the world?
• Examples: normal ears, pointy ears, round eyes, almond-shaped eyes, hair, chin shapes, ...
Eigenfaces using PCA

• Consider the faces in the figure
• We want to learn the basis features of these faces using PCA
• The most common features are represented in the form of Eigenface 1 (PC1), the second most common features in the form of Eigenface 2 (PC2), etc.
Eigenfaces

• Applying PCA on the faces dataset, we extract, say, 2000 Eigenfaces
• The two dominant Eigenfaces (Eigenface 1 and Eigenface 2) are shown
• Every face can now be reconstructed using these Eigenfaces
• Example:

Original Face = -1.3 × Eigenface 1 + 2.3 × Eigenface 2 + ... + 0.02 × Eigenface 2000
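A sketch of that reconstruction in NumPy (the shapes and the PCA basis are placeholders; in practice the Eigenfaces and mean face come from running PCA on the face dataset, and adding the mean face back is a step the slides leave implicit):

import numpy as np

n_pixels, n_components = 64 * 64, 2000
eigenfaces = np.random.rand(n_components, n_pixels)  # placeholder PCA basis
mean_face = np.zeros(n_pixels)                       # placeholder mean of the training faces

weights = np.zeros(n_components)
weights[0], weights[1], weights[-1] = -1.3, 2.3, 0.02  # coefficients from the example above

# a face = mean face + weighted sum of Eigenfaces
reconstructed = mean_face + weights @ eigenfaces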


How many Eigenfaces should we consider?

[Figure: an original face next to faces reconstructed from increasing numbers of PCs]

With 400 PCs, the reconstructed face starts to look like the original.
Selecting the optimal number of Principal Components

• A well-known technique for selecting the optimal number of components is to choose the number of PCs that express 95% of the variance (see the sketch below)
• This assumes that the remaining 5% is noise
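scikit-learn supports this selection rule directly: passing a float between 0 and 1 as n_components keeps the smallest number of PCs that explain that fraction of the variance (the dataset here is a random placeholder):

import numpy as np
from sklearn.decomposition import PCA

X = np.random.rand(500, 50)   # placeholder dataset

pca = PCA(n_components=0.95)  # keep enough PCs to explain 95% of the variance
X_reduced = pca.fit_transform(X)

print(pca.n_components_)                    # number of PCs actually kept
print(pca.explained_variance_ratio_.sum())  # >= 0.95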
Test Your Knowledge

What are two examples of unsupervised machine learning methods?

Test Your Knowledge

Clustering and Dimensionality Reduction


Test Your Knowledge: Applying PCA
THANK YOU!
