
Murang’a University of Technology

Innovation for Prosperity


Lecture 4

Unsupervised Learning
What is Unsupervised Learning?

• In the previous topic, we covered supervised learning, in which
models are trained on labeled data under the supervision of the
training labels.
• In many cases, however, we do not have labeled data and need to
find the hidden patterns in a given dataset.
• To handle such cases in machine learning, we need unsupervised
learning techniques.
• As the name suggests, unsupervised learning is a machine learning
technique in which models are not supervised using a labeled training
dataset. Instead, the models themselves find the hidden patterns and
insights in the given data.

What is Unsupervised Learning?

• Unsupervised learning can be defined as a type of machine learning in
which models are trained on an unlabeled dataset and are expected to
learn from that data without any supervision.
• Unsupervised learning processes unlabeled input data by discovering
hidden patterns and grouping similar objects together.

Types of Unsupervised Learning
• Unsupervised learning can be categorized into three primary types:
– Clustering: a method of grouping objects into clusters such that
objects with the most similarities remain in one group and have
little or no similarity with the objects of other groups.
– Association: a method used for finding relationships between
variables in a large database. It determines the sets of items that
occur together in the dataset.
– Dimensionality Reduction: the process of transforming
high-dimensional data into a lower-dimensional space that still
preserves the essence of the original data.

Clustering
• This is a method of unsupervised learning that
groups together data points that share similar
characteristics.

Clustering Algorithms
1. k-Means Clustering:
• K-means clustering is a technique used to organize data into groups
based on their similarity. It divides the dataset into k clusters by
minimizing the distance between data points and the cluster
centroids.
Steps (a code sketch follows the list):
1. Choose the number of clusters k.
2. Initialize k centroids randomly.
3. Assign each data point to the nearest centroid.
4. Update centroids based on the mean of assigned points.
5. Repeat steps 3-4 until centroids stabilize.
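
A minimal sketch of these steps in Python, assuming scikit-learn is
available (the lecture does not prescribe a library, and the data here
is synthetic):

    import numpy as np
    from sklearn.cluster import KMeans

    # Synthetic 2-D data: two loose blobs around (0, 0) and (5, 5).
    rng = np.random.default_rng(42)
    X = np.vstack([rng.normal(0, 1, (50, 2)), rng.normal(5, 1, (50, 2))])

    # k = 2; fit() runs the assign/update loop until the centroids stabilize.
    kmeans = KMeans(n_clusters=2, n_init=10, random_state=42).fit(X)

    print("Centroids:\n", kmeans.cluster_centers_)
    print("First 10 labels:", kmeans.labels_[:10])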

k-Means Clustering

[Figure: illustration of k-means cluster assignment and centroid updates]
Clustering Algorithms
2. Hierarchical Clustering:
• This is an unsupervised learning technique that builds a hierarchy of
clusters by either merging smaller clusters into larger ones or splitting
larger clusters into smaller ones.
• The result is often represented as a dendrogram, a tree-like diagram
showing the nested grouping of data points and their similarity levels.
• Two main approaches (a code sketch follows the list):
• Agglomerative: bottom-up approach (merge clusters). Each data point
starts as its own cluster, and the closest clusters are successively
merged until one big cluster contains all data points.
• Divisive: top-down approach (split clusters). All data points start in
one big cluster, which is then recursively divided into smaller groups.
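
A minimal agglomerative sketch, assuming SciPy is available (one
possible library choice, not prescribed by the lecture; the data is
synthetic):

    import numpy as np
    from scipy.cluster.hierarchy import linkage, fcluster

    # Six points forming two obvious groups.
    X = np.array([[0, 0], [0, 1], [1, 0],
                  [10, 10], [10, 11], [11, 10]])

    # Agglomerative (bottom-up) merging with Ward's linkage.
    Z = linkage(X, method="ward")

    # Cut the hierarchy into two flat clusters.
    labels = fcluster(Z, t=2, criterion="maxclust")
    print(labels)  # e.g. [1 1 1 2 2 2]

    # scipy.cluster.hierarchy.dendrogram(Z) would draw the tree via matplotlib.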
Hierarchical Clustering

[Figure: dendrogram showing the nested grouping of data points]
Clustering Algorithms
3. DBSCAN (Density-Based Spatial Clustering of Applications with
Noise)
• DBSCAN identifies clusters of high-density data points and labels
points in low-density regions as noise.
Key Concepts (a code sketch follows the list):
• Core Points: points with at least a minimum number of neighbors
within a specified radius ϵ.
• Border Points: points within ϵ of a core point but not dense enough
to be core points themselves.
• Noise Points: points that are neither core nor border points.
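
A minimal sketch, assuming scikit-learn; the eps (the radius ϵ) and
min_samples values below are illustrative:

    import numpy as np
    from sklearn.cluster import DBSCAN

    # Two dense blobs plus a single far-away outlier.
    rng = np.random.default_rng(0)
    X = np.vstack([rng.normal(0, 0.3, (30, 2)),
                   rng.normal(5, 0.3, (30, 2)),
                   [[20.0, 20.0]]])

    # eps is the neighborhood radius; min_samples is the core-point threshold.
    db = DBSCAN(eps=1.0, min_samples=5).fit(X)

    # Cluster ids; DBSCAN labels noise points as -1.
    print(set(db.labels_))  # e.g. {0, 1, -1}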

Association
• Association rule learning is a type of unsupervised learning technique
that checks for the dependency of one data item on another.
• It tries to find interesting relations or associations among the
variables of a dataset, using rules to discover these relations between
variables in the database.
• Association rule learning is one of the very important concepts of
machine learning, and it is employed in market basket analysis, web
usage mining, continuous production, etc.

Association
• Association rule learning works on the concept of if-then rules, such
as "if A then B". The "if" part is called the antecedent, and the
"then" part is called the consequent.

Association Metrics
1. Support:
• Measures how frequently an itemset appears in the dataset:
support(X) = (transactions containing X) / (total transactions).
• Example: if 20 out of 100 transactions contain {bread, milk}, support is 0.2
or 20%.
2. Confidence:
• Measures the likelihood of Y occurring given X:
confidence(X → Y) = support(X ∪ Y) / support(X).
• Example: if 15 of the 20 transactions containing bread also include milk,
confidence is 0.75 or 75%.
3. Lift:
• Measures the strength of a rule compared to random chance:
lift(X → Y) = confidence(X → Y) / support(Y).
• Example: lift > 1 indicates a positive association, lift = 1 indicates
independence, and lift < 1 indicates a negative association.

Association Algorithms
i. Apriori Algorithm
• Generates frequent itemsets using a breadth-first search and
identifies association rules, eliminating infrequent itemsets using a
minimum support threshold.
ii. FP-Growth (Frequent Pattern Growth)
• Builds a compact tree structure (the FP-tree) to identify frequent
itemsets without candidate generation. It is an improved version of
the Apriori algorithm.
iii. Eclat Algorithm
• Eclat stands for Equivalence Class Transformation. This algorithm
uses a depth-first search technique, in contrast to Apriori's
breadth-first search, to find frequent itemsets.

Practical Example in Python
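
The slide's original code listing is not reproduced here. Below is a
minimal sketch of association rule mining, assuming the mlxtend
library and a toy transaction list (both are illustrative choices, not
taken from the slides):

    import pandas as pd
    from mlxtend.preprocessing import TransactionEncoder
    from mlxtend.frequent_patterns import apriori, association_rules

    transactions = [
        ["bread", "milk"],
        ["bread", "diapers", "beer", "eggs"],
        ["milk", "diapers", "beer", "cola"],
        ["bread", "milk", "diapers", "beer"],
        ["bread", "milk", "diapers", "cola"],
    ]

    # One-hot encode the transactions into a boolean DataFrame.
    te = TransactionEncoder()
    df = pd.DataFrame(te.fit(transactions).transform(transactions),
                      columns=te.columns_)

    # Keep itemsets appearing in at least 40% of transactions.
    frequent_itemsets = apriori(df, min_support=0.4, use_colnames=True)

    # Derive rules such as {bread} -> {milk} with confidence >= 0.6.
    rules = association_rules(frequent_itemsets, metric="confidence",
                              min_threshold=0.6)
    print(rules[["antecedents", "consequents", "support", "confidence", "lift"]])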
Dimensionality Reduction
• Refers to a process of transforming high-dimensional data into a
lower-dimensional space that still preserves the essence of the
original data.
• Dimensionality reduction can be defined as "a way of converting a
higher-dimensional dataset into a lower-dimensional dataset while
ensuring that it provides similar information."
• It is a critical aspect of unsupervised learning, particularly useful
when dealing with datasets that have a large number of dimensions
or features.
• The main goal of dimensionality reduction is to simplify the dataset
without losing much information, making it easier to visualize,
analyze, and interpret.

The Curse of Dimensionality
• The Curse of Dimensionality refers to the challenges that arise as
the number of features (dimensions) in a dataset increases.
• As the dimensionality of the input dataset increases, any machine
learning model becomes more complex.
• As the number of features grows, the number of samples needed to
cover the feature space grows rapidly, and the chance of overfitting
increases.
• A machine learning model trained on high-dimensional data therefore
tends to become overfitted and perform poorly.
• In short, the Curse of Dimensionality leads to reduced model
performance, higher computational costs, and challenges in analyzing
and generalizing high-dimensional data effectively.

Dimensionality Reduction
• To overcome the curse of dimensionality, there are two main
approaches to reducing the number of features (dimensions) in a
dataset while retaining as much relevant information as possible.
1. Feature Selection
• This is the process of selecting a subset of the relevant features
and leaving out the irrelevant features present in a dataset, in order
to build a model of high accuracy. In other words, it is a way of
selecting the optimal features from the input dataset.
2. Feature Extraction
• Feature extraction creates new features by transforming the
original features into a lower-dimensional space.

Feature Selection
Three methods are used for feature selection:
1. Filter Methods
• The dataset is filtered, and a subset that contains only the
relevant features is taken.
• Features are selected based on statistical measures or relevance
scores, independently of any machine learning model (see the sketch
after this list).
2. Embedded Methods
• Embedded methods examine the different training iterations of the
machine learning model and evaluate the importance of each feature.
• Feature selection is integrated into the model training process,
where the algorithm itself identifies the important features.

Feature Selection
3. Wrapper Methods
• The wrapper method has the same goal as the filter method, but it
uses a machine learning model for its evaluation.
• Some features are fed to the ML model, and its performance is
evaluated; based on the performance, features are added or removed
to increase the accuracy of the model.
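
A minimal sketch of a filter method that needs no labels, assuming
scikit-learn; VarianceThreshold simply drops near-constant features
(an illustrative choice, not taken from the slides):

    import numpy as np
    from sklearn.feature_selection import VarianceThreshold

    # Four features; the third is almost constant and carries little information.
    X = np.array([[1.0, 4.1, 0.0, 9.0],
                  [2.0, 3.9, 0.0, 1.0],
                  [3.0, 4.0, 0.1, 5.0],
                  [4.0, 4.2, 0.0, 7.0]])

    # Keep only features whose variance exceeds the threshold.
    selector = VarianceThreshold(threshold=0.01)
    X_reduced = selector.fit_transform(X)

    print(X_reduced.shape)         # (4, 3): the near-constant column is removed
    print(selector.get_support())  # boolean mask of kept features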

Feature Extraction
1. Principal Component Analysis (PCA)
• PCA is a linear dimensionality reduction technique that projects
data onto a new set of orthogonal components, called principal
components, ordered by the amount of variance they explain.
Steps (a code sketch follows the list):
1. Standardize the dataset.
2. Compute the covariance matrix of the features.
3. Calculate the eigenvectors and eigenvalues of the covariance matrix.
4. Select the top k eigenvectors corresponding to the largest
eigenvalues to form the new feature space.
5. Project the data onto the selected eigenvectors to obtain the
reduced dataset.
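
A minimal sketch of these steps, assuming scikit-learn, whose PCA
performs steps 2-5 internally (via an equivalent SVD); the data is
synthetic:

    import numpy as np
    from sklearn.decomposition import PCA
    from sklearn.preprocessing import StandardScaler

    # Synthetic data: 100 samples, 5 features built from 2 underlying factors.
    rng = np.random.default_rng(1)
    base = rng.normal(size=(100, 2))
    X = np.hstack([base,
                   base @ rng.normal(size=(2, 3)) + rng.normal(0, 0.1, (100, 3))])

    # Step 1: standardize; PCA handles the remaining steps internally.
    X_std = StandardScaler().fit_transform(X)
    pca = PCA(n_components=2)
    X_reduced = pca.fit_transform(X_std)

    print(X_reduced.shape)                # (100, 2)
    print(pca.explained_variance_ratio_)  # variance explained per component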

Principal Component Analysis (PCA)

[Figure: illustration of data projected onto its principal components]
Feature Extraction
2. t-SNE (t-Distributed Stochastic Neighbor Embedding)
• t-SNE is a non-linear dimensionality reduction technique used
primarily for visualizing high-dimensional data in 2D or 3D.

How It Works (a code sketch follows the list):
i. Models pairwise similarities between points in high-dimensional
and low-dimensional spaces.
ii. Optimizes the low-dimensional representation to preserve local
relationships.
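
A minimal sketch, assuming scikit-learn; the perplexity value below is
illustrative:

    import numpy as np
    from sklearn.manifold import TSNE

    # Synthetic high-dimensional data: two groups in 50 dimensions.
    rng = np.random.default_rng(2)
    X = np.vstack([rng.normal(0, 1, (100, 50)), rng.normal(4, 1, (100, 50))])

    # Embed into 2-D for visualization; perplexity balances local vs
    # global structure.
    X_2d = TSNE(n_components=2, perplexity=30, random_state=2).fit_transform(X)
    print(X_2d.shape)  # (200, 2)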

t-SNE (t-Distributed Stochastic Neighbor Embedding)

[Figure: 2-D t-SNE embedding of high-dimensional data]
Feature Extraction
3. Linear Discriminant Analysis (LDA)
• LDA is a dimensionality reduction technique that finds a linear
combination of features that best separates different classes. Note
that, unlike PCA and t-SNE, LDA uses class labels, making it a
supervised technique.

Steps (a code sketch follows the list):
i. Compute the mean vectors for each class.
ii. Compute the scatter matrices to measure class separability.
iii. Solve the eigenvalue problem to find the linear
discriminants.
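
A minimal sketch, assuming scikit-learn; note that fit_transform takes
the class labels y (the data is synthetic):

    import numpy as np
    from sklearn.discriminant_analysis import LinearDiscriminantAnalysis

    # Two Gaussian classes in 4 dimensions, with labels.
    rng = np.random.default_rng(3)
    X = np.vstack([rng.normal(0, 1, (50, 4)), rng.normal(2, 1, (50, 4))])
    y = np.array([0] * 50 + [1] * 50)

    # With two classes, LDA yields at most one discriminant direction.
    lda = LinearDiscriminantAnalysis(n_components=1)
    X_reduced = lda.fit_transform(X, y)
    print(X_reduced.shape)  # (100, 1)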

Linear Discriminant Analysis (LDA)

[Figure: illustration of data projected onto the linear discriminants]
Evaluation Metrics in Unsupervised Learning

1. Elbow Method
• Evaluates the sum of squared distances (inertia) between data points
and their respective cluster centroids for increasing values of k; the
"elbow" where the decrease flattens suggests a good number of clusters
(see the sketch after this list).
2. Davies-Bouldin Index (DBI)
• Measures cluster compactness and separation.
• Lower values indicate better-defined clusters.
3. Silhouette Score
• Measures how similar a data point is to its own cluster compared to
other clusters.
• Ranges from -1 (poor clustering) to 1 (well-clustered).
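
A minimal sketch computing these metrics with scikit-learn on synthetic
two-blob data (illustrative, not taken from the slides):

    import numpy as np
    from sklearn.cluster import KMeans
    from sklearn.metrics import silhouette_score, davies_bouldin_score

    rng = np.random.default_rng(4)
    X = np.vstack([rng.normal(0, 1, (50, 2)), rng.normal(6, 1, (50, 2))])

    # Elbow method: inspect inertia as k grows and look for the "elbow".
    for k in range(2, 6):
        km = KMeans(n_clusters=k, n_init=10, random_state=4).fit(X)
        print(k, round(km.inertia_, 1))

    # Silhouette (higher is better) and Davies-Bouldin (lower is better) for k = 2.
    labels = KMeans(n_clusters=2, n_init=10, random_state=4).fit_predict(X)
    print(silhouette_score(X, labels), davies_bouldin_score(X, labels))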
More details here: https://www.kdnuggets.com/2023/04/exploring-unsupervised-learning-metrics.html

Applications of Unsupervised Learning

1. Customer Segmentation
• Grouping customers based on purchasing behavior.
• Improves targeted marketing strategies.
2. Anomaly Detection
• Identifying fraud in transactions or unusual network activity in
cybersecurity.
3. Recommendation Systems
• Suggesting products to users based on clustering or latent features.
4. Bioinformatics
• Grouping genes with similar expressions or discovering subtypes of
diseases.

Limitations of Unsupervised Learning

i. Choosing the Number of Clusters (k):
– For clustering algorithms like k-means, deciding the optimal
number of clusters can be subjective.
– Assignment: Discuss three main approaches for choosing the
optimal value of k (the number of clusters).
ii. Interpretability:
– Results are harder to interpret since there are no labels to
validate the findings.
iii. No Ground Truth:
– Without labels, validating the quality of clusters or reduced
dimensions is challenging.

