Chapter 1.2. Overview of ML

Uploaded by Sơn Trịnh

Humanity – Service – Liberation

Introduction to Dimensionality
Reduction for Machine Learning
Machine Learning
Problems
• Data often have many features.
• Training becomes extremely slow.
• It is harder to find a good solution.
• This problem is often referred to as the curse of dimensionality.
• It is often possible to reduce the number of features considerably.

Techniques for Dimensionality Reduction

• Feature Selection Methods
• Matrix Factorization
• Manifold Learning
• Autoencoder Methods

What is Dimensionality Reduction?

• Dimensionality reduction means reducing the number of features.
• It is a way of converting a higher-dimensional dataset into a lower-dimensional one while ensuring that it still provides similar information.
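As a minimal sketch of this idea, assuming scikit-learn and NumPy are available, PCA can project a toy 10-feature dataset (random data invented here purely for illustration) down to 3 dimensions:

```python
import numpy as np
from sklearn.decomposition import PCA

# Hypothetical dataset: 100 samples, 10 features.
rng = np.random.RandomState(0)
X = rng.rand(100, 10)

# Convert the 10-dimensional dataset into a 3-dimensional one.
pca = PCA(n_components=3)
X_reduced = pca.fit_transform(X)

print(X.shape, "->", X_reduced.shape)
```

The reduced dataset keeps the directions of largest variance, so it still carries much of the information in the original features.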

Dimensionality Reduction Methods and Approaches

The Curse of Dimensionality
• Handling high-dimensional data is very difficult in practice; this problem is commonly known as the curse of dimensionality.
• As the dimensionality of the input dataset increases, any machine learning algorithm and model becomes more complex.

Why Dimensionality Reduction is Important
• Fewer features mean less complexity
• Less storage space because there are fewer data
• Fewer features require less computation time
• Model accuracy can improve because there is less misleading data
• Algorithms train faster
• Reducing the dataset's feature dimensions makes the data easier to visualize
• It removes noise and redundant features

Disadvantages of Dimensionality Reduction

• Some information may be lost due to dimensionality reduction.

• In the PCA dimensionality reduction technique, the number of principal components to keep is sometimes unknown in advance.
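One common way around not knowing the number of components in advance, sketched here with scikit-learn on made-up random data, is to pass a variance fraction instead of a component count, so PCA keeps just enough components to explain, say, 95% of the variance:

```python
import numpy as np
from sklearn.decomposition import PCA

# Hypothetical dataset: 200 samples, 20 features.
rng = np.random.RandomState(0)
X = rng.rand(200, 20)

# An n_components value between 0 and 1 is read as the fraction of
# variance to preserve; PCA then picks the component count itself.
pca = PCA(n_components=0.95)
X_reduced = pca.fit_transform(X)

print("components kept:", pca.n_components_)
print("variance explained:", pca.explained_variance_ratio_.sum())
```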

Approaches to Dimensionality Reduction

• Feature Selection
• Feature Extraction

Feature Selection
• Selecting the subset of relevant features and leaving out the irrelevant ones.

• It is a way of selecting the optimal features from the input dataset.

• The goal is to build a model of high accuracy.

Methods used for feature selection
• Filters Methods
• Correlation
• Chi-Square Test
• ANOVA
• Wrappers Methods
• Forward Selection
• Backward Selection
• Both-directional
• Embedded Methods:
• LASSO
• Elastic Net
• Ridge Regression
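As an illustration of a filter method, this sketch (assuming scikit-learn; the Iris dataset is used only as an example) scores each feature with the ANOVA F-test and keeps the two best:

```python
from sklearn.datasets import load_iris
from sklearn.feature_selection import SelectKBest, f_classif

X, y = load_iris(return_X_y=True)

# Filter method: rank features by their ANOVA F-score against the
# class labels, then keep the k highest-scoring ones.
selector = SelectKBest(score_func=f_classif, k=2)
X_selected = selector.fit_transform(X, y)

print(X.shape, "->", X_selected.shape)
print("kept feature indices:", selector.get_support(indices=True))
```

Filter methods like this score features independently of any model, which makes them fast; wrapper methods instead evaluate feature subsets by repeatedly training the model itself.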
Feature Extraction
• Feature extraction is the process of transforming the space containing
many dimensions into space with fewer dimensions.
• Feature extraction techniques:
• Principal Component Analysis
• Linear Discriminant Analysis
• Kernel PCA
• Quadratic Discriminant Analysis
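As a sketch of one of these techniques, Linear Discriminant Analysis in scikit-learn (shown here on the Iris dataset purely as an example) extracts at most n_classes - 1 new features that best separate the classes:

```python
from sklearn.datasets import load_iris
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis

X, y = load_iris(return_X_y=True)

# LDA is supervised: it uses the class labels to find projection
# directions that maximize class separation (at most n_classes - 1).
lda = LinearDiscriminantAnalysis(n_components=2)
X_lda = lda.fit_transform(X, y)

print(X.shape, "->", X_lda.shape)
```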

Principal Component Analysis (PCA)
• Principal Component Analysis is a statistical process that converts
observations of correlated features into a set of linearly uncorrelated
features.

• PCA works by considering the variance of each attribute, because a
high-variance attribute shows a good split between the classes; this is
how it reduces the dimensionality.
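Both points above can be checked directly in a small sketch (assuming scikit-learn and NumPy; the correlated data are invented for illustration): the principal components come out uncorrelated, and the first one captures most of the variance:

```python
import numpy as np
from sklearn.decomposition import PCA

rng = np.random.RandomState(42)
x1 = rng.rand(300)
# Feature 2 is strongly correlated with feature 1; feature 3 is independent.
X = np.column_stack([x1, 2 * x1 + 0.05 * rng.rand(300), rng.rand(300)])

pca = PCA(n_components=2)
X_new = pca.fit_transform(X)

# The principal components are linearly uncorrelated by construction.
corr = np.corrcoef(X_new.T)
print("explained variance ratio:", pca.explained_variance_ratio_)
print("|corr(PC1, PC2)| =", abs(corr[0, 1]))
```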

Backward Feature Elimination
• The backward feature elimination technique is mainly used while
developing a Linear Regression or Logistic Regression model.
• All n variables of the given dataset are taken to train the model.
• The performance of the model is checked.
• Remove one feature at a time and train the model on the remaining
n-1 features, n times, computing the performance of the model each time.
• Find the variable whose removal made the smallest (or no) change in
the performance of the model, and then drop that variable.
• Repeat the complete process until no feature can be dropped.
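The steps above can be sketched with scikit-learn's SequentialFeatureSelector, one possible implementation of this greedy procedure (note it measures performance with internal cross-validation rather than a single check; the Iris dataset and LogisticRegression are chosen only for illustration):

```python
from sklearn.datasets import load_iris
from sklearn.feature_selection import SequentialFeatureSelector
from sklearn.linear_model import LogisticRegression

X, y = load_iris(return_X_y=True)

# Start from all n features and repeatedly drop the feature whose
# removal hurts the cross-validated score the least.
model = LogisticRegression(max_iter=1000)
sfs = SequentialFeatureSelector(model, n_features_to_select=2,
                                direction="backward")
X_selected = sfs.fit_transform(X, y)

print("kept feature indices:", sfs.get_support(indices=True))
print(X.shape, "->", X_selected.shape)
```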

Forward Feature Selection
• Forward feature selection follows the inverse of the backward
elimination process.
• Find the best features that produce the highest increase in the
performance of the model.
• Start with a single feature only, and progressively add one feature
at a time.
• Train the model on each feature separately.
• The feature with the best performance is selected.
• The process is repeated until adding features no longer gives a
significant increase in the performance of the model.
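The same scikit-learn selector can run this forward procedure by flipping its direction (again only a sketch; the Iris dataset and LogisticRegression are stand-ins for illustration):

```python
from sklearn.datasets import load_iris
from sklearn.feature_selection import SequentialFeatureSelector
from sklearn.linear_model import LogisticRegression

X, y = load_iris(return_X_y=True)

# Start from an empty set and repeatedly add the feature that gives
# the biggest improvement in the cross-validated score.
model = LogisticRegression(max_iter=1000)
sfs = SequentialFeatureSelector(model, n_features_to_select=2,
                                direction="forward")
X_selected = sfs.fit_transform(X, y)

print("chosen feature indices:", sfs.get_support(indices=True))
print(X.shape, "->", X_selected.shape)
```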

• Example and Exercises:
https://github.com/ageron/handsonml2/blob/master/08_dimensionality_reduction.ipynb

• Demo