Linear Discriminant Analysis (LDA)
Module No. 2: Training Models / Regression and Classification
Linear Regression, Multivariate Regression, Subset Selection, Shrinkage Methods, Principal Component Regression, Partial Least Squares, Linear Classification, Logistic Regression, LDA, K-Nearest Neighbor Learning.
Linear Discriminant Analysis
• Linear discriminant analysis (LDA) is an approach used in supervised
machine learning to solve multi-class classification problems.
• LDA separates multiple classes with multiple features through data
dimensionality reduction.
• Linear discriminant analysis is also known as normal discriminant analysis (NDA) or discriminant function analysis (DFA).
• LDA works by identifying a linear combination of features that separates or characterizes two or more classes of objects or events (a usage sketch follows below).
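As a quick illustration, the sketch below applies scikit-learn's LinearDiscriminantAnalysis to a small two-class dataset, using it both as a classifier and as a supervised dimensionality-reduction step; the toy data simply reuses the worked example that appears later in this module, and the variable names are choices made for this sketch.

# Minimal sketch: LDA as a classifier and as supervised dimensionality reduction.
# Assumes scikit-learn is available; the data is the two-class example used later in the slides.
import numpy as np
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis

X = np.array([[4, 2], [2, 4], [2, 3], [3, 6], [4, 4],       # class 0 samples
              [9, 10], [6, 8], [9, 5], [8, 7], [10, 8]],    # class 1 samples
             dtype=float)
y = np.array([0, 0, 0, 0, 0, 1, 1, 1, 1, 1])

lda = LinearDiscriminantAnalysis(n_components=1)  # project onto a single discriminant axis
z = lda.fit_transform(X, y)                       # 1-D projection of every sample

print("projected samples:", z.ravel())
print("predicted labels:", lda.predict(X))

The direction found here should agree, up to sign and scaling, with the Fisher direction derived by hand in the slides that follow.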
LDA Objective
• The objective of LDA is to perform dimensionality reduction …
– So what, PCA does this …
• However, we want to preserve as much of the class
discriminatory information as possible.
– OK, that’s new, let’s delve deeper …
Recall … PCA
• In PCA, the main idea is to re-express the available dataset so as to extract the relevant information by reducing redundancy and minimizing noise.
• We didn’t care whether this dataset represents features from one or more classes, i.e. the discrimination power was not taken into consideration while we were talking about PCA.
• In PCA, we had a dataset matrix X of dimensions m×n, whose n columns are the data samples and whose rows correspond to the m features (each sample is an m-dimensional data vector).
• We first subtracted the mean to obtain a zero-mean dataset, then we computed the covariance matrix $S_x = XX^T$.
• Eigenvalues and eigenvectors were then computed for $S_x$. The new basis vectors are the eigenvectors with the highest eigenvalues, where the number of retained vectors was our choice.
• Thus, using the new basis, we can project the dataset onto a lower-dimensional space with a more powerful data representation (see the sketch below).
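For concreteness, here is a minimal sketch of the PCA recap above in NumPy, assuming the m×n layout described (one sample per column); the random data matrix is a placeholder, not part of the slides.

# Minimal PCA sketch following the recap above: zero-mean the data,
# form S_x = X X^T, and take the top eigenvectors as the new basis.
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(5, 100))           # m = 5 features, n = 100 samples (one sample per column)

X = X - X.mean(axis=1, keepdims=True)   # subtract the mean of each feature (zero-mean dataset)
S_x = X @ X.T                           # covariance matrix (up to a constant factor)

eigvals, eigvecs = np.linalg.eigh(S_x)  # S_x is symmetric; eigh returns ascending eigenvalues
order = np.argsort(eigvals)[::-1]       # strongest directions first
W = eigvecs[:, order[:2]]               # keep, say, the two eigenvectors with highest eigenvalues

Y = W.T @ X                             # project the dataset onto the lower-dimensional basis
print(Y.shape)                          # (2, 100)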
PCA vs LDA
LDA … Two Classes
• In order to find a good projection vector, we need to define a measure
of separation between the projections.
• The mean vector of each class in x and y feature space is:
$\mu_i = \dfrac{1}{N_i}\sum_{x \in \omega_i} x$  and  $\tilde{\mu}_i = \dfrac{1}{N_i}\sum_{y \in \omega_i} y = \dfrac{1}{N_i}\sum_{x \in \omega_i} w^T x = w^T \mu_i$
– i.e. projecting x to y will lead to projecting the mean of x to the mean of y.
• We could then choose the distance between the projected means as our
objective function
$J(w) = |\tilde{\mu}_1 - \tilde{\mu}_2| = |w^T(\mu_1 - \mu_2)|$
LDA … Two Classes
• However, the distance between the projected means is not a very good
measure since it does not take into account the standard deviation within
the classes.
[Figure: two candidate projection axes; one has a larger distance between the projected means, the other yields better class separability]
LDA … Two Classes
• The solution proposed by Fisher is to maximize a function that represents the difference between the means, normalized by a measure of the within-class variability, or the so-called scatter.
• For each class we define the scatter, an equivalent of the variance, as the sum of squared differences between the projected samples and their class mean:
$\tilde{s}_i^2 = \sum_{y \in \omega_i} (y - \tilde{\mu}_i)^2$
• $\tilde{s}_i^2$ measures the variability within class $\omega_i$ after projecting it onto the y-space.
• Thus $\tilde{s}_1^2 + \tilde{s}_2^2$ measures the variability within the two classes at hand after projection; hence it is called the within-class scatter of the projected samples.
LDA … Two Classes
• The Fisher linear discriminant is defined as the linear function $w^T x$ that maximizes the criterion function (the distance between the projected means normalized by the within-class scatter of the projected samples):
$J(w) = \dfrac{(\tilde{\mu}_1 - \tilde{\mu}_2)^2}{\tilde{s}_1^2 + \tilde{s}_2^2}$
• Therefore, we will be looking for a projection where examples from the same class are projected very close to each other and, at the same time, the projected means are as far apart as possible.
LDA … Two Classes
• In order to find the optimum projection w*, we need to express
J(w) as an explicit function of w.
• We will define measures of the scatter in the multivariate feature space x, denoted scatter matrices:
$S_i = \sum_{x \in \omega_i} (x - \mu_i)(x - \mu_i)^T$
$S_W = S_1 + S_2$
• Where $S_i$ is the scatter matrix of class $\omega_i$ (proportional to its covariance matrix), and $S_W$ is called the within-class scatter matrix (a small numeric sketch follows below).
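As a small sketch of these definitions (the helper name scatter is just a label chosen here), using the two-class data from the worked example later in this module:

# Sketch: within-class scatter matrices built directly from their definitions above.
import numpy as np

X1 = np.array([[4., 2.], [2., 4.], [2., 3.], [3., 6.], [4., 4.]])    # class omega_1, one sample per row
X2 = np.array([[9., 10.], [6., 8.], [9., 5.], [8., 7.], [10., 8.]])  # class omega_2

def scatter(Xc):
    # S_i = sum over the class samples of (x - mu_i)(x - mu_i)^T
    mu = Xc.mean(axis=0)
    D = Xc - mu
    return D.T @ D

S1, S2 = scatter(X1), scatter(X2)
S_W = S1 + S2          # within-class scatter matrix
print(S_W)

Note that this computes the raw scatter (a sum); the worked example below reports sample covariances, i.e. these matrices divided by N_i - 1 = 4, which only rescales S_W and does not change the LDA direction.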
LDA … Two Classes
• Now, the scatter of the projection y can be expressed as a function of the scatter matrices in feature space x:
$\tilde{s}_i^2 = \sum_{y \in \omega_i} (y - \tilde{\mu}_i)^2 = \sum_{x \in \omega_i} (w^T x - w^T \mu_i)^2 = \sum_{x \in \omega_i} w^T (x - \mu_i)(x - \mu_i)^T w = w^T S_i w$
$\tilde{s}_1^2 + \tilde{s}_2^2 = w^T S_1 w + w^T S_2 w = w^T (S_1 + S_2) w = w^T S_W w = \tilde{S}_W$
• Where $\tilde{S}_W$ is the within-class scatter of the projected samples y.
LDA … Two Classes
• Similarly, the difference between the projected means (in y-space) can be expressed in
terms of the means in the original feature space (x-space).
$(\tilde{\mu}_1 - \tilde{\mu}_2)^2 = (w^T \mu_1 - w^T \mu_2)^2 = w^T (\mu_1 - \mu_2)(\mu_1 - \mu_2)^T w = w^T S_B w$
• The matrix $S_B = (\mu_1 - \mu_2)(\mu_1 - \mu_2)^T$ is called the between-class scatter of the original samples/feature vectors, while $\tilde{S}_B = w^T S_B w$ is the between-class scatter of the projected samples y.
• Since SB is the outer product of two vectors, its rank is at most one.
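A quick numeric check of this rank remark, reusing the class means from the worked example later in this module:

# The between-class scatter is an outer product of one vector with itself,
# so its rank is at most one.
import numpy as np

d = np.array([3.0, 3.8]) - np.array([8.4, 7.6])   # mu_1 - mu_2 from the worked example
S_B = np.outer(d, d)
print(np.linalg.matrix_rank(S_B))                 # prints 1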
LDA … Two Classes
• We can finally express the Fisher criterion in terms of SW and SB
as:
$J(w) = \dfrac{(\tilde{\mu}_1 - \tilde{\mu}_2)^2}{\tilde{s}_1^2 + \tilde{s}_2^2} = \dfrac{w^T S_B w}{w^T S_W w}$
• Hence J(w) is a measure of the difference between the class means (encoded in the between-class scatter matrix) normalized by a measure of the within-class scatter.
LDA … Two Classes
• To find the maximum of J(w), we differentiate and equate to zero:
$\dfrac{d}{dw} J(w) = \dfrac{d}{dw}\left[\dfrac{w^T S_B w}{w^T S_W w}\right] = 0$
$\Rightarrow (w^T S_W w)\,\dfrac{d(w^T S_B w)}{dw} - (w^T S_B w)\,\dfrac{d(w^T S_W w)}{dw} = 0$
$\Rightarrow (w^T S_W w)\, 2 S_B w - (w^T S_B w)\, 2 S_W w = 0$
• Dividing by $2\, w^T S_W w$:
$\left(\dfrac{w^T S_W w}{w^T S_W w}\right) S_B w - \left(\dfrac{w^T S_B w}{w^T S_W w}\right) S_W w = 0$
$S_B w - J(w)\, S_W w = 0$
$S_W^{-1} S_B w = J(w)\, w$
LDA … Two Classes
• Solving the generalized eigenvalue problem
$S_W^{-1} S_B w = \lambda w$, where $\lambda = J(w)$ is a scalar,
yields
$w^* = \arg\max_w \dfrac{w^T S_B w}{w^T S_W w} = S_W^{-1}(\mu_1 - \mu_2)$
• This is known as Fisher’s Linear Discriminant, although it is not a discriminant
but rather a specific choice of direction for the projection of the data down
to one dimension.
• Using the same notation as PCA, the solution will be the eigenvector(s) of $S_X = S_W^{-1} S_B$ (a minimal implementation sketch follows below).
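Putting the two-class result together, here is a minimal sketch of Fisher's discriminant via the closed form w* = S_W^{-1}(mu_1 - mu_2); the function name fisher_lda and the one-sample-per-row input layout are choices made for this sketch, not part of the slides.

# Sketch of the two-class Fisher discriminant: w* = S_W^{-1} (mu_1 - mu_2).
import numpy as np

def fisher_lda(X1, X2):
    # X1, X2: arrays of shape (N_i, m), one sample per row.
    mu1, mu2 = X1.mean(axis=0), X2.mean(axis=0)
    S1 = (X1 - mu1).T @ (X1 - mu1)        # within-class scatter of class 1
    S2 = (X2 - mu2).T @ (X2 - mu2)        # within-class scatter of class 2
    S_W = S1 + S2
    w = np.linalg.solve(S_W, mu1 - mu2)   # S_W^{-1} (mu_1 - mu_2) without forming the inverse
    return w / np.linalg.norm(w)          # only the direction matters; scale does not change J(w)

# Projecting a data matrix X (one sample per row) onto the discriminant axis: y = X @ w.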
LDA … Two Classes - Example
• Compute the Linear Discriminant projection for the following two-dimensional dataset:
– Samples for class ω1: X1 = (x1, x2) = {(4,2), (2,4), (2,3), (3,6), (4,4)}
– Samples for class ω2: X2 = (x1, x2) = {(9,10), (6,8), (9,5), (8,7), (10,8)}
[Scatter plot of the two classes in the (x1, x2) plane]
LDA … Two Classes - Example
• The class means are:
$\mu_1 = \dfrac{1}{N_1}\sum_{x \in \omega_1} x = \dfrac{1}{5}\begin{bmatrix} 4+2+2+3+4 \\ 2+4+3+6+4 \end{bmatrix} = \begin{bmatrix} 3 \\ 3.8 \end{bmatrix}$
$\mu_2 = \dfrac{1}{N_2}\sum_{x \in \omega_2} x = \dfrac{1}{5}\begin{bmatrix} 9+6+9+8+10 \\ 10+8+5+7+8 \end{bmatrix} = \begin{bmatrix} 8.4 \\ 7.6 \end{bmatrix}$
LDA … Two Classes - Example
• Covariance matrix of the first class (the sample covariance, i.e. the scatter divided by $N_1 - 1 = 4$; this rescaling does not affect the LDA direction):
$S_1 = \dfrac{1}{N_1 - 1}\sum_{x \in \omega_1} (x - \mu_1)(x - \mu_1)^T = \begin{bmatrix} 1 & -0.25 \\ -0.25 & 2.2 \end{bmatrix}$
LDA … Two Classes - Example
• Covariance matrix of the second class:
$S_2 = \dfrac{1}{N_2 - 1}\sum_{x \in \omega_2} (x - \mu_2)(x - \mu_2)^T = \begin{bmatrix} 2.3 & -0.05 \\ -0.05 & 3.3 \end{bmatrix}$
LDA … Two Classes - Example
• Within-class scatter matrix:
$S_W = S_1 + S_2 = \begin{bmatrix} 1 & -0.25 \\ -0.25 & 2.2 \end{bmatrix} + \begin{bmatrix} 2.3 & -0.05 \\ -0.05 & 3.3 \end{bmatrix} = \begin{bmatrix} 3.3 & -0.3 \\ -0.3 & 5.5 \end{bmatrix}$
LDA … Two Classes - Example
• Between-class scatter matrix:
$S_B = (\mu_1 - \mu_2)(\mu_1 - \mu_2)^T = \begin{bmatrix} -5.4 \\ -3.8 \end{bmatrix}\begin{bmatrix} -5.4 & -3.8 \end{bmatrix} = \begin{bmatrix} 29.16 & 20.52 \\ 20.52 & 14.44 \end{bmatrix}$
LDA … Two Classes - Example
• The LDA projection is then obtained as the solution of the generalized eigenvalue problem:
$S_W^{-1} S_B w^* = \lambda w^*$
LDA … Two Classes - Example
• The optimal projection is the one that gives the maximum $\lambda = J(w)$ (a numeric check follows below).
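As a numeric check of the worked example, the sketch below solves the eigenvalue problem with NumPy; the printed values are approximate and computed from the data above.

# Numeric check of the worked example: build S_W and S_B from the data,
# then solve S_W^{-1} S_B w = lambda w.
import numpy as np

X1 = np.array([[4., 2.], [2., 4.], [2., 3.], [3., 6.], [4., 4.]])
X2 = np.array([[9., 10.], [6., 8.], [9., 5.], [8., 7.], [10., 8.]])

mu1, mu2 = X1.mean(axis=0), X2.mean(axis=0)
S1 = np.cov(X1, rowvar=False)             # sample covariance (divides by N - 1), as in the slides
S2 = np.cov(X2, rowvar=False)
S_W = S1 + S2
S_B = np.outer(mu1 - mu2, mu1 - mu2)

eigvals, eigvecs = np.linalg.eig(np.linalg.inv(S_W) @ S_B)
k = np.argmax(eigvals.real)               # S_B has rank 1, so only one eigenvalue is non-zero
lam, w = eigvals.real[k], eigvecs.real[:, k]

print(S_W)   # approximately [[3.3, -0.3], [-0.3, 5.5]]
print(lam)   # lambda = J(w*) is approximately 12.2
print(w)     # direction approximately +/- [0.91, 0.42]

The closed form w* = S_W^{-1}(mu_1 - mu_2) gives the same direction up to sign and scale.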
LDA - Projection