MAB 8102 Multivariate Analysis
PCA and FA Assignment - Group 1
Description of the Methods for Estimation in Factor Analysis Models, Including
the Advantages and Disadvantages of Each Method
Group Members
01 AWINO DIANA 2018/HD07/3889U 1800743646
02 NAKALANZI VICTORIA 2020/HD07/19442U 2000719442
03 THERESA NAMUYIGE 2020/HD07/19452U 2000719452
04 KIDUBULE IBRAHIM 2020/HD07/19448U 2000719448
Group contributions
Theresa Namuyige: Coordination; Principal Component Analysis; Principal Factor Analysis; organising the work, edits; Conclusion; PowerPoint presentation.
Nakalanzi Victoria: Principal Axis Factor Analysis; Maximum Likelihood Factor Analysis; organising the work, edits; Conclusion; PowerPoint presentation.
Ibrahim Kidubule: Introduction; Unweighted Least Squares Factor Analysis; Alpha Factor Analysis; Conclusion; PowerPoint presentation.
Awino Diana: Image Factor Analysis; Generalized Least Squares (GLS); organising the work, final edits; reading through the work.
Introduction
Factor analysis is a statistical technique commonly used for evaluating the strength of the relationship
between individual items of a scale and the underlying concept, assessing the content or construct
validity of an instrument, determining plausible structures underlying a set of variables, and combining a
set of variables into one composite score. It rests on several assumptions: a linear relationship between
variables and factors, no multicollinearity, inclusion of only relevant variables in the analysis, and a true
correlation between variables and factors. The purpose of this report is to describe the methods for
estimation in factor analysis models, together with the advantages and disadvantages of each method.
The available methods fall into two categories: classical models, which assume that all error is random,
and neoclassical models, which acknowledge both a random and a systematic error component
(Ferketich & Muller, 1990).
Methods of estimation
1. Classical Methods
1.1 Maximum Likelihood Method
Maximum likelihood (ML) estimation is popular for fitting factor analysis models, especially those with
restrictions on the parameters (Liu & Rubin). This method finds estimates of the factor loadings and
unique variances by maximizing the likelihood function associated with the multivariate normal model,
assuming the data are independently sampled from a multivariate normal distribution with mean vector
μ and variance-covariance matrix of the form Σ = LL' + Ψ, where L is the matrix of factor loadings and Ψ
is the diagonal matrix of specific variances. The MLE procedure estimates μ, the matrix of factor loadings
L, and the specific variances Ψ from the log-likelihood function

ℓ(μ, L, Ψ) = -(np/2) ln(2π) - (n/2) ln|LL' + Ψ| - (1/2) Σᵢ₌₁ⁿ (xᵢ - μ)' (LL' + Ψ)⁻¹ (xᵢ - μ).

Maximizing this log-likelihood yields the maximum likelihood estimators of μ, L, and Ψ.
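As a concrete illustration, the sketch below fits Σ = LL' + Ψ by maximum likelihood using scikit-learn's FactorAnalysis class. The simulated data, the choice of two factors, and all variable names are assumptions of this example, not part of the sources cited above.

```python
import numpy as np
from sklearn.decomposition import FactorAnalysis

# Simulate data from a two-factor model (illustrative choices only).
rng = np.random.default_rng(0)
n, p, k = 500, 6, 2
L_true = rng.normal(size=(p, k))              # true loadings
psi_true = rng.uniform(0.2, 0.8, size=p)      # true specific variances
F = rng.normal(size=(n, k))                   # latent factor scores
X = F @ L_true.T + rng.normal(size=(n, p)) * np.sqrt(psi_true)

# Fit the model Sigma = LL' + Psi by maximum likelihood.
fa = FactorAnalysis(n_components=k).fit(X)
L_hat = fa.components_.T                      # estimated loadings (p x k)
psi_hat = fa.noise_variance_                  # estimated specific variances
print("average log-likelihood:", fa.score(X)) # value of the maximized criterion
```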
Advantages
1. Maximum likelihood provides a consistent approach to parameter estimation problems.
2. Several popular statistical software packages provide excellent algorithms for MLE. This helps
mitigate the computational complexity of maximum likelihood estimation.
Disadvantages
1. Maximum likelihood estimates can be heavily biased for small samples.
2. The likelihood equations need to be specifically worked out for a given distribution and
estimation problem. The mathematics is often non-trivial, particularly if confidence intervals for
the parameters are desired.
1.2 Principal Component Factor Analysis
Principal components extracts "real" factors in the sample, those that arise from optimization on the
data at hand. Because these factors are "real" rather than hypothetical, some sources refer to them as
components or component factors rather than as factors (Kim & Mueller, 1978). The PC method assumes
that there is no unique variance, so the total variance equals the common variance. It is based on an
approximation Q̂ of Q, the factor loadings matrix. The sample covariance matrix is diagonalized, S = ΓΛΓ',
and the first k eigenvectors are retained to build

Q̂ = (√λ₁ γ₁, ..., √λₖ γₖ).
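A short numpy sketch of this construction (our own illustration; the data and names are arbitrary): diagonalize S and scale the first k eigenvectors by the square roots of their eigenvalues to obtain the loadings.

```python
import numpy as np

def pc_factor_loadings(X, k):
    # Q_hat = (sqrt(lambda_1) gamma_1, ..., sqrt(lambda_k) gamma_k)
    S = np.cov(X, rowvar=False)              # sample covariance matrix
    vals, vecs = np.linalg.eigh(S)           # S = Gamma Lambda Gamma'
    order = np.argsort(vals)[::-1]           # eigenvalues in descending order
    vals, vecs = vals[order], vecs[:, order]
    return vecs[:, :k] * np.sqrt(np.clip(vals[:k], 0, None))

rng = np.random.default_rng(1)
X = rng.normal(size=(200, 5))                # toy data
Q_hat = pc_factor_loadings(X, k=2)           # p x k loading matrix
```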
Advantages
1. It works best when the variance is distributed uniformly across all variables and samples.
2. The first principal component factor explains more variance than any other factor.
Disadvantages
1. The extracted components are less interpretable, since principal components are not as readable as
the original features.
2. PCF analysis is affected by scale, so the data must be standardized before the analysis is carried out.
2. Neoclassical Methods
2.1 Principal Factor Method (Common Factor Analysis)
This method seeks the fewest factors which can account for the common variance (correlation) of a set
of variables. The difficulty is that common factors can only be computed as combinations of the
"common parts" of the variables, and each person's score on each variable cannot be separated into its
"common" and "unique" parts, so common factor scores have to be estimated. The principal factor
method involves finding an approximation Ψ̂ of Ψ, the matrix of specific variances, and then correcting R,
the correlation matrix of X, by Ψ̂; the factors are then extracted from the reduced matrix R - Ψ̂.
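A minimal sketch of one common variant of this idea (our illustration; using squared multiple correlations as the communality estimates on the diagonal is our assumption):

```python
import numpy as np

def principal_factor(R, k):
    # Estimate communalities by squared multiple correlations: 1 - 1/diag(R^-1).
    smc = 1.0 - 1.0 / np.diag(np.linalg.inv(R))
    R_reduced = R.copy()
    np.fill_diagonal(R_reduced, smc)         # correct R by Psi_hat on the diagonal
    vals, vecs = np.linalg.eigh(R_reduced)
    idx = np.argsort(vals)[::-1][:k]         # keep the k largest eigenvalues
    return vecs[:, idx] * np.sqrt(np.clip(vals[idx], 0, None))
```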
Advantages
1. By condensing many variables into a few common factors, the method helps to overcome overfitting.
Disadvantages
1. The variables must be standardized (equivalently, a correlation matrix must be analyzed); otherwise
the method will not find the optimal factors.
2.2 Principal Axis Factor Analysis
This method seeks the least number of factors that can account for the common variance of a set of
variables. Though principal axis factor analysis uses the principal component analysis strategy, it applies
it to a different version of the correlation matrix: because the analysis of the data structure focuses on
common variance rather than on sources of error that are specific to individual measurements, the
correlation matrix has estimates of the communalities as its diagonal entries. Allowing for specific
variance, the correlation matrix is modeled as ρ = LL' + Ψ, where Ψ is a diagonal matrix of specific
variances, so the specific variances are obtained as Ψ = ρ - LL'. The i-th diagonal entry of Ψ is estimated
as ψ̂ᵢ = 1 - ĥᵢ², where ĥᵢ² is the estimated communality of variable i, typically initialized by its squared
multiple correlation with the remaining variables and updated iteratively.
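The iteration can be sketched as follows (our illustration; the SMC starting values, iteration cap, and tolerance are arbitrary choices):

```python
import numpy as np

def paf(R, k, n_iter=100, tol=1e-6):
    h2 = 1.0 - 1.0 / np.diag(np.linalg.inv(R))   # starting communalities (SMC)
    for _ in range(n_iter):
        R_red = R.copy()
        np.fill_diagonal(R_red, h2)              # communalities on the diagonal
        vals, vecs = np.linalg.eigh(R_red)
        idx = np.argsort(vals)[::-1][:k]
        L = vecs[:, idx] * np.sqrt(np.clip(vals[idx], 0, None))
        h2_new = (L ** 2).sum(axis=1)            # updated communalities
        if np.max(np.abs(h2_new - h2)) < tol:    # stop when communalities settle
            h2 = h2_new
            break
        h2 = h2_new
    return L, 1.0 - h2                           # loadings and specific variances
```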
Advantages
1. PAF performs well with small numbers of variables and small sample sizes.
2. The PAF approach does not require specific assumptions about the data, such as multivariate
normality, which is often violated in practice.
Disadvantages
1. This factoring approach does not provide fit statistics for the tested model, such as a chi-square test.
2.3 Alpha Factoring Method
This technique is based on maximizing the reliability of the factors, assuming that the variables are
randomly sampled from a very large universe of variables. It determines the factors which have
maximum generalizability in the Kuder-Richardson sense. Alpha factor analysis (AFA) has the property of
giving the same factors regardless of the units of measurement of the observed variables, and it
operates in the metric of the common parts of the variables. In establishing the number of factors, AFA
retains only those alpha factors with positive generalizability. The method is derived by finding
uncorrelated common factors Xₛ (s = 1, 2, ..., q), each of which successively has maximum
generalizability in the coefficient-alpha sense, a psychometric measure of reliability in the generalized
Kuder-Richardson sense (McDonald, 1970).
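A rough sketch of the Kaiser-Caffrey style iteration (our reading of the method; the rescaling into the metric of the common parts follows the description above, while the SMC starting values and tolerance are assumptions of this illustration):

```python
import numpy as np

def alpha_factoring(R, k, n_iter=100, tol=1e-6):
    h2 = 1.0 - 1.0 / np.diag(np.linalg.inv(R))   # starting communalities (SMC)
    for _ in range(n_iter):
        R_red = R.copy()
        np.fill_diagonal(R_red, h2)
        h = np.sqrt(h2)
        R_star = R_red / np.outer(h, h)          # metric of the common parts
        vals, vecs = np.linalg.eigh(R_star)
        idx = np.argsort(vals)[::-1][:k]         # eigenvalues > 1 have positive alpha
        L = h[:, None] * vecs[:, idx] * np.sqrt(np.clip(vals[idx], 0, None))
        h2_new = (L ** 2).sum(axis=1)            # back-transformed communalities
        if np.max(np.abs(h2_new - h2)) < tol:
            break
        h2 = h2_new
    return L
```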
Advantages
1. No distributional assumptions.
2. Error is attributed to the sampling of variables, not of individuals.
3. Emphasizes low (<0.40) communalities.
4. Always converges.
Disadvantages
1. Unlike other methods, this method assumes fixed cases and randomly sampled variables, rather than
sampled cases and fixed variables.
2.4 Unweighted Least Squares (ULS) Factoring
This technique minimizes the sum of squared differences between the observed and estimated
correlation matrices, not counting the diagonal. Ordinary or unweighted least squares (ULS) directly
minimizes the residuals between the input correlation matrix and the correlation matrix reproduced by
the factors (the diagonal elements, as sums of communality and uniqueness, are made to reproduce the
1s). The ULS method can work with a singular, or even non-positive-semidefinite, correlation matrix,
provided the number of factors is less than its rank, although it is questionable whether factor analysis is
theoretically appropriate in that case. Like PAF, ULS involves iterative eigendecomposition of the reduced
correlation matrix, but within a more complex Newton-Raphson optimization procedure that seeks the
unique variances at which the correlations are reconstructed as closely as possible.
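A compact sketch of this objective (our illustration; it uses SciPy's general-purpose L-BFGS-B optimizer rather than the Newton-Raphson procedure described above):

```python
import numpy as np
from scipy.optimize import minimize

def uls_factor(R, k):
    p = R.shape[0]
    off = ~np.eye(p, dtype=bool)                 # off-diagonal mask

    def loadings(psi):
        # Factor the reduced matrix R - Psi for the current uniquenesses.
        vals, vecs = np.linalg.eigh(R - np.diag(psi))
        idx = np.argsort(vals)[::-1][:k]
        return vecs[:, idx] * np.sqrt(np.clip(vals[idx], 0, None))

    def objective(psi):
        L = loadings(psi)
        resid = R - L @ L.T                      # residual correlations
        return np.sum(resid[off] ** 2)           # diagonal not counted

    psi0 = 1.0 / np.diag(np.linalg.inv(R))       # initial uniquenesses (1 - SMC)
    res = minimize(objective, psi0, method="L-BFGS-B",
                   bounds=[(1e-4, 1.0)] * p)
    return loadings(res.x), res.x                # loadings and unique variances
```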
Advantages:
1. Provides more accurate and less variable parameter estimates.
2. Gives more precise standard errors and better coverage rates.
Disadvantage:
Least squares provides best linear unbiased estimators only if the response really does have a linear
relationship with the predictors. If it does not, the least squares estimates are biased, regardless of how
much data you have.
2.5 Image Factor Analysis
Guttman's image theory is based on the assumption that each variable in the set of variables in the
domain of the characteristics of interest can be split into two parts: the "image" of the variable (the
"common part") and the "anti-image" of the variable (the "unique part"). The method is based on the
correlation matrix, and ordinary least squares regression is used to predict each variable from the
others. The image covariance matrix analyzed in image factor analysis is given by

G = R + S R⁻¹ S - 2S, with S = (Diag[R⁻¹])⁻¹,

where Diag[X] stands for a diagonal matrix formed from the diagonal elements of a square matrix X.
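A minimal numpy sketch (our illustration, assuming the image covariance formula given above):

```python
import numpy as np

def image_factor(R, k):
    R_inv = np.linalg.inv(R)
    S = np.diag(1.0 / np.diag(R_inv))            # S = (Diag[R^-1])^-1
    G = R + S @ R_inv @ S - 2.0 * S              # Guttman's image covariance matrix
    vals, vecs = np.linalg.eigh(G)
    idx = np.argsort(vals)[::-1][:k]             # factor the image covariance
    return vecs[:, idx] * np.sqrt(np.clip(vals[idx], 0, None))
```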
Advantages
1. With image factor analysis, more factors can be extracted without yielding a perfect fit to the
observed data, unlike in traditional factor analytic models.
2. No distributional assumptions.
Disadvantages
1. It has little advantage over Principal Axis factoring.
2.6 Generalized Least Squares (GLS)
The generalized least squares estimator of the parameter vector γ is the value γ̂ that minimizes the
residual quadratic form

[s - σ(γ)]' D̂ [s - σ(γ)],

where s is the vector of sample covariances, σ(γ) is the vector of covariances implied by the model, and
D̂ is either a positive definite random matrix which converges in probability to a positive definite matrix
D as n approaches infinity, or a positive definite constant matrix (D̂ = D). The estimator can be computed
by defining a matrix T̂ such that D̂ = T̂'T̂ and using a nonlinear least squares program with T̂s as the
dependent variable (Dahm & Fuller).
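For illustration, the sketch below minimizes the GLS discrepancy function commonly used in factor analysis software, F = 0.5 tr[(I - S⁻¹Σ(θ))²]; this is our simplification for the factor model, not the exact errors-in-variables procedure of Dahm and Fuller.

```python
import numpy as np
from scipy.optimize import minimize

def gls_factor(S, k):
    p = S.shape[0]
    S_inv = np.linalg.inv(S)
    I = np.eye(p)

    def unpack(theta):
        L = theta[:p * k].reshape(p, k)          # free loadings
        psi = np.exp(theta[p * k:])              # log-scale keeps Psi positive
        return L, psi

    def objective(theta):
        L, psi = unpack(theta)
        Sigma = L @ L.T + np.diag(psi)           # model-implied covariance
        M = I - S_inv @ Sigma
        return 0.5 * np.trace(M @ M)             # GLS discrepancy

    theta0 = np.concatenate([np.full(p * k, 0.3), np.log(np.full(p, 0.5))])
    res = minimize(objective, theta0, method="L-BFGS-B")
    return unpack(res.x)                         # loadings and specific variances
```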
Advantages
1. GLS provides a chi-squared goodness-of-fit test to guide the choice of the number of factors.
Disadvantages
1. Requires a large sample.
Conclusions
An article by Fabrigar, Wegener, MacCallum, and Strahan (1999) argued that if the data are relatively
normally distributed, maximum likelihood is the best choice because it "allows for the computation of a
wide range of indexes of the goodness of fit of the model [and] permits statistical significance testing of
factor loadings and correlations among factors and the computation of confidence intervals." If the
assumption of multivariate normality is "severely violated", they recommend one of the principal factor
methods, such as principal axis factoring (Fabrigar et al., 1999). Other authors have argued that in
specialized cases, or for particular applications, other extraction techniques (e.g., alpha extraction) are
most appropriate, but the evidence of advantage is slim. In general, ML will give the best results when
the data are approximately normally distributed, and PAF when they are markedly non-normal.
References
1. Minitab 18 Support. Methods and formulas for factor analysis. https://support.minitab.com/en-us/minitab/18/help-and-how-to/modeling-statistics/multivariate/how-to/factor-analysis/methods-and-formulas/methods-and-formulas/
2. Onyekachi Akuoma Mabel and Olanrewaju Samuel Olayemi. A Comparison of Principal Component Analysis, Maximum Likelihood and the Principal Axis in Factor Analysis.
3. Fabrigar, L. R., Wegener, D. T., MacCallum, R. C., & Strahan, E. J. (1999). Evaluating the use of exploratory factor analysis in psychological research. Psychological Methods, 4(3), 272-299.
4. Liu, Chuanhai, & Rubin, Donald B. Maximum likelihood estimation of factor analysis using the ECME algorithm with complete and incomplete data.
5. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3732178/
6. https://www.statisticssolutions.com/free-resources/directory-of-statistical-analyses/factor-analysis/
7. https://www.statisticssolutions.com/exploratory-factor-analysis/
8. https://www.koreascience.or.kr/article/JAKO201915658233382.pdf
9. https://conservancy.umn.edu/bitstream/handle/11299/107735/v14n1p029.pdf?sequence=1&isAllowed=y
10. itl.nist.gov/div898/handbook/eda/section3/eda3652.htm
11. Skerman, H. M., Yates, P. M., & Battistutta, D. Identification of cancer-related symptom clusters: An empirical comparison of exploratory factor analysis methods. Institute of Health and Biomedical Innovation, School of Nursing, and School of Public Health, Queensland University of Technology, Brisbane, Queensland, Australia.
12. Dahm, P. F., & Fuller, W. A. Generalized least squares estimation of the functional multivariate linear errors-in-variables model. Texas A&M University and Iowa State University.