BF02289845
BF02289845
3
SEPTEMBER, 1959
W. A. GIBsoNt
PERSONNEL RESEARCH BRANCH, THE ADJUTANT GENERAL'S OFFICE
sample may be considered representative is not taken up for any of the three
models.
Four examples of the latent profile model are given. Two of them are
fictitious but their form closely parallels the corresponding two empirical
examples. These examples are discussed from the viewpoint of the further
work they suggest concerning the metric and dimensionality of the latent
space. The use of higher order empirical relations for testing fit and for
further particularization of the latent profile model is briefly discussed, and
the possibility of close parallelism between developments in the latent
structure and latent profile models is pointed out. The paper concludes with
a plea for continued flexibility in the choice of multivariate models as new
and improved ones appear.
deal with those elements in the correlation matrix that have repeated sub-
scripts--the diagonal ri/s. To take these as perfect self-correlations of unity
would amount to trying to analyze all of the variance of every test, including
the unreliable part. To insert the test reliabilities into the diagonal cells would
imply an interest in analyzing all of the "true" or repeatable variance of the
tests, including that specific to each test and unrelated to other tests in the
battery. More commonly preferred among factor analysts is to attempt to
analyze only that part of a test's variance---its communality--that is shared
with other tests in the battery. Communalities could be defined alternatively
as those portions of the self-correlations accounted for by the factors that
suffice to account for the between-test correlations. In any event, the com-
munalities are not empirically given and yet are needed at the start for maxi-
mum efficiency of solution. In principle the communalities could be determined
by certain operations (cf. [18], pp. 294-307) applied to the empirically given
side entries in the correlation matrix, but these operations are usually so
time consuming that a successive approximations approach is often sub-
stituted. Rough communality estimates are used to obtain an initial factorial
solution, which in turn provides improved estimates for a second cycle of
the same kind, and so on until the communalities are snfficiently "stabilized."
It fortunately happens that with large test batteries little or no iterating
of this kind is needed, but for small batteries several cycles may be required
before the communality problem is adequately resolved.
A second and more important problem in factor analysis is the inherent
partial indeterminacy of the a's that is known as the rotational problem. It
is most easily understood in terms of q-dimensional geometry. The a's for
any test j may be thought of as the projections of a point j on a set of co-
ordinate axes in q-space. The table of factor loadings for the s tests then
defines a configuration of s points in terms of their projections on a q-di-
mensional reference frame. But the equations of factor analysis, by themselves,
do not indicate which position of the reference frame, among an infinite
number of possibilities in the same q-space, is to be preferred. Only the origin
of the coordinate system is fixed, so that the reference frame can be rotated
freely from any position to any other without distortion of the configuration
of points defined by it. Naturally the a's change with such rotations, but
always in such a way as to preserve the spatial interrelations among the
points they define. Many ways of resolving this rotational indeterminacy
have been proposed. Probably the most notable is the simple structure
principle ([18], pp. 181-193), which strives to simplify the factorial structure
of the tests by maximizing the number of near-zero factor loadings. Many
of these proposals (especially those centering around the simple structure
principle) involve heavy computing loads, relatively rare geometric intuition,
or both; many are debated or debatable; all are in the nature of afterthoughts
that are not built into the equations defining the model.
232 PSYCHOMETRIKA
The quantities on the left are empirically given or manifest data. They
indicate the number of people in the entire sample, n, the number endorsing
a single item, n, , the number endorsing any pair of items, n,~ , and so on.
The quantities on the right are the underlying or latent parameters of the
model. The number of terms on the right is q, the number of mutually exclu-
sive and exhaustive subgroups (latent classes) into which the analysis will
divide the total sample. The number of people in latent class 1 is n~ , and
so on. The latent probability Pli is the proportion of the members of latent
class 1 who endorse item j, Pl~'~ is the proportion of class 1 members who
endorse both items j and k, and so on. Equations (3) merely show how the
man~fest joint endorsements are recruited from the latent classes. It may
be noted, in passing, that equations (3), being a set of recruitment equations,
are intrinsically linear, while the initial equation of factor analysis is linear
only because it was made so.
w . A. ~ m S O ~ 233
The substitution of (4) into (3) yields the basic equations of latent
structure analysis ([15], p. 385).
n = n~ + n2 + ... + n, ,
Thus all of the manifest joint frequencies are accounted for in terms of
(q A - s q ) latent parameters, q class sizes and q latent probabilities (Pl~ ,
p~ , -. • , p,~) for each of the s items. The successive levels of manifest fre-
quencies (n, n~ , ni~ , etc.) number, respectively, 1, s, s ( s - - 1 ) / 2 , etc., the
coefficients in the binomial expansion (a -t- b)L These add up to 2", the
number of equations relating manifest to latent data in this model.
The task here, as in factor analysis, is to solve the basic equations for
the unknown latent parameters. Several latent structure solutions already
exist. The most recent of these [1, 7] avoid the use of any joint frequencies
with repeated subscripts (n~.~ , n~,.~ , n;~'i , n;~k, , etc.). In latent structure
analysis these are treated as analogous to the communalities of factor analysis,
in not being manifest. To interpret them as equivalent to the corresponding
234 PSYCH01~IETRIKA.
+ + ... + etc.
All summations in (6) are over individuals. The summations on the left
are over the entire sample, while those on the right are over the members of
the various subgroups. X , is the, score of individual i on test j, and it m a y be
in raw, deviational, standard, or a n y other derived units. The same is true
of X ~ , X ~ , etc.
I t is convenient to designate the various summations in (6) b y the
letter m with appropriate subscripts. Thus mi and m;k represent, respectively,
the sum of scores on test j and the sum of products of scores on tests j ahd J k,
each sum being for the entire sample, ml~ and mxi~ , on the other hand,
stand for the same things in subgroup 1 only, and so on. Equations (6)!then
become
n=nl + n ~ + "'" + n ~ ,
Xk Xtj
dikll II
X t~
dij
0 -- Xi
FIGURE 1
A Hypothetical Scatter Diagram
tests j and k is zero. The two lines labeled X , and Xtk in Figure 1 indicate
the means of subgroup t on tests j and k. They intersect in what is called the
centroid of the points comprising subgroup t. The point corresponding to
individual i , a member of subgroup t, is shown with the coordinates
( X , , X~k), his scores on the two tests. The two distances, d , and d~k , are
the deviations of individual i from the j and k means of his subgroup. A
property of such deviations is that their sum, over all members of the sub-
group, is zero.
W. A. GIBSON 237
(10) ms; = k X .
= k ( X , ; + d.)
= ~,X,,+ 5~d,
= ntXti •
The first term in the third line simplifies because X , is the same for every
member of the subgroup. The last term in the third line vanishes because it is
a sum of deviation scores. Thus the j scores for any subgroup t contribute
to the sum of all j scores as il the members of the subgroup were concentrated
at their mean for test j. This result (by no means a new one) will next be
generalized, in an appropriate way, to the second-order product moment,
m , ~ , for subgroup t.
B y definition and with the help of (8) and (9), *n,~ becomes
(11) m,i~ = z ~ X , X i ~
= k<<
= n~XtiXt~ .
T h e first term in the fourth line of (11) simplifies because X , and X,k are
the same for all members of the subgroup. The second and third terms in t h a t
line vanish because they contain sums of deviations. The last term vanishes
because it is the numerator of the formula for the correlation, within the
subgroup, between tests j and k. The subgroup was earlier defined as having
t h a t correlation equal to zero. Thus the second-order product moment,
m , ~ , is the same as it would be if all members of subgroup t were concentrated
a t their centroid. This holds for any subgroup within which tests j and k are
uncorrelated.
A further distinction in terminology needs to be made here. The quantity
~38 I~SYCHOMETRIKA
m,i~ is a sum of products of horizontal and vertical distances from the origin
in Figure 1. It will therefore be called a product moment about the origin.
Another kind of product moment in (11) is the last term in line 4, the sum
of products of horizontM and vertical distances of a set of points from their
own centroid. Such a quantity, is appropriately called a product moment
about the centroid of the set of points that is involved. The geometric interpre-
tation of scores and deviations as distances makes it clear that the point of
reference (origin, centroid, or some other point) of a product moment must
Mways be specified in some way. Naturally this holds for all orders of product
moments. Up to now, with four exceptions, all product moments have had
the origin as their point of reference. The four exceptions are the last term
in line 3 of (10) and the summations in the last three terms of line 4 in (11),
which have the centroid of subgroup t as their point of reference.
Theresults of (10) and (11) can be generalized to higher order product
moments by imposing additional restrictions on subgroup t. Not only must
that subgroup be defined as having all pairs of tests j,/c, and l uncorrelated
within it, but, let it also have, for those three tests, a vanishing third-order
product moment .about its centroid. With these restrictions its third-order
product moment about the origin, mr;k, , becomes
(12) m,~ = n~X,X,~X, .
This result is, in form and mode of development, a third-order analogue of
the final step in the previous two equations. The fourth-order equivalent is
obtained by anMogous higher order restrictions, and so on.
Since in all of this discussion the origin could have been placed at any
point 0, it now becomes possible to state the fundamental theorem of latent
profile analysis.
The g-order product moment, about any point 0, of n, points having
zero product moments of order g and less about their centroid, is
equal to the g-order product moment, about the point 0, of n~ points
placed at that centroid.
The Basic Equations o] Latent Profile Analysis
The foregoing theorem provides a basis for grouping in the recruitment
equations introduced previously. The close analogy with latent structure
analysis will be obvious. Each subgroup or latent class should be homogeneous
in whatever underlying dimensions are necessary to account for the observed
interrelations. The homogeneity need not be complete, so long as deviations
from the class averages are random, i.e., independent.
In the statistics of dichotomous attributes (as employed in coin-tossing
experiments, for example)., the notion of independence has usuMly applied
to all orders of joint occurrence, and not just to pairs of events. This is the
case in latent structure analysis, where within-class independence is defined
w . A. Q m s o ~ 239
as pertaining not only to all pairs of items but also to all triplets, all quad-
ruplets, and so on. (It is easily shown by example that higher order independ-
ence among dichotomous attributes is not a mere logical consequence of
uneorrelatedness between all pairs of such attributes.)
The concept of uncorretatedness among quantitative measures, on the
other hand, ha.s more often been restricted, at least among psychometricians,
to pairs of such measures. (A statistically oriented prepublieation reviewer
has pointed out that the two kinds of independence being discussed here are
known to statisticians as pair-wise and mutual independence.) This is not an
intrinsic or logical difference between qualitative and quantitative statistics.
It is rather only a historical accident that the question of higher order in-
dependence among quantitative measures has less often arisen in psycho-
metrics. That question arises here, for it turns out that the proper definition
of such independence is crucial for this model.
In the previous section the within-class uncorrelatedness between pairs
of tests was shown to be synonymous with vanishing second-order product
moments about tile eentroid of the class. This is because such product
moments are the numerators of the formulas for the corresponding correla-
tions. Purely by analogy, higher order within-class independence may be
equated with the vanishing of higher order product moments about the
centroid of the class. (The failure of such.product moments to vanish would
allow, for example, a positive correlation between tests j and h among class
members with high scores on test l, accompanied by a compensating negative
correlation between the same two tests among class members having low
scores on test 1. This could happen in spite of zero correlations between all
pairs of the three tests within the class as a whole. If correlational patterns
can differ within subdivisions of a class, then the class is not homogeneous
even from a commonsense point of view.) Therefore let the within-class
independence of the present model be defined as applying to all orders of
interrelations, and as expressing itself in the vanishing of product moments
of all orders about the eentroid of the class. Then the fundamental theorem
applies with full force to the product moments of each class, so that the results
of equations (10), (11), and (12) can be used to transform (7) into the basic
equations of latent profile analysis:
Thus for s tests the 2 ~manifest product moments (including n) are accounted
for in terms of (q + sq) latent parameters--the q latent class sizes and the
240 PSYCHOMETRIKA
q class averages for each of the s tests. Each latent class is therefore character-
ized by its size and its profile of s test averages, its latent profile.
The latent profile equations and those of latent structure analysis turn
out to be identical in form, as can be seen from a comparison of (5) and (13).
This means that all Mgebraic latent structure solutions [1, 7, 11] are directly
applicable to the latent profile equations. Hence the latent profile equations
have a solution that can be obtained without the involvement of eommunality
analogues (m~i , m i l k , m l i i , etc.), and that is, in general, rotationally unique.
(A conversation with Robert P. Abelson at Yale University has clarified the
fact that, when deviational scores are used, the m~ are the between-groups
variances.) Nor do the latent profile equations restrict the occurrence of
curvilinear relations among tests or between tests and underlying dimensions.
Thus the dilemma of difficulty factors is avoided in this model.
1 = Pl + P 2 + "'" + P ~ ,
0 = p l Z l l -4- p~Z2~ 4- " . + p,Z~ ,
(14)
r~k = piZliZlk + p2Z2iZ2~ + "'" + PQZaiZq~ ,
r~k, = p,ZliZIkZ1, + p~Z~Z:~Z2z + . . . + poZqiZ~Z~, , etc.
The p's are the proportionate class sizes, and of course their sum is unity.
The Z's are the average standard scores of classes on tests. Their weighted
sum for any test (in the second line) vanishes because it is the mean of all
standard scores on that test. The r~-, , being average products of pairs of
standard scores, are the same correlations for which factor analysis attempts
to account. The r;~.., are, analogously, average triple products, and so on.
w. A. Gmso~ 241
The latent profile equations in standard form have the advantage of dealing
with magnitudes that are independent of sample size and of arbitrary score
units. It is in standard form that the equations will be applied to the examples
in the next four sections.
R for a Fictitious Two-Class Case R I for a Fictitious Two-Class Case Latent Profile Solution for
a Fictitious Two-Class Case
3 .oo .5o .90 .9o .90 3 2.00 .00 .00 .00 .O0 -.71 .71
4 .oo .9o .9o .5o .90 4 2.00 .00 .00 .00 .00 Class Sizes .90 .90
co
TABLE 4 TABLE 9
however, the comparison will be made when possible in order to appraise the
adequacy of the solution.
The latent profile solution for the present example, obtained by applying
the algebra of the latent structure solution of Green Ill], is shown in Table 3.
Each of the two latent classes is defined in terms of its relative size and its
latent profile---the complete set of average test scores for its members.
Apparently Class I consists of those who are poor at arithmetic, while Class
II contains the good arithmetic students.
A fruitful way to visualize this latent profile solution is in terms of the
regressions of the tests on the latent continuum of arithmetic ability: Such
a graph of mean test score, Y, against position along the latent continuum,
X, is shown in Figure 2. Here the regressions of all four tests on the latent
Y
1.00
Tests
I-4
Glossl!
I" IX
-I.00 l.O0
Clas! II
Tests/
I-4
-!.00
F~GURE 2
Regression of Tests on Latent Continuum for A Fictitious Two-Class Case
the same (X/2/2) as the factor loadings that would result from a factor
analysis of the correlations in Table 1. This simple correspondence between
the two models will vanish, however, as soon as any of the regressions become
nonlinear.
The regressions for this solution are pictured in Figure 3. Again both
axes are in standard units, so that the slopes of the regressions are identical
with the factor loadings reported by Thurstone. Although the present latent
profile solution is not rotationally unique, it can fairly readily be sho~n to
possess an important kind of invariauce, namely, that the slopes of the re-
gressions, when both axes are in standard units, remain constant regardless o]
how the rotational indeterminacy is resolved. This is but one illustration of
the fact that the factor analysis and latent profile models are mutually
complementary when the assumptions of both models are not violated.
W. A. GIBSON 245
Y
1.00 Tests
ClQss 1
........ X
-I.00
FzO~IR~ 3
Regressions of Tests on Latent Continuum for Nine Reading Tests
TA~I~ 6 •AB~ 8
TASL~ 7 TABL~ 9
Y
Tests
1.00"
1,2
Closs I
{X:-V-f}
1I i I I X
-LO0 1.00 I
Tests CIQss IIl
4,5 - - (x=+~l
3/ / -I.00~
1,2
/ A.
IX=O)
Fmu~E 4
Regressions of Tests on Latent Continuum for A Fictitious Three-Class Case
The simple structure correlated factor so]ution for the Ferg~ison example
is shown in Table 12. That solution was obtained by a rotation of the factori-
zation reported by Ferguson ([4], p. 328). Here again it would be absurd to
conclude, according to the usual rules, that the data must be thought of in
teiTns of two relatively independent factors--high-level and low-level verbal
intelligence.
A letter from Ferguson has indicated that the raw scores are no longer
available for the computation of higher order manifest product moments.
It therefore was necessary to approximate the latent profile solution by
means of procedures similar to those used in the Davis example. For this
purpose Fergllson's factorization of his corre|ations was used. His two factors
accounted for the correlations with discrepancies not exceeding .03, and the
present latent profile solution fits the correlations in eactly the same way.
The rotational indeterminacy here was resolved in two stages. The
first step was to rotate Ferguson's factorization, with attention being given
only to subtests 1 and 6, into maxhnum correspondence with. the initial
factorization for the first and last tests in Example Ill. The latter faetori-
zation was a part of the latent profile solution for that example. In Example
III the unique solution was obtained by applying, to the initial factorization
248 PSYCH0:METRIKA
TABL~ lO
TABLE 1 / m/BL~ 13
6 ~.?? - .24 .~5 z.?.5 z.7o 2.7z ~.~7 Class Sizes •29 ,9o .25
Y
Tesls
.6
1.00"
Class l
(x=~,]TI
I I !
'X
-I.(X) 1.00 T
Class III
Tests (x= +/'~)
-'°°t
Cla!s 11
x=o)
FIGURB 5
Regressions of Subtests on Latent
Continuum for Six Verbal IntelligenceSubtests
produced by the absence of higher order manifest data in the case of the two
empirical examples. The first is the lsck of a scale of measurement for the
latent continuum. For two classes this is unimportant, for it leaves only an
arbitrariness as to the origin and the distances of the two classes from that
origin. The problem of the relative distances between classes cannot arise
when there is only one such distance. With three-class solutions, however,
the problem of relative spacing is a critical one. Without some resolution of
it the regressions of tests on the latent continuum could not be drawn. Nor
could the shape of the distribution of positions along the latent continuum
be ascertained. In both of the present three-class solutions, the problem was
resolved by the arbitrary assumption of equal spacing of the classes along the
latent continuum. The regressions were drawn on that basis, and on the
same basis the latent distribution in each case became symmetric and approxi-
mately normal. Other assumptions about the underlying metric would have
led to different regressions and to different latent distributions. A separate
paper [9], stemming from some recent developments in latent structure
analysis [16], den.is further with this metric problem. It indicates one way in
which, with the aid of manifest product moments of still higher order, a
metric for the latent continuum can be made to emerge as an integral part
of the latent profile solution.
A second limitation of all latent profile examples in the present paper is
their unidhnensionality. It will be recalled that nothing in the development
of the latent profile equations restricts the number of underlying dimensions
within which the latent classes lie. Of course the two-class examples here can
be understood in terms of a single continuum, for that would be true of any
two-class case. The present three-class examples, however, are unidimensional
because of the special nature (homogeneous in content but graded in difficulty)
of the tests involved in them. Many three-class examples would require two
underlying dimensions for an adequate understanding of their psychological
meaning. In general, a q-class solution could require as many as (q - 1)
underlying dimensions for its interpretation. Subsequent work [cf. 10]
will deal with such multidimensional examples and with the problems of
dimensionality and metric that arise in their interpretation.
The reader may have noticed that in all four latent profile examples
no mention was made of a need for manifest data of order higher than the
third. Even for the two empirical examples a unique solution would not
have required the use of such higher order manifest data. These higher
order data therefore constitute a means for testing the assumption of higher
order within-class independence. This could be done by comparing the given
higher order data with the corresponding fitted values as generated from
the latent parameters by subsequent lines of equations (13) or (14). Alter-
natively it could be argued t h a t , if the solution never~ requires data above
third order, there is no need to postulate within-class independence beyond
w . h. GIBSON 251
that order. From this parsimonious viewpoint, the latent profile equations
need extend no higher than third order, and it would be inappropriate to
think of using higher order data to test the fit of the model. However, it
might then be more important to test the adequacy of the solution by relating
it to variables not included in the original analysis. (An empirical example in
[8] gives an illustration of how this could be carried out in latent structure
analysis, where the very same argument over the use or non-use of higher
order equations not needed in a particular solution can be made.)
The latent profile equations that have been derived and illustrated here
are analogous to only one form of latent structure analysis--that knox~n
as the discrete class case. The analysis divides the sample into a small number
of discrete classes possessing second- and third-order within-class independ-
ence, and stops there. Other varieties of latent structure analysis, one of
which has already been referred to, have gone further in stipulating the
algebraic form of the regressions (the so-called trace lines [eft 15] of latent
structure analysis) or of the set of class sizes, or of both. Always, however,
the postulate of within-class independence is retained. Usually these further
restraints require within-class independence of higher than third order, so
that the corresponding higher orders of manifest data become directly
involved in the solution. Most of these variants of latent structure analysis
are readily translated into latent profile terms. In fact, the analogy between
the two models is so close t h a t almost whatever progress is made in the
various solutions for one model is convertible into a corresponding advance
for the other.
Conclusion
After outlining the derivation of the factor analysis and latent structure
models, this paper has shown how the latter can be generalized for analyzing
the interrelations among quantitathTe measures in a way that avoids some of
the troublesome problems of factor analysis. The resulting latent profile
n~ode] is applied to some simple fictitious and empirical data to illustrate its
use. Because such applications may seem to show some promise, it is perhaps
appropriate and not premature to conclude this paper merely by broadening
the reference for the admonition ([18], p. 70) with which it began.
It would be unfortunate if some initial success with the analytical
methods.., described here should lead us to commit ourselves to them with
such force of habit as to disregard the development of entirely different con-
structs that may be indicated by improvements in measurement and by
inconsistencies between theory and experiment.
REFERENCES
[2] Carroll, J. B. The effect of difficulty and chance success on correlations between items
or between tests. Psychometrika, 1945, 10, 1-19.
[3] Davis, F. B. Fundamental factors of comprehension in reading. Psychometrika, 1944,
9, 185-197.
[4] Ferguson, G. A. The factorial interpretation of test difficulty. Psychometrika, 1941,
5, 323-329.
[5] Gibson, W. A. Applications of the mathematics of multiple-factor analysis to problems
of latent structure analysis. In P. F. Lazarsfeld et al., The use of mathematical models
in the measurement of attitudes. Res. Memo. No. 455, Rand Corp., 1951.
f6] Gibson, W. A. Applications of the mathematics of multiple-factor analysis to problems
of latent structure analysis. Unpublished doctoral dissertation, Univ. Chicago, 1951.
[7] Gibson, W. A. An extension of Anderson's solution for the latent structure equations.
Psychometrika, 1955, 20, 69~73.
[8] Gibson, W. A. Multiple factors and latent structure. Unpublished manuscript.
[9] Gibson, W. A. Nonlinear factor analysis, single factor case. Amer. Psychologist, 1955,
10, 438. (Abstract)
[10] Gibson, W. A. Nonlinear factors in two dimensions. Amer. Psychologist, 1956, 11,
415 (Abstract)
[11] Green, B. F. A general solution for the latent class model of latent structure analysis.
Psychometrika, 1951, 15, 151-166.
[12] Green, B. F. Latent structure analysis and its relation to factor analysis. J. Amer.
statist. Ass., 1952, 47, 71-76.
[13] Guilford, J. P. The difficulty of a test and its factor composition. Psychometrika, 1941,
6, 67-77.
[14] Lazarsfeld, P. F. The interpretation and computation of some latent structures. In
S. A. Stouffer et al., Measurement and prediction. Princeton: Princeton Univ. Press,
1950. Ch. 11.
[15] Lazarsfeld, P. F. The logical and mathematical foundation of latent structure analysis.
In S. A. Stouffer et al., Measurement and prediction. Princeton: Princeton Univ. Press,
1950. Ch. 10.
[16] Lazarsfeld, P. F. Some new results and problems in latent structure analysis. Un-
published manuscript.
[17] Lazarsfeld, P. F., et al. The use of mathematical models in the measurement of atti-
tudes. Res. Memo. No. 455, Rand Corp., 1951.
[18] Thurstone, L. L. Multiple-factor analysis. Chicago: Univ. Chicago Press, 1947.
[19] Thurstone, L. L. Note on a reanalysis of Davis' reading tests. Psychometrika, 1946,
11, 185-188.
[20] Wherry, R. J. and Gaylord, R. H. Factor pattern of test items and tests as a function
of the correlation coefficient: content, difficulty, and constant error factors. Psycho-
metrika, 1944, 9, 237-244.
Manuscript received 2/7/58
Revised manuscript received P/19/58