Thanks to visit codestin.com
Credit goes to www.scribd.com

100% found this document useful (1 vote)
239 views47 pages

Advanced Marketing Analysis

This document provides an outline and overview of discriminant analysis and logit models. It discusses key concepts such as discriminant functions, canonical correlations, centroids, and classification matrices. It also outlines the steps to conducting discriminant analysis, including formulating the problem, estimating discriminant function coefficients, determining significance, interpreting results, and validating the analysis. Finally, it provides an illustrative data sample on resort visits that could be analyzed using these techniques.

Uploaded by

Shachi Desai
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
100% found this document useful (1 vote)
239 views47 pages

Advanced Marketing Analysis

This document provides an outline and overview of discriminant analysis and logit models. It discusses key concepts such as discriminant functions, canonical correlations, centroids, and classification matrices. It also outlines the steps to conducting discriminant analysis, including formulating the problem, estimating discriminant function coefficients, determining significance, interpreting results, and validating the analysis. Finally, it provides an illustrative data sample on resort visits that could be analyzed using these techniques.

Uploaded by

Shachi Desai
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 47

Chapter 18: Discriminant and Logit Analysis

Advanced Marketing Research


Chapter Outline
1) Overview
2) Basic Concept
3) Relation to Regression and ANOVA
4) Discriminant Analysis Model
5) Statistics Associated with Discriminant Analysis
6) Conducting Discriminant Analysis
i. Formulation
ii. Estimation
iii. Determination of Significance
iv. Interpretation
v. Validation
7) Multiple Discriminant Analysis
i. Formulation
ii. Estimation
iii. Determination of Significance
iv. Interpretation
v. Validation
8) Stepwise Discriminant Analysis

9) The Logit Model

i. Estimation

ii. Model Fit

iii. Significance Testing

iv. Interpretation of Coefficients

v. An Illustrative Application

10) Summary
Similarities and Differences between ANOVA,
Regression, and Discriminant Analysis
ANOVA REGRESSION DISCRIMINANT/LOGIT
Similarities
Number of One One One
dependent
variables
Number of
independent Multiple Multiple Multiple
variables

Differences
Nature of the
dependent Metric Metric Categorical
variables
Nature of the
independent Categorical Metric Metric
variables
Discriminant Analysis
• Discriminant analysis is a technique for analyzing data when the
criterion or dependent variable is categorical and the predictor or
independent variables are interval in nature.
• The objectives of discriminant analysis are as follows:
1. Development of discriminant functions, or linear combinations
of the predictor or independent variables, which will best
discriminate between the categories of the criterion or dependent
variable (groups).
2. Examination of whether significant differences exist among the
groups, in terms of the predictor variables.
3. Determination of which predictor variables contribute to most of
the intergroup differences.
4. Classification of cases to one of the groups based on the values of
the predictor variables.
5. Evaluation of the accuracy of classification.
• When the criterion variable has two categories, the
technique is known as two-group discriminant analysis.
• When three or more categories are involved, the technique
is referred to as multiple discriminant analysis.
• The main distinction is that, in the two-group case, it is
possible to derive only one discriminant function.
• In multiple discriminant analysis, more than one function
may be computed. In general, with G groups and k
predictors, it is possible to estimate up to the smaller of G
- 1, or k, discriminant functions.
• The first function has the highest ratio of between-groups
to within-groups sum of squares.
• The second function, uncorrelated with the first, has the
second highest ratio, and so on. However, not all the
functions may be statistically significant.
Geometric Interpretation
Fig. 18.1

X2 G1
1 1 2 2
G2
1 1 11 2
1 1 1 1 2
1
2 2
2 22
1 2
21
1 22
22

G1

G2 X1

D
Discriminant Analysis Model
The discriminant analysis model involves linear combinations of
the following form:
D = b0 + b1X1 + b2X2 + b3X3 + . . . + bkXk
Where:
D = discriminant score
b 's = discriminant coefficient or weight
X 's = predictor or independent variable

• The coefficients, or weights (b), are estimated so that the groups differ as much
as possible on the values of the discriminant function.
• This occurs when the ratio of between-group sum of squares to within-group
sum of squares for the discriminant scores is at a maximum.
Statistics Associated with Discriminant Analysis

• Canonical correlation: Canonical correlation measures the


extent of association between the discriminant scores and the
groups. It is a measure of association between the single
discriminant function and the set of dummy variables that
define the group membership.
• Centroid: The centroid is the mean values for the discriminant
scores for a particular group. There are as many centroids as
there are groups, as there is one for each group. The means for
a group on all the functions are the group centroids.
• Classification matrix: Sometimes also called confusion or
prediction matrix, the classification matrix contains the number
of correctly classified and misclassified cases.
• Discriminant function coefficients. The discriminant function
coefficients (unstandardized) are the multipliers of variables,
when the variables are in the original units of measurement.
• Discriminant scores. The unstandardized coefficients are
multiplied by the values of the variables. These products are
summed and added to the constant term to obtain the
discriminant scores.
• Eigenvalue. For each discriminant function, the Eigenvalue is
the ratio of between-group to within-group sums of squares.
Large Eigenvalues imply superior functions.
• F values and their significance. These are calculated from a
one-way ANOVA, with the grouping variable serving as the
categorical independent variable. Each predictor, in turn,
serves as the metric dependent variable in the ANOVA.
• Group means and group standard deviations. These are
computed for each predictor for each group.
• Pooled within-group correlation matrix. The pooled within-
group correlation matrix is computed by averaging the separate
covariance matrices for all the groups.
• Standardized discriminant function coefficients. The
standardized discriminant function coefficients are the
discriminant function coefficients and are used as the multipliers
when the variables have been standardized to a mean of 0 and a
variance of 1.
• Structure correlations. Also referred to as discriminant
loadings, the structure correlations represent the simple
correlations between the predictors and the discriminant
function.
• Total correlation matrix. If the cases are treated as if they were
from a single sample and the correlations computed, a total
correlation matrix is obtained.
• Wilks'Λ: Sometimes also called the U statistic,
Wilks'Λ for each predictor is the ratio of the
within-group sum of squares to the total sum of
squares. Its value varies between 0 and 1.
• Large values of Λ (near 1) indicate that group
means do not seem to be different. Small values
of Λ (near 0) indicate that the group means seem
to be different.
Conducting Discriminant Analysis
• Fig. 18.2
Formulate the Problem

Estimate the Discriminant Function Coefficients

Determine the Significance of the Discriminant Function

Interpret the Results

Assess Validity of Discriminant Analysis


Formulate the Problem
• Identify the objectives, the criterion variable, and the
independent variables.
• The criterion variable must consist of two or more mutually
exclusive and collectively exhaustive categories.
• The predictor variables should be selected based on a theoretical
model or previous research, or the experience of the researcher.
• One part of the sample, called the estimation or analysis sample,
is used for estimation of the discriminant function.
• The other part, called the holdout or validation sample, is
reserved for validating the discriminant function.
• Often the distribution of the number of cases in the analysis and
validation samples follows the distribution in the total sample.
Information on Resort Visits: Analysis Sample
Table 18.2
Annual Attitude Importance Household Age of Amount
Resort Family Toward Attached Size Head of Spent on
No. Visit Income Travel to Family Household Family
($000) Vacation Vacation

1 1 50.2 5 8 3 43 M (2)
2 1 70.3 6 7 4 61 H (3)
3 1 62.9 7 5 6 52 H (3)
4 1 48.5 7 5 5 36 L (1)
5 1 52.7 6 6 4 55 H (3)
6 1 75.0 8 7 5 68 H (3)
7 1 46.2 5 3 3 62 M (2)
8 1 57.0 2 4 6 51 M (2)
9 1 64.1 7 5 4 57 H (3)
10 1 68.1 7 6 5 45 H (3)
11 1 73.4 6 7 5 44 H (3)
12 1 71.9 5 8 4 64 H (3)
13 1 56.2 1 8 6 54 M (2)
14 1 49.3 4 2 3 56 H (3)
15 1 62.0 5 6 2 58 H (3)
Annual Attitude Importance Household Age of Amount
Resort Family Toward Attached Size Head of Spent on
No. Visit Income Travel to Family Household Family
($000) Vacation Vacation

16 2 32.1 5 4 3 58 L (1)
17 2 36.2 4 3 2 55 L (1)
18 2 43.2 2 5 2 57 M (2)
19 2 50.4 5 2 4 37 M (2)
20 2 44.1 6 6 3 42 M (2)
21 2 38.3 6 6 2 45 L (1)
22 2 55.0 1 2 2 57 M (2)
23 2 46.1 3 5 3 51 L (1)
24 2 35.0 6 4 5 64 L (1)
25 2 37.3 2 7 4 54 L (1)
26 2 41.8 5 1 3 56 M (2)
27 2 57.0 8 3 2 36 M (2)
28 2 33.4 6 8 2 50 L (1)
29 2 37.5 3 2 3 48 L (1)
30 2 41.3 3 3 2 42 L (1)
Information on Resort Visits:
Holdout Sample
Table 18.3
Annual Attitude Importance Household Age of Amount
Resort Family Toward Attached Size Head of Spent on
No. Visit Income Travel to Family Household Family
($000) Vacation Vacation

1 1 50.8 4 7 3 45 M(2)
2 1 63.6 7 4 7 55 H (3)
3 1 54.0 6 7 4 58 M(2)
4 1 45.0 5 4 3 60 M(2)
5 1 68.0 6 6 6 46 H (3)
6 1 62.1 5 6 3 56 H (3)
7 2 35.0 4 3 4 54 L (1)
8 2 49.6 5 3 5 39 L (1)
9 2 39.4 6 5 3 44 H (3)
10 2 37.0 2 6 5 51 L (1)
11 2 54.5 7 3 3 37 M(2)
12 2 38.2 2 2 3 49 L (1)
Estimate the Discriminant Function Coefficients

• The direct method involves estimating the


discriminant function so that all the predictors are
included simultaneously.

• In stepwise discriminant analysis, the predictor


variables are entered sequentially, based on their
ability to discriminate among groups
Results of Two-Group Discriminant Analysis
Table 18.4
GROUP MEANS VISIT INCOME TRAVEL VACATION
HSIZE AGE

1 60.52000 5.40000 5.80000 4.33333 53.73333


2 41.91333 4.33333 4.06667 2.80000 50.13333
Total 51.21667 4.86667 4.9333 3.56667 51.93333

Group Standard Deviations

1 9.83065 1.91982 1.82052 1.23443 8.77062


2 7.55115 1.95180 2.05171 .94112 8.27101
Total 12.79523 1.97804 2.09981 1.33089 8.57395

Pooled Within-Groups Correlation Matrix


INCOME TRAVEL VACATION HSIZE AGE

INCOME 1.00000
TRAVEL 0.19745 1.00000
VACATION 0.09148 0.08434 1.00000
HSIZE 0.08887 -0.01681 0.07046 1.00000
AGE - 0.01431 -0.19709 0.01742 -0.04301 1.00000

Wilks' (U-statistic) and univariate F ratio with 1 and 28 degrees of freedom

Variable Wilks' F Significance

INCOME 0.45310 33.800 0.0000


TRAVEL 0.92479 2.277 0.1425
VACATION 0.82377 5.990 0.0209
HSIZE 0.65672 14.640 0.0007
AGE 0.95441 1.338 0.2572 Cont.
Table 18.4, cont.
CANONICAL DISCRIMINANT FUNCTIONS
% of Cum Canonical After Wilks'
Function Eigenvalue Variance % Correlation Function  Chi-square df Significance
: 0 0 .3589 26.130 5 0.0001
1* 1.7862 100.00 100.00 0.8007 :

* marks the 1 canonical discriminant functions remaining in the analysis.

Standard Canonical Discriminant Function Coefficients


FUNC 1

INCOME 0.74301
TRAVEL 0.09611
VACATION 0.23329
HSIZE 0.46911
AGE 0.20922

Structure Matrix:
Pooled within-groups correlations between discriminating variables & canonical discriminant functions (variables ordered by
size of correlation within function)

FUNC 1

INCOME 0.82202
HSIZE 0.54096
VACATION 0.34607
TRAVEL 0.21337
AGE 0.16354 Cont.
Table 18.4, cont.
Table 18.4, cont.

Classification Results for cases not selected for use in the


analysis (holdout sample)
Predicted Group Membership
Actual Group No. of Cases 1 2
Group 1 6 4 2
66.7% 33.3%
Group 2 6 0 6
0.0% 100.0%
Percent of grouped cases correctly classified: 83.33%.
Determine the Significance of Discriminant
Function
• The null hypothesis that, in the population, the means
of all discriminant functions in all groups are equal can
be statistically tested.
• In SPSS this test is based on Wilks' Λ . If several
functions are tested simultaneously (as in the case of
multiple discriminant analysis), the Wilks' Λ statistic is
the product of the univariate for each function. The
significance level is estimated based on a chi-square
transformation of the statistic.
• If the null hypothesis is rejected, indicating significant
discrimination, one can proceed to interpret the results.
Interpret the Results
• The interpretation of the discriminant weights, or
coefficients, is similar to that in multiple
regression analysis.
• Given the multicollinearity in the predictor
variables, there is no unambiguous measure of the
relative importance of the predictors in
discriminating between the groups.
• With this limitation in mind, we can obtain some
idea of the relative importance of the variables by
examining the absolute magnitude of the
standardized discriminant function coefficients.
• Some idea of the relative importance of the
predictors can also be obtained by examining the
structure correlations, also called canonical
loadings or discriminant loadings.
• These simple correlations between each predictor
and the discriminant function represent the
variance that the predictor shares with the
function.
• Another aid to interpreting discriminant analysis
results is to develop a Characteristic profile for
each group by describing each group in terms of
the group means for the predictor variables
Assess Validity of Discriminant Analysis
• Many computer programs, such as SPSS, offer a leave-
one-out cross-validation option.
• The discriminant weights, estimated by using the
analysis sample, are multiplied by the values of the
predictor variables in the holdout sample to generate
discriminant scores for the cases in the holdout sample.
• The cases are then assigned to groups based on their
discriminant scores and an appropriate decision rule.
• The hit ratio, or the percentage of cases correctly
classified, can then be determined by summing the
diagonal elements and dividing by the total number of
cases.
• It is helpful to compare the percentage of cases
correctly classified by discriminant analysis to the
percentage that would be obtained by chance.
• Classification accuracy achieved by discriminant
analysis should be at least 25% greater than that
obtained by chance.
Results of Three-Group Discriminant Analysis
Table 18.5
Group Means
AMOUNT INCOME TRAVEL VACATION HSIZE AGE
1 38.57000 4.50000 4.70000 3.10000 50.30000
2 50.11000 4.00000 4.20000 3.40000 49.50000
3 64.97000 6.10000 5.90000 4.20000 56.00000
Total 51.21667 4.86667 4.93333 3.56667 51.93333

Group Standard Deviations


1 5.29718 1.71594 1.88856 1.19722 8.09732
2 6.00231 2.35702 2.48551 1.50555 9.25263
3 8.61434 1.19722 1.66333 1.13529 7.60117
Total 12.79523 1.97804 2.09981 1.33089 8.57395

Pooled Within-Groups Correlation Matrix


INCOME TRAVEL VACATION HSIZE AGE
INCOME 1.00000
TRAVEL 0.05120 1.00000
VACATION 0.30681 0.03588 1.00000
HSIZE 0.38050 0.00474 0.22080 1.00000
AGE -0.20939 -0.34022 -0.01326 -0.02512 1.00000

Cont.
Table 18.5, cont.

Wilks' (U-statistic) and univariate F ratio with 2 and 27 degrees of freedom.

Variable Wilks' Lambda F Significance

INCOME 0.26215 38.00 0.0000


TRAVEL 0.78790 3.634 0.0400
VACATION 0.88060 1.830 0.1797
HSIZE 0.87411 1.944 0.1626
AGE 0.88214 1.804 0.1840

CANONICAL DISCRIMINANT FUNCTIONS


% of Cum Canonical After Wilks'
Function Eigenvalue Variance % Correlation Function  Chi-square df Significance
: 0 0.1664 44.831 10 0.00
1* 3.8190 93.93 93.93 0.8902 : 1 0.8020 5.517 4 0.24
2* 0.2469 6.07 100.00 0.4450 :

* marks the two canonical discriminant functions remaining in the analysis.

Standardized Canonical Discriminant Function Coefficients


FUNC 1 FUNC 2
INCOME 1.04740 -0.42076
TRAVEL 0.33991 0.76851
VACATION -0.14198 0.53354
HSIZE -0.16317 0.12932
AGE 0.49474 0.52447
Cont.
Table 18.5, cont.

Structure Matrix:
Pooled within-groups correlations between discriminating variables and canonical discriminant functions
(variables ordered by size of correlation within function)

FUNC 1 FUNC 2
INCOME 0.85556* -0.27833
HSIZE 0.19319* 0.07749
VACATION 0.21935 0.58829*
TRAVEL 0.14899 0.45362*
AGE 0.16576 0.34079*

Unstandardized canonical discriminant function coefficients


FUNC 1 FUNC 2
INCOME 0.1542658 -0.6197148E-01
TRAVEL 0.1867977 0.4223430
VACATION -0.6952264E-01 0.2612652
HSIZE -0.1265334 0.1002796
AGE 0.5928055E-01 0.6284206E-01
(constant) -11.09442 -3.791600

Canonical discriminant functions evaluated at group means (group centroids)


Group FUNC 1 FUNC 2
1 -2.04100 0.41847
2 -0.40479 -0.65867
3 2.44578 0.24020 Cont.
Table 18.5, cont.

Classification Results:
Predicted Group Membership
Actual Group No. of Cases 1 2 3

Group 1 10 9 1 0
90.0%10.0%0.0%

Group 2 10 1 9 0
10.0%90.0%0.0%

Group 3 10 0 2 8
0.0% 20.0%80.0%
Percent of grouped cases correctly classified: 86.67%

Classification results for cases not selected for use in the analysis
Predicted Group Membership
Actual Group No. of Cases 1 2 3

Group 1 4 3 1 0
75.0%25.0%0.0%

Group 2 4 0 3 1
0.0% 75.0%25.0%

Group 3 4 1 0 3
25.0%0.0% 75.0%
Percent of grouped cases correctly classified: 75.00%
All-Groups Scattergram
Fig. 18.3

Across: Function 1
Down: Function 2

4.0
1 1
1 *1 3
23 3 *3 3
1 1 12 * 3 3
0.0 1 1 2 2
3
1 2 2
2
-4.0

* indicates a group
centroid

-6.0 -4.0 -2.0 0.0 2.0 4.0 6.0


Territorial Map
Fig. 18.4

13
13
13 Across: Function 1
8.0 13 Down: Function 2
13 * Indicates a
13
13 group centroid
4.0 11 3
112 3
112233
* 1 1 1 2 2 2 2 3 3*
112 * 223
0.0 121 2 233
1 12 2233
1 12 1 2 2 223
233
-4.0 1122
223
2
112
111 22 233
1 121 2 2 2233
112 223
-8.0 11122 233

-8.0 -6.0 -4.0 -2.0 0.0 2.0 4.0 6.0 8.0


Stepwise Discriminant Analysis
• Stepwise discriminant analysis is analogous to stepwise multiple
regression in that the predictors are entered sequentially based
on their ability to discriminate between the groups.
• An F ratio is calculated for each predictor by conducting a
univariate analysis of variance in which the groups are treated as
the categorical variable and the predictor as the criterion
variable.
• The predictor with the highest F ratio is the first to be selected
for inclusion in the discriminant function, if it meets certain
significance and tolerance criteria.
• A second predictor is added based on the highest adjusted or
partial F ratio, taking into account the predictor already
selected.
• Each predictor selected is tested for retention based on
its association with other predictors selected.
• The process of selection and retention is continued until
all predictors meeting the significance criteria for
inclusion and retention have been entered in the
discriminant function.
• The selection of the stepwise procedure is based on the
optimizing criterion adopted. The Mahalanobis
procedure is based on maximizing a generalized
measure of the distance between the two closest groups.
• The order in which the variables were selected also
indicates their importance in discriminating between the
groups.
The Logit Model
• The dependent variable is binary and there are
several independent variables that are metric.
• The binary logit model commonly deals with the
issue of how likely is an observation to belong to
each group.
• It estimates the probability of an observation
belonging to a particular group.
Binary Logit Model Formulation
• The probability of success may be modeled using the
logit model as:

æ P ö
log ç ÷ = a +a X +a X +... +a X
è1 - P ø
e 0 1 1 2 2 k k

æ P ö= n

Or log ç ÷ åa X
è1 - P ø
e ii i
i= 0
k

exp( å a X )
P =
i i
i =0

1 + exp(
k

å a X
i =0
i i )

Where :
P = Probability of success
Xi = Independent variable i
ai = parameter to be estimated.
Properties of the Logit Model
• Although Xi may vary from   to  , P is
constrained to lie between 0 and 1.

• When Xi approaches  , P approaches 0.

• When Xi approaches  , P approaches 1.

• When OLS regression is used, P is not constrained


to lie between 0 and 1.
Estimation and Model Fit
• The estimation procedure is called the maximum likelihood
method.
• Fit: Cox & Snell R Square and Nagelkerke R Square.
• Both these measures are similar to R2 in multiple regression.
• The Cox & Snell R Square can not equal 1, even if the fit is perfect
• This limitation is overcome by the Nagelkerke R Square.
• Compare the predicted and actual values of Y to determine the
percentage of correct predictions.
Significance Testing
The significance of the estimated coefficients is based on Wald’s statistic.

Wald = (ai / SEai)2

Where,

ai = logistical coefficient for that predictor variable

SEai= standard error of the logistical coefficient

The Wald statistic is chi-square distributed with 1 degree of freedom if the

variable is metric and the number of categories minus 1 if the variable is

nonmetric.
Interpretation of Coefficients
• If Xi is increased by one unit, the log odds will
change by ai units, when the effect of other
independent variables is held constant.

• The sign of ai will determine whether the


probability increases (if the sign is positive) or
decreases (if the sign is negative) by this amount.
Example: Explaining Brand Loyalty
Table 18.6
No. Loyalty Brand Product Shopping
1 1 4 3 5
2 1 6 4 4
3 1 5 2 4
4 1 7 5 5
5 1 6 3 4
6 1 3 4 5
7 1 5 5 5
8 1 5 4 2
9 1 7 5 4
10 1 7 6 4
11 1 6 7 2
12 1 5 6 4
13 1 7 3 3
14 1 5 1 4
15 1 7 5 5
16 0 3 1 3
17 0 4 6 2
18 0 2 5 2
19 0 5 2 4
20 0 4 1 3
21 0 3 3 4
22 0 3 4 5
23 0 3 6 3
24 0 4 4 2
25 0 6 3 6
26 0 3 6 3
27 0 4 3 2
28 0 3 5 2
29 0 5 5 3
30 0 1 3 2
Results of Logistic Regression
Table 18.7

Dependent Variable Encoding

Original Value Internal Value


Not Loyal 0
Loyal 1

Model Summary

-2 Log Cox & Snell Nagelkerke R


Step likelihood R Square Square
1 23.471(a) .453 .604
a Estimation terminated at iteration number 6 because parameter estimates changed by less than .001.
Table 18.7, cont.

Classification Table a
Predicted

Loyalty to the Brand Percentage


Observed Not Loyal Loyal Correct
Step 1 Loyalty to the Not Loyal 12 3 80.0
Brand Loyal 3 12 80.0
Overall Percentage 80.0

a. The cut value is .500

Variables in the Equation a


B S.E. Wald df Sig. Exp(B)
Step Brand 1.274 .479 7.075 1 .008 3.575
1 Product .186 .322 .335 1 .563 1.205
Shopping .590 .491 1.442 1 .230 1.804
Constant -8.642 3.346 6.672 1 .010 .000

a.Variable(s) entered on step 1: Brand, Product, Shopping.


Thank You

You might also like