0% found this document useful (0 votes)

92 views81 pages

09 - Common Limitations Errors PDF

This document discusses common limitations and errors in empirical research methods. It begins with a quick review of regression discontinuity design (RDD) methods. It then outlines that data is often imperfect, which can lead to measurement error and survivorship bias. Sample selection is also an issue if datasets only include certain types of firms. An example from Ali, Klasa, and Yeung (2009) shows how using an imperfect measure of industry concentration from Compustat led to incorrect prior conclusions. Overall, the document cautions that data limitations must be considered and can invalidate analyses if not addressed properly.

Uploaded by

Joe23232232

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

92 views81 pages

09 - Common Limitations Errors PDF

Uploaded by

Joe23232232

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 81

FNCE 926

Empirical Methods in CF
Lecture 9 – Common Limits & Errors

Professor Todd Gormley

Announcements
n  Rough draft of research proposal due
q  Should have uploaded to Canvas
q  I’ll try to e-mail feedback by next week

2
Background readings for today
n  Ali, Klasa, and Yeung (RFS 2009)
n  Gormley and Matsa (RFS 2014)

3
Outline for Today
n  Quick review of last lecture on RDD
n  Discuss common limitations and errors
q  Our data isn’t perfect
q  Hypothesis testing mistakes
q  How not to control for unobserved heterogeneity

n  Student presentations of “RDD” papers

4
Quick Review[Part 1]

n  What is difference between “sharp” and

“fuzzy” regression discontinuity design (RDD)?
q  Answer = With “sharp”, change in treatment status
only depends on x and cutoff x’; with “fuzzy”, only
probability of treatment varies at cutoff

5
Quick Review[Part 2]
n  Estimation of sharp RDD
yi = α + β di + f ( xi − x ') + di × g ( xi − x ') + ui

q  di = indicator for x ≥ x’

q  f( ) and g( ) are polynomial functions that
control for effect of x on y
q  Can also do analysis in tighter window
around threshold value x’

6
Quick Review[Part 3]
n  Estimation of fuzzy RDD is similar…
yi = α + β di + f ( xi − x ') + ui

But use Ti and as instrument for di where

q  Ti is indicator for x ≥ x’
q  And, di is indicator for treatment

7
Quick Review [Part 4]
n  What are some standard internal validity tests
you might want to run with RDD?
q  Answers:
n  Check robustness to different polynomial orders
n  Check robustness to bandwidth
n  Graphical analysis to show discontinuity in y
n  Compare other characteristics of firms around cutoff
threshold to make sure no other discontinuities
n  And more…

8
Quick Review [Part 5]
n  If effect of treatment is heterogeneous,
how does this affect interpretation of RDD
estimates?
q  Answer = They take on local average
treatment effect interpretation, and fuzzy RDD
captures only effect of compliers. Neither is
problem for internal validity, but can
sometimes limit external validity of finding

9
Common Limitations & Errors – Outline
n  Data limitations
n  Hypothesis testing mistakes
n  How to control for unobserved heterogeneity

10
Data limitations
n  The data we use is almost never perfect
q  Variables are often reported with error
q  Exit and entry into dataset typically not random
q  Datasets only cover certain types of firms

11
Measurement error – Examples
n  Variables are often reported with error
q  Sometimes it is just innocent noise
n  E.g. Survey respondents self report past income
with error [because memory isn’t perfect]

q  Sometimes, it is more systematic

n  E.g. survey might ask teenagers # of times
smoked marijuana, but teenagers that have
smoked and have high GPA might say zero

n  How will these errors affect analysis?

12
Measurement error – Why it matters
n  Answer = Depends; but in general, hard
to know exactly how this will matter
q  If y is mismeasured…
n  If only random noise, just makes SEs larger
n  But if systematic in some way [as in second example],
can cause bias if error is correlated with x’s
q  If x is mismeasured…
n  Even simple CEV causes attenuation bias on
mismeasured x and biases on all other variables

13
Measurement error – Solution
n  Standard measurement error solutions
apply [see “Causality” lecture]
q  Though admittedly, measurement error is
difficult to deal with unless know exactly
source and nature of the error

14
Survivorship Issues – Examples
n  In other cases, observations are included
or missing for systematic reasons; e.g.
q  Ex. #1 – Firms that do an IPO and are
added to datasets that cover public firms may
be different than firms that do not do an IPO
q  Ex. #2 – Firms adversely affected by some
event might subsequently drop out of data
because of distress or outright bankruptcy
n  How can these issues be problematic?

15
Survivorship Issues – Why it matters
n  Answer = There is a selection bias,
which can lead to incorrect inferences
q  Ex. #1 – E.g. going public may not cause
high growth; it’s just that the firms going
public were going to grow faster anyway
q  Ex. #2 – Might not find adverse effect of
event (or might understate it’s effect) if some
affected firms go bankrupt and are dropped

16
Survivorship Issues – Solution
n  Again, no easy solutions; but, if worried
about survivorship bias…
q  Check whether treatment (in diff-in-diff) is
associated with observations being more or
less likely to drop from data
q  In other analysis, check whether covariates
of observations that drop are systematically
different in a way that might be important

17
Sample is limited – Examples

n  Observations in commonly used datasets

are often limited to certain firms
q  Ex. #1 – Compustat covers largest public firms
q  Ex. #2 – Execucomp only provides incentives
on CEOs of firms listed on S&P 1500

n  How this might affect our analysis?

18
Sample is limited – Why it matters

n  Answer = Need to be careful when

making claims about external validity
q  Ex. #1 – Might find no effect of treatment in
Compustat because treatment effect is greatest
for unobserved, smaller, private firms
q  Ex. #2 – Observed correlations between
incentives and risk-taking in Execucomp
might not hold for smaller firms

19
Sample is limited – Solution

n  Be careful with inferences to avoid

making claims that lack external validity
n  Argue that your sample is representative
of economically important group
n  Hand-collect your own data if theory
your interested in testing requires it!
q  This can actually make for a great paper and
is becoming increasingly important in finance

20
Interesting Example of Data Problem

n  Ali, Klasa, and Yeung (RFS 2009) provide

interesting example of data problem
q  They note the following…
n  Many theories argue that “industry concentration”
is important factor in many finance settings
n  But researchers measure industry concentration
(i.e. herfindahl index) using Compustat

q  How might this be problematic?

21
Ali, et al. – Example data problem [P1]

n  Answer = Systematic measurement error!

q  Compustat doesn’t include private firms; so
using it causes you to mismeasure concentration
q  Ali, et al. find evidence of this by calculating
concentration using U.S. Census data
n  Correlation between measures is just 13%
n  Moreover, error in Compustat measure is
systematically related to some key variables, like
turnover of firms in the industry

22
Ali, et al. – Example data problem [P2]
n  Ali, et al. (RFS 2009) found it mattered;
using Census measure overturns four
previously published results
q  E.g. Concentration is positively related to R&D,
not negatively related as previously argued
q  See paper for more details...

23
Common Limitations & Errors – Outline
n  Data limitations
n  Hypothesis testing mistakes
n  How to control for unobserved heterogeneity

24
Hypothesis testing mistakes
n  As noted in lecture on natural experiments,
triple-difference can be done by running
double-diff in two separate subsamples
q  E.g. estimate effect of treatment on small firms;
then estimate effect of treatment on large firms

25
Example inference from such analysis
Small Large Low D/E High D/E
Sample =
Firms Firms Firms Firms
Treatment * Post 0.031 0.104** 0.056 0.081***
(0.121) (0.051) (0.045) (0.032)

N 2,334 3,098 2,989 2,876

R-‐squared 0.11 0.15 0.08 0.21
Firm dummies X X X X
Year dummies X X X X

q  From above results, researcher often concludes…

n  “Treatment effect is larger for bigger firms” Do you see any
n  “High D/E firms respond more to treatment” problem with
either claim?

26
Be careful making such claims!

n  Answer = Yes! The difference across subsamples

may not actually be statistically significant!
q  Hard to know if different just eyeballing it because
whether difference is significant depends on
covariance of the two separate estimates

n  How can you properly test these claims?

27
Example triple interaction result

All
Sample =
Firms
Treatment * Post 0.031
(0.121)
Difference is not actually
Treatment * Post * Large 0.073
(0.065)
statistically significant

N 5,432
R-‐squared 0.12 Remember to interact year
Firm dummies X dummies & triple difference;
Year dummies X
otherwise, estimates won’t
Year * Large dummies X
match earlier subsamples

28
Practical Advice
n  Don’t make claims you haven’t tested;
they could easily be wrong!
q  Best to show relevant p-values in text or tables
for any statistical significance claim you make
q  If difference isn’t statistically significant
[e.g. p-value = 0.15], can just say so; triple-diffs
are noisy, so this isn’t uncommon
q  Or, be more careful in your wording…
n  I.e. you could instead say, “we found an effect for large
firms, but didn’t find much evidence for small firms”

29
Common Limitations & Errors – Outline
n  Data limitations
n  Hypothesis testing mistakes
n  How to control for unobserved heterogeneity
q  How not to control for it
q  General implications
q  Estimating high-dimensional FE models

30
Unobserved Heterogeneity – Motivation
n  Controlling for unobserved heterogeneity is a
fundamental challenge in empirical finance
q  Unobservable factors affect corporate policies and prices
q  These factors may be correlated with variables of interest

n  Important sources of unobserved heterogeneity are

often common across groups of observations
q  Demand shocks across firms in an industry,
differences in local economic environments, etc.

31
Many different strategies are used
n  As we saw earlier, FE can control for unobserved
heterogeneities and provide consistent estimates
n  But, there are other strategies also used to control
for unobserved group-level heterogeneity…
q  “Adjusted-Y” (AdjY) – dependent variable is
demeaned within groups [e.g. ‘industry-adjust’]
q  “Average effects” (AvgE) – uses group mean of
dependent variable as control [e.g. ‘state-year’ control]

32
AdjY and AvgE are widely used
n  In JF, JFE, and RFS…
q  Used since at least the late 1980s
q  Still used, 60+ papers published in 2008-2010
q  Variety of subfields; asset pricing, banking,
capital structure, governance, M&A, etc.

n  Also been used in papers published in

the AER, JPE, and QJE and top
accounting journals, JAR, JAE, and TAR

33
But, AdjY and AvgE are inconsistent

n  As Gormley and Matsa (2014) shows…

q  Both can be more biased than OLS
q  Both can get opposite sign as true coefficient
q  In practice, bias is likely and trying to predict its
sign or magnitude will typically be impractical

n  Now, let’s see why they are wrong…

34
The underlying model [Part 1]
n  Recall model with unobserved heterogeneity
yi , j = β X i , j + fi + ε i , j

q  i indexes groups of observations (e.g. industry);

j indexes observations within each group (e.g. firm)
n  yi,j = dependent variable
n  Xi,j = independent variable of interest
n  fi = unobserved group heterogeneity
n  ε i , j = error term

35
The underlying model [Part 2]

n  Make the standard assumptions:

N groups, J observations per group,
where J is small and N is large
X and ε are i.i.d. across groups, but
not necessarily i.i.d. within groups
var( f ) = σ 2f , µ f = 0
var( X ) = σ X2 , µ X = 0
Simplifies some expressions,
var(ε ) = σ ε , µε = 0
2
but doesn’t change any results

36
The underlying model [Part 3]

n  Finally, the following assumptions are made:

cov( f i , ε i , j ) = 0
co v( X i , j , ε i , j ) = co v( X i , j , ε i , − j ) = 0
cov( X i , j , f i ) = σ Xf ≠ 0
What do these imply?
Answer = Model is correct in that
if we can control for f, we’ll
properly identify effect of X; but
if we don’t control for f there will
be omitted variable bias

37
We already know that OLS is biased
True model is: yi , j = β X i , j + fi + ε i , j

But OLS estimates: yi , j = β OLS X i , j + uiOLS

q  By failing to control for group effect, fi, OLS

suffers from standard omitted variable bias
σ Xf
βˆ OLS = β +
σ X2

Alternative estimation strategies are required…

38
Adjusted-Y (AdjY)
n  Tries to remove unobserved group heterogeneity by
demeaning the dependent variable within groups

AdjY estimates: yi , j − yi = β X i , j + uiAdjY

AdjY
,j

1
where yi =
J
∑ (β X
k ∈group i
i,k + fi + ε i ,k )

Note: Researchers often exclude observation at hand when

calculating group mean or use a group median, but both
modifications will yield similarly inconsistent estimates

39
Example AdjY estimation

n  One example – firm value regression:

Qi , j ,t − Qi ,t = α + β ' Xi, j ,t + ε i , j ,t
Qi , j ,t
q  = Tobin’s Q for firm j, industry i, year t
q  Qi ,t = mean of Tobin’s Q for industry i in year t
q  Xijt = vector of variables thought to affect value
q  Researchers might also include firm & year FE

Anyone know why AdjY is going to be inconsistent?

40
Here is why…
n  Rewriting the group mean, we have:
yi = fi + β X i + ε i ,

n  Therefore, AdjY transforms the true data to:

yi , j − yi = β X i , j − β X i + ε i , j − ε i

What is the AdjY estimation forgetting?

41
AdjY can have omitted variable bias
n  β̂ adjY can be inconsistent when β ≠ 0
True model: yi , j − yi = β X i , j − β X i + ε i , j − ε i
y
But, AdjY estimates: i , j − yi = β AdjY
X i, j + u AdjY
i, j

q  By failing to control for X i , AdjY suffers

from omitted variable bias when σ XX ≠ 0
σ XX In practice, a positive
βˆ AdjY = β − β
σ X2 covariance between X
and X will be common;
e.g. industry shocks

42
Now, add a second variable, Z

n  Suppose, there are instead two RHS variables

True model: yi , j = β X i , j + γ Zi , j + fi + ε i , j

n  Use same assumptions as before, but add:

cov( Z i , j , ε i , j ) = cov( Z i , j , ε i , − j ) = 0
var( Z ) = σ Z2 , µ Z = 0
cov( X i , j , Z i , j ) = σ XZ
cov( Z i , j , f i ) = σ Zf

43
AdjY estimates with 2 variables
n  With a bit of algebra, it is shown that:

⎡ β (σ XZ σ ZX − σ Z2σ XX ) + γ (σ XZ σ ZZ − σ Z2σ XZ ) ⎤
⎢β + ⎥
⎡ βˆ ⎤ ⎢
AdjY
σ Z σ X − σ XZ
2 2 2
⎥
⎢ AdjY ⎥ = ⎢ ⎥
⎢⎣γˆ ⎥⎦ ⎢ β (σ σ
XZ XX − σ X ZX )
2
σ + γ (σ σ
XZ XZ − σ X ZZ ) ⎥
2
σ
⎢γ + σ σ
2 2
− σ 2 ⎥
⎣ Z X XZ ⎦

Estimates of both Determining sign and

β and γ can be magnitude of bias will
inconsistent typically be difficult

44
Average Effects (AvgE)

n  AvgE uses group mean of dependent variable

as control for unobserved heterogeneity
AvgE estimates: yi , j = β AvgE X i , j + γ AvgE yi + uiAvgE
,j

45
Average Effects (AvgE)

n  Following profit regression is an AvgE example:

ROAi ,s,t = α + β ' Xi,s,t + γ ROAs,t + ε i ,s,t

q  ROAs,t = mean of ROA for state s in year t

q  Xist = vector of variables thought to profits
q  Researchers might also include firm & year FE

Anyone know why AvgE is going to be inconsistent?

46
AvgE has measurement error bias

n  AvgE uses group mean of dependent variable

as control for unobserved heterogeneity
AvgE estimates: yi , j = β AvgE X i , j + γ AvgE yi + uiAvgE
,j

Recall, true model: yi , j = β X i , j + fi + ε i , j

Problem is that yi measures fi with error

47
AvgE has measurement error bias
n  Recall that group mean is given by yi = fi + β X i + ε i ,
q  Therefore, yi measures fi with error − β X i − ε i
q  As is well known, even classical measurement error
causes all estimated coefficients to be inconsistent

n  Bias here is complicated because error can be

correlated with both mismeasured variable, fi ,
and with Xi,j when σ XX ≠ 0

48
AvgE estimate of β with one variable

n  With a bit of algebra, it is shown that:

βˆ AvgE = β +
( ) (
σ Xf βσ fX + β 2σ X2 + σ ε2 − σ εε − βσ XX σ 2f + βσ fX + σ εε )
( )
σ X2 σ 2f + 2βσ fX + β 2σ X2 + σ ε2 − (σ Xf + βσ XX )
2

Determining
magnitude and Covariance between X and X
again problematic, but not Even non-i.i.d.
direction of bias
needed for AvgE estimate to nature of errors
is difficult
be inconsistent can affect bias!

49
Comparing OLS, AdjY, and AvgE
n  Can use analytical solutions to compare
relative performance of OLS, AdjY, and AvgE
n  To do this, we re-express solutions…
q  We use correlations (e.g. solve bias in terms of
correlation between X and f, ρ Xf , instead of σ Xf )
q  We also assume i.i.d. errors [just makes bias of
AvgE less complicated]
q  And, we exclude the observation-at-hand when
calculating the group mean, X i , …

50
Why excluding Xi doesn’t help
n  Quite common for researchers to exclude
observation at hand when calculating group mean
q  It does remove mechanical correlation between X and
omitted variable, X i , but it does not eliminate the bias
q  In general, correlation between X and omitted variable, X i ,
is non-zero whenever X i is not the same for every group i
q  This variation in means across group is almost
assuredly true in practice; see paper for details

51
ρXf has large effect on performance
Estimate, β̂ AdjY more biased
than OLS, except for
True β = 1
2

large values for ρXf OLS

AdjY
0

AvgE gives
wrong sign for Other parameters held constant
AvgE low values of ρXf
-1

σ f / σ X = σ ε / σ X = 1, J = 10, ρ X , X = 0.5
i −i
-2

-0.75 -0.5 -0.25 0 0.25 0.5 0.75

ρ Xf
52
More observations need not help!
Estimate, β̂
OLS
1.25
1

AvgE
0.75

AdjY
0.5

0 5 10 15 20 25
σ f / σ X = σ ε / σ X = 1, ρ X , X = 0.5, ρ Xf = 0.25 J
i −i

53
Summary of OLS, AdjY, and AvgE

n  In general, all three estimators are inconsistent

in presence of unobserved group heterogeneity
n  AdjY and AvgE may not be an improvement
over OLS; depends on various parameter values

n  AdjY and AvgE can yield estimates with

opposite sign of the true coefficient

54
Fixed effects (FE) estimation
n  Recall: FE adds dummies for each group to OLS
estimation and is consistent because it directly
controls for unobserved group-level heterogeneity
n  Can also do FE by demeaning all variables with respect
to group [i.e. do ‘within transformation’] and use OLS

y
FE estimates: i , j − yi = β FE
( i, j i ) i, j
X − X + u FE

True model: yi , j − yi = β ( X i , j − X i ) + (ε i , j − ε i )

55
Comparing FE to AdjY and AvgE
n  To estimate effect of X on Y controlling for Z
q  One could regress Y onto both X and Z… Add group FE
q  Or, regress residuals from regression of Y on Z
onto residuals from regression of X on Z Within-group
transformation!
§  AdjY and AvgE aren’t the same as finding the
effect of X on Y controlling for Z because...
§  AdjY only partials Z out from Y
§  AvgE uses fitted values of Y on Z as control

56
The differences will matter! Example #1
n  Consider the following capital structure regression:
( D / A)i ,t = α + βXi,t + fi + ε i ,t
q  (D/A)it = book leverage for firm i, year t
q  Xit = vector of variables thought to affect leverage
q  fi = firm fixed effect

n  We now run this regression for each approach to

deal with firm fixed effects, using 1950-2010 data,
winsorizing at 1% tails…

57
Estimates vary considerably
De p e n d e n t variab le = b o o k le ve rag e

OLS Adj Y Avg E FE

Fixed Assets/ Total Assets 0.270* 0.066* 0.103* 0.248*

(0.008) (0.004) (0.004) (0.014)
Ln(sales) 0.011*** 0.011*** 0.011*** 0.017***
(0.001) 0.000 0.000 (0.001)
Return on Assets -‐0.015*** 0.051*** 0.039*** -‐0.028***
(0.005) (0.004) (0.004) (0.005)
Z-‐score -‐0.017*** -‐0.010*** -‐0.011*** -‐0.017***
0.000 (0.000) (0.000) (0.001)
Market-‐to-‐book Ratio -‐0.006*** -‐0.004*** -‐0.004*** -‐0.003***
(0.000) (0.000) (0.000) (0.000)

Observations 166,974 166,974 166,974 166,974

R-‐squared 0.29 0.14 0.56 0.66

58
The differences will matter! Example #2

n  Consider the following firm value regression:

Qi , j ,t = α + β ' Xi, j ,t + f j ,t + ε i , j ,t

q  Q = Tobin’s Q for firm i, industry j, year t

q  Xijt = vector of variables thought to affect value
q  fj,t = industry-year fixed effect

n  We now run this regression for each approach

to deal with industry-year fixed effects…

59
Estimates vary considerably
Dependent Variable = Tobin's Q
OLS Adj Y Avg E FE

Delaware Incorporation 0.100* 0.019 0.040 0.086

(0.036) (0.032) (0.032) (0.039)
Ln(sales) -‐0.125*** -‐0.054*** -‐0.072*** -‐0.131***
(0.009) (0.008) (0.008) (0.011)
R&D Expenses / Assets 6.724*** 3.022*** 3.968*** 5.541***
(0.260) (0.242) (0.256) (0.318)
Return on Assets -‐0.559*** -‐0.526*** -‐0.535*** -‐0.436***
(0.108) (0.095) (0.097) (0.117)

Observations 55,792 55,792 55,792 55,792

R-‐squared 0.22 0.08 0.34 0.37

60
Common Limitations & Errors – Outline
n  Data limitations
n  Hypothesis testing mistakes
n  How to control for unobserved heterogeneity
q  How not to control for it
q  General implications
q  Estimating high-dimensional FE models

61
General implications

n  With this framework, easy to see that other

commonly used estimators will be biased

62
Other AdjY estimators are problematic
n  Same problem arises with other AdjY estimators
q  Subtracting off median or value-weighted mean
q  Subtracting off mean of matched control sample
[as is customary in studies if diversification “discount”]
q  Comparing “adjusted” outcomes for treated firms pre-
versus post-event [as often done in M&A studies]
q  Characteristically adjusted returns [as used in asset pricing]

63
AdjY-type estimators in asset pricing
n  Common to sort and compare stock returns across
portfolios based on a variable thought to affect returns
n  But, returns are often first “characteristically adjusted”
q  I.e. researcher subtracts the average return of a benchmark
portfolio containing stocks of similar characteristics
q  This is equivalent to AdjY, where “adjusted returns” are
regressed onto indicators for each portfolio

n  Approach fails to control for how avg. independent

variable varies across benchmark portfolios

64
Asset Pricing AdjY – Example
n  Asset pricing example; sorting returns based
on R&D expenses / market value of equity
Characteristically adjusted returns by R&D Quintile (i.e., Adj Y)
Missing Q1 Q2 Q3 Q4 Q5
-‐0.012*** -‐0.033*** -‐0.023*** -‐0.002 0.008 0.020***
(0.003) (0.009) (0.008) (0.007) (0.013) (0.006)

Difference between
We use industry-size benchmark portfolios Q5 and Q1 is 5.3
and sorted using R&D/market value percentage points

65
Estimates vary considerably
Dependent Variable = Yearly Stock Return
Adj Y FE
R&D Missing 0.021** 0.030***
(0.009) (0.010) Same AdjY result,
R&D Quintile 2 0.01 0.019 but in regression
(0.013) (0.014) format; quintile 1
R&D Quintile 3 0.032*** 0.051*** is excluded
(0.012) (0.018)
R&D Quintile 4 0.041*** 0.068***
(0.015) (0.020) Use benchmark-period
R&D Quintile 5 0.053*** 0.094*** FE to transform both
(0.011) (0.019) returns and R&D; this is
Observations 144,592 144,592 equivalent to double sort
2
R 0.00 0.47

66
What if AdjY or AvgE is true model?
§  If data exhibited structure of AvgE estimator,
this would be a peer effects model
[i.e. group mean affects outcome of other members]
§  In this case, none of the estimators (OLS, AdjY,
AvgE, or FE) reveal the true β [Manski 1993;
Leary and Roberts 2010]

§  Even ifinterested in studying yi , j − y i , AdjY

only consistent if Xi,j does not affect yi,j

67
Common Limitations & Errors – Outline
n  Data limitations
n  Hypothesis testing mistakes
n  How to control for unobserved heterogeneity
q  How not to control for it
q  General implications
q  Estimating high-dimensional FE models

68
Multiple high-dimensional FE

n  Researchers occasionally motivate using

AdjY and AvgE because FE estimator is
computationally difficult to do when there
are more than one FE of high-dimension

Now, let’s see why this is

(and isn’t) a problem…

69
LSDV is usually needed with two FE

n  Consider the below model with two FE

Two separate
yi , j ,k = β X i , j ,k + fi + δ k + ε i , j ,k group effects

q  Unless panel is balanced, within transformation can

only be used to remove one of the fixed effects
q  For other FE, you need to add dummy variables
[e.g. add time dummies and demean within firm]

70
Why such models can be problematic

n  Estimating FE model with many dummies

can require a lot of computer memory
q  E.g., estimation with both firm and 4-digit
industry-year FE requires ≈ 40 GB of memory

71
This is growing problem

n  Multiple unobserved heterogeneities

increasingly argued to be important
q  Manager and firm fixed effects in executive
compensation and other CF applications
[Graham, Li, and Qui 2011, Coles and Li 2011]
q  Firm and industry×year FE to control for
industry-level shocks [Matsa 2010]

72
But, there are solutions!

n  There exist two techniques that can be

used to arrive at consistent FE estimates
without requiring as much memory
#1 – Interacted fixed effects
#2 – Memory saving procedures

73
#1 – Interacted fixed effects
n  Combine multiple fixed effects into one-
dimensional set of fixed effect, and remove
using within transformation
q  E.g. firm and industry-year FE could be
replaced with firm-industry-year FE

But, there are limitations…

q  Can severely limit parameters you can estimate
q  Could have serious attenuation bias

74
#2 – Memory-saving procedures
n  Use properties of sparse matrices to reduce
required memory, e.g. Cornelissen (2008)
n  Or, instead iterate to a solution, which
eliminates memory issue entirely, e.g.
Guimaraes and Portugal (2010)
q  See paper for details of how each works
q  Both can be done in Stata using user-written
commands FELSDVREG and REGHDFE

75
These methods work…

n  Estimated typical capital structure

regression with firm and 4-digit
industry×year dummies
q  Standard FE approach would not work; my
computer did not have enough memory…
q  Sparse matrix procedure took 8 hours…
q  Iterative procedure took 5 minutes

76
Summary of Today [Part 1]

n  Our data isn’t perfect…

q  Watch for measurement error
q  Watch for survivorship bias
q  Be careful about external validity claims

n  Make sure to test that estimates across

subsamples are actually statistically different

77
Summary of Today [Part 2]

n  Don’t use AdjY or AvgE!

n  But, do use fixed effects
q  Should use benchmark portfolio-period FE in
asset pricing rather than char-adjusted returns
q  Use iteration techniques to estimate models with
multiple high-dimensional FE

78
In First Half of Next Class
n  Matching
q  What it does…
q  And, what it doesn’t do

n  Related readings… see syllabus

79
Assign papers for next week…
n  Gormley and Matsa (working paper, 2015)
q  Corporate governance & playing it safe preferences

n  Ljungqvist, Malloy, Marston (JF 2009) No comments

needed from
q  Data issues in I/B/E/S other groups

n  Bennedsen, et al. (working paper, 2012)

q  CEO hospitalization events

80
Break Time
n  Let’s take our 10 minute break
n  We’ll do presentations when we get back

Chapter 4
No ratings yet
Chapter 4
42 pages
CMSC 177 Statistical Experiments and Significance Testing
No ratings yet
CMSC 177 Statistical Experiments and Significance Testing
31 pages
Quasi Experimental Design
No ratings yet
Quasi Experimental Design
11 pages
GEA1000 Helpsheet v2
No ratings yet
GEA1000 Helpsheet v2
2 pages
Statistical Approaches To Causal Analysis, 1st Edition EPUB DOCX PDF Download
100% (11)
Statistical Approaches To Causal Analysis, 1st Edition EPUB DOCX PDF Download
14 pages
Econometric S
No ratings yet
Econometric S
20 pages
Model Specification & Data Issues
No ratings yet
Model Specification & Data Issues
45 pages
Assumptions
No ratings yet
Assumptions
69 pages
RD Lecture Slides
No ratings yet
RD Lecture Slides
101 pages
AP Stats MC Review 2015
No ratings yet
AP Stats MC Review 2015
58 pages
Hunermund - Causal Inference Seminar - Lecture 4 - Regression Discontinuity Design
No ratings yet
Hunermund - Causal Inference Seminar - Lecture 4 - Regression Discontinuity Design
27 pages
ERP For IB Notes
No ratings yet
ERP For IB Notes
30 pages
T04 PDF
No ratings yet
T04 PDF
3 pages
Statistical Process Control Guide
No ratings yet
Statistical Process Control Guide
69 pages
IAR Lecture 5
No ratings yet
IAR Lecture 5
5 pages
Academic CV for Data Science Experts
No ratings yet
Academic CV for Data Science Experts
10 pages
Twitter Return Vs S&P 500 Return
No ratings yet
Twitter Return Vs S&P 500 Return
7 pages
Internal Econometrics
No ratings yet
Internal Econometrics
5 pages
Mock Exam Solution Empirical Methods For Finance
No ratings yet
Mock Exam Solution Empirical Methods For Finance
6 pages
Topic 1
No ratings yet
Topic 1
32 pages
1 Introduction
No ratings yet
1 Introduction
35 pages
OMBC106
No ratings yet
OMBC106
3 pages
Ds 5 Marks Final
No ratings yet
Ds 5 Marks Final
11 pages
Causal Inference and Research Design Scott Cunningham (Baylor)
100% (1)
Causal Inference and Research Design Scott Cunningham (Baylor)
1,056 pages
Bankruptcy Law Primer
100% (2)
Bankruptcy Law Primer
101 pages
Exercise 8 Micro 110 2023
No ratings yet
Exercise 8 Micro 110 2023
3 pages
Chapter 4
No ratings yet
Chapter 4
62 pages
Regression Explained SPSS
No ratings yet
Regression Explained SPSS
25 pages
DS Assignment COMPLETED
No ratings yet
DS Assignment COMPLETED
11 pages
DECS431 Week9 Class1
No ratings yet
DECS431 Week9 Class1
29 pages
SOCS0075 Lecture5
No ratings yet
SOCS0075 Lecture5
23 pages
P Chart
No ratings yet
P Chart
3 pages
Assumption - 16 - Oct18
No ratings yet
Assumption - 16 - Oct18
48 pages
07 - Natural Experiment (Part 2) PDF
No ratings yet
07 - Natural Experiment (Part 2) PDF
90 pages
BIO 610 Lab Manual: Design & Analysis
No ratings yet
BIO 610 Lab Manual: Design & Analysis
17 pages
ML37 de Thi
No ratings yet
ML37 de Thi
7 pages
Methods Notes
No ratings yet
Methods Notes
9 pages
Research Stats for Grad Students
100% (1)
Research Stats for Grad Students
4 pages
Lesson 07 - Panel Data Regression - 2024
No ratings yet
Lesson 07 - Panel Data Regression - 2024
32 pages
Statistics Interview Questions & Answers For Data Scientists
No ratings yet
Statistics Interview Questions & Answers For Data Scientists
43 pages
Network Analysis and Quantitative Techniques: Presented By
No ratings yet
Network Analysis and Quantitative Techniques: Presented By
48 pages
Dependency Models Only
No ratings yet
Dependency Models Only
5 pages
1 4 Multilevel and Longitudinal Mode PDF
No ratings yet
1 4 Multilevel and Longitudinal Mode PDF
1,503 pages
Data Analysts: Visualize Correlations
No ratings yet
Data Analysts: Visualize Correlations
2 pages
AHP Sampling
0% (1)
AHP Sampling
2 pages
CS194 Lec 06 EDA
No ratings yet
CS194 Lec 06 EDA
40 pages
Session 2 - QRT - Oct 3, 2020
No ratings yet
Session 2 - QRT - Oct 3, 2020
17 pages
Linear Model Diagnostics Guide
No ratings yet
Linear Model Diagnostics Guide
31 pages
Regression Generalization Guide
No ratings yet
Regression Generalization Guide
47 pages
Das FFFF
No ratings yet
Das FFFF
16 pages
Standard Deviation and Variance
No ratings yet
Standard Deviation and Variance
6 pages
Solutions To Sample Final Exam ECO2151
No ratings yet
Solutions To Sample Final Exam ECO2151
7 pages
Session 6 BEDO - Hyd (Before Class)
No ratings yet
Session 6 BEDO - Hyd (Before Class)
31 pages
Data Screening & Non-Parametric Models
No ratings yet
Data Screening & Non-Parametric Models
38 pages
Notes 14
No ratings yet
Notes 14
33 pages
Block 05d ControChartAdvanced
No ratings yet
Block 05d ControChartAdvanced
98 pages
Panel Guidelines
No ratings yet
Panel Guidelines
3 pages
Week5 Assumptions 1
No ratings yet
Week5 Assumptions 1
41 pages
HW 2 CHAP 2 - Yepez
No ratings yet
HW 2 CHAP 2 - Yepez
9 pages
QDT Final
No ratings yet
QDT Final
107 pages
Statistics Interview Questions
No ratings yet
Statistics Interview Questions
20 pages
Introduction To Data Cleaning and Bias in Analysis
No ratings yet
Introduction To Data Cleaning and Bias in Analysis
35 pages
Sample Thesis Using Anova
100% (3)
Sample Thesis Using Anova
6 pages
Pengaruh Beban Kerja Terhadap Kinerja Tenaga Kependidikan Pada Kantor Rektorat Universitas Nusa Cendana Kupang
No ratings yet
Pengaruh Beban Kerja Terhadap Kinerja Tenaga Kependidikan Pada Kantor Rektorat Universitas Nusa Cendana Kupang
23 pages
Fixed and Random Effects
No ratings yet
Fixed and Random Effects
23 pages
291-Article Text-1636-1-10-20210628
No ratings yet
291-Article Text-1636-1-10-20210628
12 pages
Introduction To Econometrics - Stock & Watson - CH 7 Slides
100% (1)
Introduction To Econometrics - Stock & Watson - CH 7 Slides
35 pages
Statictical Tolerancing PDF
No ratings yet
Statictical Tolerancing PDF
117 pages
Statistics Course Outline
No ratings yet
Statistics Course Outline
2 pages
Multivariant Data.
No ratings yet
Multivariant Data.
36 pages
Modules Statistics PDF
No ratings yet
Modules Statistics PDF
67 pages
Slide EAP Cross2
No ratings yet
Slide EAP Cross2
62 pages
Currency Market: Frontier Research and Chinese Currency Market Qinhua Chen
No ratings yet
Currency Market: Frontier Research and Chinese Currency Market Qinhua Chen
11 pages
Equity in The Time Series, Part 2: 15.433 Financial Markets October 3 & 5, 2017
No ratings yet
Equity in The Time Series, Part 2: 15.433 Financial Markets October 3 & 5, 2017
54 pages
Notes EAP Option PDF
No ratings yet
Notes EAP Option PDF
42 pages
Empirical Asset Pricing: Class 1: Introduction
No ratings yet
Empirical Asset Pricing: Class 1: Introduction
15 pages
MODULE 27 Two Sample T-Tests When Variances Unequal 19 July 05
No ratings yet
MODULE 27 Two Sample T-Tests When Variances Unequal 19 July 05
15 pages
05 - Instrumental Variables PDF
No ratings yet
05 - Instrumental Variables PDF
92 pages
Slide EAP Structure
No ratings yet
Slide EAP Structure
18 pages
Microsoft Machine Learning Algorithm Cheat Sheet v2 PDF
100% (1)
Microsoft Machine Learning Algorithm Cheat Sheet v2 PDF
1 page
Empirical Asset Pricing: Classes 17-20: Debt Markets and Banks
No ratings yet
Empirical Asset Pricing: Classes 17-20: Debt Markets and Banks
23 pages
Equity in The Time Series, Part 1: 15.433 Financial Markets September 28, 2017
No ratings yet
Equity in The Time Series, Part 1: 15.433 Financial Markets September 28, 2017
20 pages
Cross-Section of Stock Returns Analysis
No ratings yet
Cross-Section of Stock Returns Analysis
24 pages
Mutual Fund: Prepared For Empirical Asset Pricing Class at SAIF
No ratings yet
Mutual Fund: Prepared For Empirical Asset Pricing Class at SAIF
25 pages
Empirical Asset Pricing Behavioral Finance: Instructor: Xiaomeng Lu
No ratings yet
Empirical Asset Pricing Behavioral Finance: Instructor: Xiaomeng Lu
28 pages
02 - Linear Regression II PDF
No ratings yet
02 - Linear Regression II PDF
81 pages
Chapter 5
No ratings yet
Chapter 5
25 pages
CH15 Measures of Association
No ratings yet
CH15 Measures of Association
51 pages
Econometrics 2019 PDF
No ratings yet
Econometrics 2019 PDF
143 pages
06 - Natural Experiment (Part 1) PDF
No ratings yet
06 - Natural Experiment (Part 1) PDF
89 pages
As of Sep 16, 2021: Seppo Pynn Onen Econometrics I
No ratings yet
As of Sep 16, 2021: Seppo Pynn Onen Econometrics I
60 pages
Advanced Analysis QuantileRegression
No ratings yet
Advanced Analysis QuantileRegression
45 pages
Week 6: Assumptions in Regression Analysis
No ratings yet
Week 6: Assumptions in Regression Analysis
69 pages
Precision Statements in UOP Methods
No ratings yet
Precision Statements in UOP Methods
15 pages
Utility Maximization in Economics
No ratings yet
Utility Maximization in Economics
34 pages
Indian Stock Market Seasonality Study
No ratings yet
Indian Stock Market Seasonality Study
3 pages
Estimating Demand: Regression Analysis
No ratings yet
Estimating Demand: Regression Analysis
29 pages
CH 4 - Problems
No ratings yet
CH 4 - Problems
72 pages
SEM Boot Camp: Day 1 Basics & Data
No ratings yet
SEM Boot Camp: Day 1 Basics & Data
38 pages
Chicagoibank
No ratings yet
Chicagoibank
3 pages
08 - Regression Discontinuity PDF
No ratings yet
08 - Regression Discontinuity PDF
83 pages
Ch5 Slides Ed3 Feb2021
No ratings yet
Ch5 Slides Ed3 Feb2021
49 pages
Introduction To Data Science Exploratory Data Analysis
No ratings yet
Introduction To Data Science Exploratory Data Analysis
55 pages
04 - Panel Data PDF
No ratings yet
04 - Panel Data PDF
84 pages
MSC 515 Econometrics of Event Studies PDF
No ratings yet
MSC 515 Econometrics of Event Studies PDF
49 pages
Data Screening and Cleaning Guide
No ratings yet
Data Screening and Cleaning Guide
55 pages
Data Screening Assumptions
No ratings yet
Data Screening Assumptions
29 pages

09 - Common Limitations Errors PDF

Uploaded by

09 - Common Limitations Errors PDF

Uploaded by

FNCE 926

Professor Todd Gormley

n Student presentations of “RDD” papers

n What is difference between “sharp” and

q di = indicator for x ≥ x’

But use Ti and as instrument for di where

q Sometimes, it is more systematic

n How will these errors affect analysis?

n Observations in commonly used datasets

n How this might affect our analysis?

n Answer = Need to be careful when

n Be careful with inferences to avoid

n Ali, Klasa, and Yeung (RFS 2009) provide

q How might this be problematic?

n Answer = Systematic measurement error!

N 2,334 3,098 2,989 2,876

q From above results, researcher often concludes…

n Answer = Yes! The difference across subsamples

n How can you properly test these claims?

n Important sources of unobserved heterogeneity are

n Also been used in papers published in

n As Gormley and Matsa (2014) shows…

n Now, let’s see why they are wrong…

q i indexes groups of observations (e.g. industry);

n Make the standard assumptions:

n Finally, the following assumptions are made:

But OLS estimates: yi , j = β OLS X i , j + uiOLS

q By failing to control for group effect, fi, OLS

Alternative estimation strategies are required…

AdjY estimates: yi , j − yi = β X i , j + uiAdjY

Note: Researchers often exclude observation at hand when

n One example – firm value regression:

Anyone know why AdjY is going to be inconsistent?

n Therefore, AdjY transforms the true data to:

What is the AdjY estimation forgetting?

q By failing to control for X i , AdjY suffers

n Suppose, there are instead two RHS variables

n Use same assumptions as before, but add:

Estimates of both Determining sign and

n AvgE uses group mean of dependent variable

n Following profit regression is an AvgE example:

q ROAs,t = mean of ROA for state s in year t

Anyone know why AvgE is going to be inconsistent?

n AvgE uses group mean of dependent variable

Recall, true model: yi , j = β X i , j + fi + ε i , j

Problem is that yi measures fi with error

n Bias here is complicated because error can be

n With a bit of algebra, it is shown that:

large values for ρXf OLS

-0.75 -0.5 -0.25 0 0.25 0.5 0.75

n In general, all three estimators are inconsistent

n AdjY and AvgE can yield estimates with

n We now run this regression for each approach to

OLS Adj Y Avg E FE

Fixed Assets/ Total Assets 0.270*** 0.066*** 0.103*** 0.248***

Observations 166,974 166,974 166,974 166,974

n Consider the following firm value regression:

q Q = Tobin’s Q for firm i, industry j, year t

n We now run this regression for each approach

Delaware Incorporation 0.100*** 0.019 0.040 0.086**

Observations 55,792 55,792 55,792 55,792

n With this framework, easy to see that other

n Approach fails to control for how avg. independent

§ Even ifinterested in studying yi , j − y i , AdjY

n Researchers occasionally motivate using

Now, let’s see why this is

n Consider the below model with two FE

q Unless panel is balanced, within transformation can

n Estimating FE model with many dummies

n Multiple unobserved heterogeneities

n There exist two techniques that can be

But, there are limitations…

n Estimated typical capital structure

n Our data isn’t perfect…

n Make sure to test that estimates across

n Don’t use AdjY or AvgE!

n  Student presentations of “RDD” papers

n  What is difference between “sharp” and

q  di = indicator for x ≥ x’

q  Sometimes, it is more systematic

n  How will these errors affect analysis?

n  Observations in commonly used datasets

n  How this might affect our analysis?

n  Answer = Need to be careful when

n  Be careful with inferences to avoid

n  Ali, Klasa, and Yeung (RFS 2009) provide

q  How might this be problematic?

n  Answer = Systematic measurement error!

q  From above results, researcher often concludes…

n  Answer = Yes! The difference across subsamples

n  How can you properly test these claims?

n  Important sources of unobserved heterogeneity are

n  Also been used in papers published in

n  As Gormley and Matsa (2014) shows…

n  Now, let’s see why they are wrong…

q  i indexes groups of observations (e.g. industry);

n  Make the standard assumptions:

n  Finally, the following assumptions are made:

q  By failing to control for group effect, fi, OLS

n  One example – firm value regression:

n  Therefore, AdjY transforms the true data to:

q  By failing to control for X i , AdjY suffers

n  Suppose, there are instead two RHS variables

n  Use same assumptions as before, but add:

n  AvgE uses group mean of dependent variable

n  Following profit regression is an AvgE example:

q  ROAs,t = mean of ROA for state s in year t

n  AvgE uses group mean of dependent variable

n  Bias here is complicated because error can be

n  With a bit of algebra, it is shown that:

n  In general, all three estimators are inconsistent

n  AdjY and AvgE can yield estimates with

n  We now run this regression for each approach to

Fixed Assets/ Total Assets 0.270* 0.066* 0.103* 0.248*

n  Consider the following firm value regression:

q  Q = Tobin’s Q for firm i, industry j, year t

n  We now run this regression for each approach

Delaware Incorporation 0.100* 0.019 0.040 0.086

n  With this framework, easy to see that other

n  Approach fails to control for how avg. independent

§  Even ifinterested in studying yi , j − y i , AdjY

n  Researchers occasionally motivate using

n  Consider the below model with two FE

q  Unless panel is balanced, within transformation can

n  Estimating FE model with many dummies

n  Multiple unobserved heterogeneities

n  There exist two techniques that can be

n  Estimated typical capital structure

n  Our data isn’t perfect…

n  Make sure to test that estimates across

n  Don’t use AdjY or AvgE!

n  Related readings… see syllabus

n  Ljungqvist, Malloy, Marston (JF 2009) No comments

n  Bennedsen, et al. (working paper, 2012)