Olivetti 2008
Olivetti 2008
Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at .
http://www.jstor.org/page/info/about/policies/terms.jsp
.
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of
content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms
of scholarship. For more information about JSTOR, please contact [email protected].
The University of Chicago Press, Society of Labor Economists, NORC at the University of Chicago, The
University of Chicago are collaborating with JSTOR to digitize, preserve and extend access to Journal of
Labor Economics.
http://www.jstor.org
I. Introduction
There is substantial international variation in gender pay gaps, from
around 30 log points in the United States and the United Kingdom to
We wish to thank Ivan Fernandez-Val, Larry Katz, Kevin Lang, Alan Manning,
and Steve Pischke for their very helpful suggestions. We also acknowledge comments
from seminars at several institutions, as well as from presentations at the Bank of
Portugal Annual Conference 2005, the SOLE/EALE Conference 2005, the Conference
in Honor of Reuben Gronau 2005, and the National Bureau of Economic Research
Summer Institute 2006. Olivetti aknowledges the Radcliffe Institute for Advanced
621
Studies for financial support during the early stages of the project. Petrongolo ak-
nowledges the ESRC for financial support to the Centre for Economic Performance.
Contact the corresponding author, Barbara Petrongolo, at [email protected].
Fig. 1.—Gender gaps in mean (log) hourly wages and in employment, 1999. Coefficient
of correlation: ⫺0.474.
1
See Altonji and Blank (1999) for an overall survey on both employment and gender
gaps for the United States, Blau and Kahn (2003) for international comparisons of
gender wage gaps, and Azmat, Güell, and Manning (2006) for international compar-
isons of unemployment gaps.
vations for those with imputed wages in the base year can shed light on
the goodness of our imputation methods.
We next use probability models for assigning individuals on either side
of the median of the wage distribution. To do this we fit a probit model
for the probability of an employed worker’s belonging above their gender-
specific median, based on education and labor market experience, and
obtain predicted probabilities for the nonemployed. We then construct
an imputed sample using such predicted probabilities as sampling weights.
We complete our set of results by estimating bounds to the distribution
of wages (see Manski 1994), using either the actual or the imputed wage
distribution in turn. Bounds computed using the observed wage distri-
bution are interesting because they show that all our wage gap estimates
based on imputation do fall within these bounds. When the imputed wage
distribution is used, the increase in the proportion of individuals with a
wage (actual or imputed) allows us to tighten the bounds, as predicted
by the theory.
In our study we use panel data sets that are as comparable as possible
across countries, namely, the Panel Study of Income Dynamics (PSID)
for the United States and the European Community Household Panel
Survey (ECHPS) for Europe. We consider the period 1994–2001, which
is the longest time span for which data are available for all countries. Our
estimates on these data deliver higher median wage gaps on imputed rather
than actual wage distributions for most countries and across alternative
imputation methods. This implies, as one would have expected, that
women tend on average to be more positively selected into work than
men. However, the difference between actual and potential wage gaps is
small in the United States, the United Kingdom, and most central and
northern European countries, and it becomes sizable in southern Europe,
where the gender employment gap is highest. Under our most conser-
vative correction, sample selection into employment explains nearly half
of the observed negative correlation between gender wage and employ-
ment gaps. In particular, in Spain, Italy, Portugal, and Greece, the median
wage gap on the imputed wage distribution reaches levels closely com-
parable to those of the United States and of other central and northern
European countries.
Our results thus show that, while the raw wage gap is much higher in
Anglo Saxon countries than in southern Europe, the reason is probably
not to be found in more equal pay treatment for women in the latter
group of countries but mainly in a different process of selection into
employment. Female participation rates in Catholic countries and Greece
are low and and are concentrated among high-wage women. Having cor-
rected for lower participation rates, the wage gap there widens to levels
similar to those of other European countries and the United States.
The article is organized as follows. Section II discusses the related lit-
erature. Section III describes the data sets used and presents descriptive
evidence on gender gaps. Section IV describes our imputation method-
ologies. Section V estimates raw median gender wage gaps on actual and
imputed wage distributions, to illustrate how alternative sample selection
rules affect the estimated gaps. Conclusions are brought together in Sec-
tion VI.
III. Data
A. The PSID
Our analysis for the United States is based on the Michigan Panel Study
of Income Dynamics (PSID). This is a longitudinal survey of a represen-
tative sample of U.S. individuals and their households. It has been ongoing
since 1968. The data were collected annually through 1997 and every
other year after 1997. In order to ensure consistency with European data,
we use six waves from the PSID, from 1994 to 2001. We restrict our
analysis to individuals aged 25–54, having excluded the self-employed,
full-time students, and individuals in the armed forces.5
5
The exclusion of self-employed individuals may require some justification in so
far as the incidence of self-employment varies importantly across genders and coun-
tries, as well as the associated earnings gap. However, the available definition of income
for the self-employed is not comparable to the one we are using for the employees,
and the number of observations for the self-employed is very limited for European
countries. Both these factors prevent us from including the self-employed in our
analysis.
The wage concept that we use throughout the analysis is the gross
hourly wage. This is given by annual labor income divided by annual
hours worked in the calendar year before the interview date. Employed
workers are defined as those with positive hours worked in the previous
year.
The characteristics that we exploit for wage imputation for the nonem-
ployed are human capital variables, spouse income, and nonemployment
status, that is, unemployed versus out of the labor force. Human capital
is proxied by education and work experience controls. Ethnic origin is
not included here as information on ethnicity is not available for the
European sample. We consider three broad educational categories: less
than high school, high school completed, and college completed. They
include individuals who have completed less than 12 years of schooling,
between 12 and 15 years of schooling, and at least 16 years of schooling,
respectively. This categorization of the years of schooling variable is cho-
sen for consistency with the definition of education in the ECHPS, which
does not provide information on completed years of schooling but only
on recognized qualifications.
Information on work experience refers to years of actual labor market
experience (either full- or part-time) since the age of 18. When individuals
first join the PSID sample as a head or a wife (or cohabitor), they are
asked how many years they worked since age 18 and how many of these
years involved full-time work. These two questions are also asked ret-
rospectively in 1974 and 1985, irrespective of the year in which respon-
dents had joined the sample. The answers to these questions are used to
construct a measure for actual work experience, following the procedure
of Blau and Kahn (2006). Given the initial values reported, we update
work experience information for the years of interest using the longitu-
dinal work history file from the PSID. For example, in order to construct
the years of actual experience in 1994 for an individual who was in the
survey in 1985, we add to the number of years of experience reported in
1985 the number of years between 1985 and 1994 during which they
worked a positive number of hours.6 This procedure allows us to construct
the full work experience in each year until 1997. As the survey became
biannual after 1997, there is no information on the number of hours
worked by individuals between 1997 and 1998 and between 1999 and
2000. We fill missing work experience information for 1998 following
again Blau and Kahn (2006). In particular, we use the 1999 sample to
estimate logit models for positive hours in the previous year and in the
year preceding the 1997 survey, separately for males and females. The
6
The measure of actual experience used here includes both full-time and part-time
work experience, as this is better comparable to the measure of experience available
from the ECHPS.
B. The ECHPS
Data for European countries are drawn from the European Community
Household Panel Survey (ECHPS).7 This is an unbalanced household-
based panel survey, containing annual information on a few thousand
households per country during the period 1994–2001.8 The ECHPS has
the advantage that it asks a consistent set of questions across the 15
member states of the preenlargement European Union. The employment
section of the survey contains information on the jobs held by members
of selected households, including wages and hours of work. The house-
hold section allows us to obtain information on the family composition
of respondents. We exclude Sweden and Luxembourg from our country
set as wage information is unavailable for Sweden in all waves and un-
available for Luxembourg after 1996.
As for the United States, we restrict our analysis of wages to individuals
aged 25–54 as of the survey date, and we exclude the self-employed, those
in full-time education, and those in the military. The definitions of var-
iables used replicates quite closely those used for the United States.
Hourly wages are computed as gross weekly wages divided by weekly
usual working hours. The education categories used are less than upper
secondary high school, upper secondary school completed, and higher
education. These correspond to ISCED 0–2, 3, and 5–7, respectively.
Unfortunately, no information on actual experience is available in the
7
Previous work using ECHPS data for international comparisons of gender gaps
include the OECD (2002) survey and Arulampalam, Booth, and Bryan (2007), who
study the variation in gender pay gaps across quantiles of the wage distribution in 11
EU countries.
8
The initial sample sizes are as follows: Austria: 3,380; Belgium: 3,490; Denmark:
3,482; Finland: 4,139; France: 7,344; Germany: 11,175; Greece: 5,523; Ireland: 4,048;
Italy: 7,115; Luxembourg: 1,011; Netherlands: 5,187; Portugal: 4,881; Spain: 7,206;
Sweden: 5,891; United Kingdom: 10,905. These figures are the number of households
included in the first wave for each country, which corresponds to 1995 for Austria,
1996 for Finland, 1997 for Sweden, and 1994 for all other countries.
9
Similar to other Scandinavian countries, the employment gap in Sweden over the
same sample period is 5.2 percentage points.
cation group j for gender g and Wjg is the associated true wage. In turn,
Wjg is a weighted average of actual wages for the employed and potential
wages for the nonemployed. Assuming that the nonemployed would earn
a wage that is equal to a proportion g of the wage of the employed,
Wjg can then be expressed as
˜ [g ⫹ n (1 ⫺ g)],
Wjg p W (1)
jg jg
where njg is the employment rate of education group j and gender g and
W̃jg is their observed average wage. The reason for first computing (1) by
education and then aggregating over education groups is that gender em-
ployment gaps vary widely by education. Specifically, they everywhere
decline with educational levels, but if anything they do so more strongly
in southern Europe than elsewhere (see Olivetti and Petrongolo 2006,
table 1A).
The parameter g represents the type and extent of sample selection into
employment. In particular, values of g ! 1 (respectively 1 1) indicate pos-
itive (respectively negative) sample selection. For a given g, the role of
selection is magnified by a lower employment rate, njg . Denoting by w
the log of potential wages, the gender wage gap for education group j is
wjmale ⫺ wjfemale. This decreases with g if women have lower employment
rates than men, and it increases with the gender employment gap if there
is positive sample selection (g ! 1).
We can now assess the difference between observed and potential wage
gaps across alternative values of g after aggregating (1) across education
groups. This is shown in table 1 for g p 0.7, 0.5, and 0.3. Column 1
reports for reference the mean wage gap on the 1999 employed sample,
as also pictured in figure 1, together with its correlation with the em-
ployment gap and its coefficient of variation.10 Columns 2–4 report the
mean wage gap, having corrected for sample selection using (1). Gender
wage gaps increase everywhere with lower values of g, and, as expected,
they do more so in countries with high gender employment gaps. In other
words, the higher the gender employment gap, the stronger the impact
of a certain degree of positive sample selection. Selection correction gets
rid of the negative correlation between gender wage and employment gaps
and reduces the coefficient of variation in wage gaps. It is interesting to
10
The coefficient of correlation is better suited here to assess cross-country variation
than the simple standard deviation as the level of the wage gap is also systematically
affected by wage imputation. In col. 1, following Krueger and Summers (1988), we
adjust the standard deviation of estimated gender gaps across countries to account for
the upward bias induced by the least-squares sampling error, i.e., SD p (Var (bˆ c) ⫺
冘14cp1jˆ c2/14)1/2, where bˆ c is the estimated wage gap in country c, jˆ c is the corresponding
standard error, and 14 is the number of countries. To obtain the coefficient of variation,
we divide SD by the cross-country mean of the estimated b̂c’s. The same adjustment
applies to all coefficients of variation reported in tables 2–4.
Table 1
Mean Wage Gaps under Alternative Values of g
Base Sample g p .7 g p .5 g p .3
Country:
United States .325 .409 .434 .460
United Kingdom .269 .309 .336 .365
Finland .181 .239 .260 .283
Denmark .128 .167 .178 .189
Germany .226 .295 .333 .373
Netherlands .248 .334 .385 .440
Belgium .113 .202 .246 .292
Austria .233 .306 .359 .416
Ireland .178 .311 .367 .430
France .124 .207 .260 .318
Italy .052 .223 .311 .414
Spain .109 .296 .387 .494
Portugal .097 .218 .264 .313
Greece .089 .353 .471 .612
Correlation ⫺.474 .302 .616 .806
Coefficient of variation .462 .247 .245 .273
Sources.—Michigan Panel Study of Income Dynamics and European Community Household Panel
Survey.
Note.—The symbol g represents the ratio between the potential wages of the nonemployed and the
observed wages of the employed. Figures reported in rows 1–14 are gender differences in mean log wages.
Wages for each gender are obtained as weighted averages across three education groups. Wages for each
education group depend on g, as illustrated in eq. (1). Figures in the last two rows provide the cross-
country correlation between gender and employment gaps and the coefficient of variation for the gender
wage gap, respectively. Sample description: aged 25–54, excluding the self-employed, the military, and
those in full-time education, 1999.
IV. Methodology
Let w denote the natural logarithm of hourly wages and F(wFg) the
cumulative log wage distribution for each gender, where g p 1 denotes
males and g p 0 denotes females. In what follows, our variable of interest
is the difference between (log) male and female median wages:
where m() is the median function. The (log) wage distribution for each
gender is defined by
F(wFg) p F(wFg, I p 1) Pr (I p 1Fg)
⫹ F(wFg, I p 0)[1 ⫺ Pr (I p 1Fg)], (3)
where I p 1 for the employed and I p 0 for the nonemployed.
Estimated moments of the observed wage distribution are based on the
F(wFg, I p 1) term alone. If there are systematic differences between
F(wFg, I p 1) and F(wFg, I p 0), cross-country variation in Pr (I p 1Fg)
may translate into misleading inferences concerning the international var-
iation in the distribution of potential wage offers. This problem typically
affects estimates of female wage offer distributions; this is even more
the case when one is interested in cross-country comparisons of gender
wage gaps, given the cross-country variation in Pr (I p 1Fg p male) ⫺
Pr (I p 1Fg p female), measuring the gender employment gap. But
F(wFg), the term of interest, is not identified, because data provide in-
formation on F(wFg, I p 1) and Pr (I p 1Fg) but clearly not on
F(wFg, I p 0), as wages are only observed for those who are in work.
In particular, using (3), the median log wage for each gender, m, is
defined by
1
F(mFg, I p 1) Pr (I p 1Fg) ⫹ F(mFg, I p 0)[1 ⫺ Pr (I p 1Fg)] p . (4)
2
Our goal is to retrieve gender gaps in median (potential) wages, as illus-
trated in equation (2), with gender medians defined in equation (4). To
do this we need to retrieve information on F(mFg, I p 0), representing
the probability that nonemployed individuals have potential wages below
the median.
It can be shown that knowledge of F(mFg, I p 0) allows us to identify
the median wage gap in potential wages using median wage regressions,
as a simpler alternative to numerically solving (4). Let us consider the
linear wage equation
wi p b0 ⫹ b1 gi ⫹ i , (5)
where wi denotes (log) potential wages, b0 is a constant term, b1 is the
parameter of interest, and i is an error term such that m(iFgi ) p 0. De-
note by b̂ the hypothetical least absolute deviations (LAD) regression
estimator based on potential wages, that is, b̂ { arg min b 冘Nip1 Fwi ⫺
b0 ⫺ b1 giF, where b { [b0 b1 ] .
However, wages wi are only observed for the employed; they are missing
for the nonemployed. Consider an example in which missing wages fall
completely below the median regression line, that is, wi ! wˆ i p bˆ 0 ⫹ bˆ 1 gi
for the nonemployed (Ii p 0), or equivalently, F(mFg, I p 0) p 1 . One can
ip1
Fyi ⫺ b0 ⫺ b1 giF
冘
N
Condition (6) states that the LAD estimator is not affected by impu-
tation. In other words, obtaining b̂ using the transformed dependent var-
iable yi gives the same estimate that one would obtain if potential wages
were available for the whole population. Now consider an alternative
example in which missing wages fall completely above the median re-
gression line, that is, wi 1 w ˆ i for Ii p 0, or equivalently, F(mFg, I p
0) p 0. The result in (6) still holds, having set yi equal to some arbitrarily
high imputed value w ¯ (such that w ¯i1w ˆ i ) for the nonemployed. More in
general, the LAD estimator is not affected by imputation when the missing
wage observations are imputed “so as to maintain the same sign of the
residual” (Bloomfield and Steiger 1983, 52). That is, (6) is valid whenever
missing wage observations are imputed on the “correct” side of the median.
As a further example, suppose that the potential wages of the nonemployed
could be classified into two groups, L and U, such that wi ! w ˆ i for i 苸 L
and wi 1 w ˆ i for i 苸 U. One can define yi as a transformed variable such
that yi p wi for Ii p 1, yi p w — for Ii p 0 and i 苸 L, and yi p w ¯ for
Ii p 0 and i 苸 U, and LAD inference is still valid.
Using this result, one can estimate median wage gaps, based on wage
imputation for the nonemployed that simply requires assumptions on the
position of the imputed wage observations with respect to the median of
the wage distribution, as done in Johnson et al. (2000) and Neal (2004).
The attractive feature of median regressions is that results are only affected
by the position of imputed wage observations with respect to the median
and not by specific values of imputed wages as it would be in the matching
approach. In this article, we will estimate median wage gaps under alter-
native imputation rules, that is, under alternative conjectures over
F(mFg, I p 0). These imputation rules are described in detail below.
sumption is that, for a given individual i, the latent wage position with
respect to her predicted (gender-specific) median when she is nonem-
ployed can be proxied by her wage in the nearest wave in which she is
employed. As the position with respect to the median is determined using
alternative information on wages, as opposed to measured characteristics,
we are allowing for selection on unobservables.
Formally, we will assume
F(mFgi , Iit p 0) p F(mFgi , Iit p 1), (7)
where t is our base year and t is the wave nearest to t in which we have
a nonmissing wage observation. In practice, we impute yit p wit for
Iit p 0. This procedure of imputation makes sense if an individual’s po-
sition in the wage distribution stays on the same side of the median when
the individual is switching employment status. As we estimate median
wage gaps, we do not need an assumption of stable rank throughout the
whole wage distribution but only with respect to the median. Should the
position of individuals in the wage distribution change with employment
status, movements that happen within either side of the median do not
invalidate this method.
While imputation based on this procedure arguably uses the minimum
set of potentially arbitrary assumptions, it has the disadvantage of not
providing any wage information on individuals who never worked during
the sample period. It is therefore important to understand in which di-
rection this problem may distort, if at all, the resulting median wage gaps.
If women are on average less attached to the labor market than men, and
if attachment increases with potential wages, then the difference between
the median gender wage gap on the imputed and the actual wage distri-
bution tends to be higher the higher the proportion of imputed wage
observations in total nonemployment in the base year. Consider, for ex-
ample, a country with a very persistent female employment status: women
who do not work in the base year and are therefore less attached are also
unlikely to work at all in the whole sample period. In this case, low wage
observations for less-attached women are unlikely to be recovered and
the estimated wage gap is relatively low. Proportions of imputed wage
observations over the total nonemployed population in 1999 (our base
year) are reported in table A2 of the online appendix: the differential
between male and female proportions tends to be higher in Germany,
Austria, France, and southern Europe than elsewhere. Under reasonable
assumptions we should therefore expect the difference between the median
wage gap on the imputed and the actual wage distribution to be biased
downward relatively more in this set of countries. This, in turn, means
that we are being relatively more conservative in assessing the effect of
nonrandom employment selection in these countries than elsewhere.
Even so, it would, of course, be preferable to recover wage observations
also for those never observed in work during the whole sample period.
To do this, we rely on the observed characteristics of the nonemployed.
B. Imputation on Observables
We use observable characteristics for wage imputation with two meth-
ods. With the first method, we make assumptions on the position of
missing wages with respect to their gender-specific median, based on a
small number of characteristics, summarized into the vector Xi. We can
illustrate this with a very simple example. Suppose that Xi only includes
years of completed education. This implies that we are using information
on education for someone who is nonemployed to place them above or
below their gender-specific median. We can define a threshold for Xi, x—
(say, 11 years of schooling), below which nonemployed individuals would
earn below-median wages, and another threshold x̄ (say, 16 years), above
which individuals would earn above-median wages.
More formally we assume that
F(mFgi , Ii p 0, Xi ≤ x)
— p 1; F(mFgi , Ii p 0, Xi ≥ x)
¯ p 0, (8)
where —x and x¯ are low and high values of Xi, respectively.11 In this case,
the imputed dependent variable yi is set equal to w— for i such that Ii p
0 and Xi ≤ x— and is set equal to w¯ for i such that Ii p 0 and Xi ≥ x¯ .
This method for placing individuals with respect to the median follows
an educated guess, based on their observable characteristics. However, we
can use wage information from other waves in the panel to assess the
goodness of such guess, as will be illustrated in Section V.B.
With the second method, we use probability models for imputing miss-
ing wage observations. In this case our imputation rule assumes that
F(mFgi , Ii p 0, Xi ) p Pˆ i , (9)
where P̂i is the predicted probability to belong below the median, based
on probit estimates.
We implement this imputation method in two steps. In the first step,
we estimate the probability of an individual’s wage belonging below the
median of the wage distribution, based on a set of observable character-
istics. On the employed sample, we define Mi p 1 for individuals earning
less than their gender-specific median and Mi p 0 for the others. We
estimate a probit model for Mi for each gender, with explanatory variables
Xi. Using the probit estimates, we obtain predicted probabilities of having
a latent wage below the median, Pˆ i p F(gXˆ i ) p Pr (Mi p 1FXi), for the
nonemployed subset, where F is the cumulative distribution function of
the standardized normal distribution and ĝ is the estimated parameter
11
All variables in (8) refer to the (same) base year, so time subscripts are omitted.
vector from the probit regression. Predicted probabilities P̂i are then used
in the second step as sampling weights for the nonemployed. That is, we
construct an imputed sample in which the employed feature with their
observed wage and the nonemployed feature with a wage below median
with a weight Pˆ i and a wage above median with a weight 1 ⫺ Pˆ i. The
statistics of interest is the gender wage gap estimated on the imputed
sample. The associated variance is obtained by bootstrap to correct for
the fact that the weights used are based on probit estimates.
Note that in the first step we need a reference median wage in order
to define Mi. The readily available candidate would be the median ob-
served wage, but precisely due to selection this may be quite different
from the latent median wage, thus potentially delivering biased estimates.
In order to attenuate this problem, we also perform repeated imputation
on an expanded sample, augmented with wage observations from adjacent
waves. This allows us to get a better estimate of the potential median in
the first step of our procedure, thereby generating more appropriate es-
timates of the median wage gap on the final, imputed sample.
our imputed sample is typically larger than the one obtained with the
first method, although it is still substantially smaller than the existing
population. Finally, with the third method, we estimate the probability
of belonging above the median for the whole range of our vector of
characteristics, thus recovering predicted probabilities and imputed wages
for the whole existing population.
Different imputed samples will have an impact on our estimated median
wage gaps. In so far as women tend to be more positively selected into
employment than men, the larger the imputed sample with respect to the
actual sample of employed workers, the larger the estimated correction
for selection.
Having said this, it is important to stress that, with all three imputation
methods used, we never impose positive selection ex ante (except in a
benchmark example), and thus there is nothing that would tell a priori
which way correction for selection is going to affect the results. This is
ultimately determined by the wages that the nonemployed earned when
they were previously (or later) employed and by their observable char-
acteristics, depending on methods.
Before moving on to the discussion of our estimates, it is worthwhile
to motivate our choice of selection correction methodology and to frame
it in the context of the existing literature on sample selection. A number
of approaches can be used to correct for nonrandom sample selection in
wage equations and/or recover the distribution in potential wages. The
seminal approach suggested by Heckman (1974, 1979) consists in allowing
for selection on unobservables, that is, on variables that do not feature
in the wage equation but that are observed in the data.12 Heckman’s two-
stage parametric specifications have been used extensively in the literature
in order to correct for selectivity bias in female wage equations. More
recently, these have been criticized for lack of robustness and distribu-
tional assumptions (see Manski 1989). Approaches that circumvent most
of the criticism include semiparametric selection correction models that
have appeared in the literature since the early 1980s (see Vella [1998] for
an extensive survey of both parametric and nonparametric sample selec-
tion models). Two-stage nonparametric methods allow us in principle to
approximate the bias term by a series expansion of propensity scores from
12
In this framework, wages of employed and nonemployed would be recovered
as
E(wFZw, I p 1) p Zwdw ⫹ E(wFI 1 ⫺ZIdI)
E(wFZw, I p 0) p Zwdw ⫹ E(wFI ! ⫺ZIdI),
respectively, where Zw and ZI are the set of covariates included in the wage and
selection equations, respectively, with associated parameters dw and dI , and w and
I are the respective error terms.
the selection equation, with the qualification that the term of order zero
in the polynomial is not separately identified from the constant term in
the wage equation unless some additional information is available (see
Buchinsky 1998). Usually the constant term in the wage regression is
identified from a subset of workers for which the probability of work is
close to one, but in our case this route is not feasible since for no type
of women is the probability of working close to one in all countries.
Selection on observed characteristics is instead exploited in the matching
approach, which consists in imputing wages for the nonemployed by
assigning them the observed wages of the employed with matching char-
acteristics (see Blau and Beller 1992; and Juhn 1992, 2003). The approach
of this article is also based on some form of wage imputation for the
nonemployed, but it simply requires assumptions on the position of the
imputed wage observations with respect to the median of the wage dis-
tribution. Importantly, it does not require assumptions on the actual level
of missing wages, as is typically required in the matching approach, nor
it requires the arbitrary exclusion restrictions that are often invoked in
the two-stage Heckman sample selection correction models.
D. Bounds
As discussed above, each imputation method is based on identifying
assumptions that are largely untested. In order to illustrate that results
delivered by our imputation methods are reasonable, we also provide
“worst case” bounds to the gap in potential median wages that do not
require any identifying assumption, as shown by Manski (1994) and Blun-
dell et al. (2007). We will then check that our estimated wage gaps on
imputed wage distributions fall into these bounds.
Manski notes that substituting the inequality 0 ≤ F(wFg, I p 0) p 1
into (3) gives bounds for the true cumulative distribution
1
F(mFg, I p 1) Pr (I p 1Fg) ≤
2
≤ F(mFg, I p 1) Pr (I p 1Fg) ⫹ [1 ⫺ Pr (I p 1Fg)]. (11)
These bounds on F() deliver the following worst case bounds on the
gender-specific median
such that
1
p F(m lFg, I p 1) Pr (I p 1Fg) ⫹ [1 ⫺ Pr (I p 1Fg)], (13)
2
1
p F(muFg, I p 1) Pr (I p 1Fg). (14)
2
Bounds on the gender-specific median can be obtained by solving (13)
and (14) using data on the observed wage distribution and employment
rates. Note that conditions (13) and (14) imply that one can only identify
bounds for the median if Pr (I p 1Fg) p 1/2. Hence, we will not be able
to obtain such bounds for the female median wage (and, therefore, for
the gender wage gap) in countries where less than 50% of the women
have a wage observation.
Having said this, the bounds for the median gender wage gap D defined
in (2) are obtained as follows:
m l(wFg p male) ⫺ mu (wFg p female)
≤ D ≤ mu (wFg p male) ⫺ m l(wFg p female). (15)
V. Results
A. Imputation on Wages from Adjacent Waves
In our first set of estimates, an individual’s position with respect to the
median of the wage distribution in the base year is proxied by the position
of his or her wage obtained from the nearest available wave. The kind of
imputation made here requires that individuals stay on the same side of
their gender median across different waves in the panel (see eq. [7]).
Results obtained with this method are reported in table 2.
Column 1 reports the actual wage gap for reference: this is the median
wage gap for individuals with an hourly wage in 1999, which is our base
year. The wage gaps of column 1 replicate very closely those plotted in
figure 1, with the only difference being that figure 1 plotted mean as opposed
to median wage gaps.13 As in figure 1, the United States and the United
Kingdom stand out as the countries with the highest wage gaps, followed
by central Europe, and finally by Scandinavia and Southern Europe.
In column 2, missing wage observations in 1999 are replaced with the
real value of the nearest wage observation in a 2-year window, while in
column 3, they are replaced with the real value of the nearest wage obser-
13
The absence of any important difference between mean and median wage
gaps on the observed wage distribution is good news for our approach, based on
the recovery of selection-corrected median wage gaps.
Table 2
Median Wage Gaps under Alternative Sample Inclusion Rules:
Wage Imputation Based on Wage Observations from Adjacent Waves
1 2 3 4 5
Country:
United States .339 .352 .361 .354 .363
United Kingdom .256 .277 .284 .291 .301
Finland .160 .194 .197 .203 .206
Denmark .086 .100 .100 .093 .093
Germany .191 .223 .214 .234 .234
Netherlands .178 .193 .199 .195 .199
Belgium .078 .099 .112 .098 .112
Austria .192 .224 .234 .220 .229
Ireland .232 .273 .284 .292 .300
France .095 .133 .152 .140 .164
Italy .059 .062 .075 .070 .079
Spain .097 .153 .168 .143 .157
Portugal .150 .168 .186 .169 .185
Greece .111 .148 .184 .148 .185
Correlation ⫺.329 ⫺.263 ⫺.181 ⫺.269 ⫺.199
Coefficient of variation .484 .416 .382 .427 .392
Sources.—Michigan Panel Study of Income Dynamics and European Community Household Panel
Survey.
Note.—All wage gaps are significant at the 1% level. Figures in the last two rows display the cross-
country correlation between the reported gender wage gap and the gender employment gap and the
coefficient of variation of the gender wage gap. Sample description: aged 25–54, excluding the self-
employed, the military, and those in full-time education, 1999. Sample inclusion rules by columns: (1)
Employed at time of survey in 1999. (2) Wage imputed from other waves when nonemployed (⫺2 , ⫹
2 window). (3) Wage imputed from other waves when nonemployed (⫺5 , ⫹2 window). (4) Wage imputed
from other waves when nonemployed (⫺5 , ⫹2 window), adjusted for real wage growth by gender and
country. (5) Wage imputed from other waves when nonemployed (⫺5 , ⫹2 window), adjusted for real
wage growth by gender and country.
14
Note, however, that, even if real wage growth were homogeneous across
genders, imputation based on wage observations from adjacent waves would not
be affected only if the proportion of men and women in the sample remained
unchanged after imputation.
15
Of course, for our estimated rates of wage growth to be unbiased, this pro-
cedure requires that participation into employment be unaffected by wage growth,
which may not be the case.
B. Imputation on Observables
In table 3, we exploit some observable characteristics of the nonem-
ployed for assigning them on one or the other side of their gender median
(eq. [8] gives the formal identifying assumption). Column 1 reports for
reference the median wage gap on the base sample, which is the same as
the one reported in column 1 of table 2. In column 2, we assume that all
those not in work in 1999 would have wage offers below the median for
their gender.16 This is an extreme assumption, and it is the only case in
which we impose ex ante positive sample selection. This assumption is
clearly violated for countries like Italy, Spain, and Greece, in which more
than half of the female sample is not in work in 1999, and thus estimates
are not reported for these three countries. However, also for other coun-
tries, there are reasons to believe that not all nonemployed individuals
would have wage offers below their gender mean. Having said this, es-
timated median wage gaps increase substantially for most countries, except
for Denmark and Finland. The correlation with employment gaps turns
positive and quite strong because the wage gap in high employment-gap
countries increases disproportionately relative to other countries.
Of course, one cannot know exactly what wages these individuals
would have received had they worked in 1999. But we can form an idea
of the goodness of this assumption by looking again at wage observations
(if any) for these individuals in all other waves in the panel. This allows
us to see whether an imputed observation had a wage that was indeed
below their predicted gender median at the time he or she was observed
working. Specifically, we take all imputed observations in 1999. Among
these, we select those who ever worked at some time in the sample period.
Out of this subset, we compute the proportion of observations who had
wages below the predicted gender median. Such proportions are computed
for men and women and are reported in columns headed “M” and “F,”
16
In practice, whenever we assign someone a wage below the median, we pick
wi p ⫺5, this value being lower than the minimum observed (log) wage for all
countries and thus lower than the median. Similarly, whenever we assign someone
a wage above the median, we pick w̄i p 20.
Country:
United States .339 .432 .90 .74 .335 1.00 .89 .348 .67 .78 .348 .82 .88 .367
United Kingdom .256 .375 .80 .63 .238 .83 .81 .246 .71 .56 .253 .87 .89 .287
Finland .160 .191 .86 .73 .179 .87 .86 .161 .40 .43 .160 .80 .83 .188
Denmark .086 .122 .77 .77 .100 .85 .81 .076 .63 .55 .092 1.00 .60 .100
Germany .191 .368 .87 .53 .214 .88 .74 .192 .53 .54 .196 .94 .86 .215
Netherlands .178 .353 .67 .50 .246 .67 .62 .208 .67 .78 .189 .79 .78 .221
Belgium .078 .228 .68 .70 .113 .70 .82 .082 .80 .52 .084 .69 1.00 .116
Austria .192 .360 .89 .60 .210 .91 .72 .192 .00 .43 .208 .88 .80 .234
Ireland .232 .591 .83 .33 .213 .87 .94 .237 .75 .79 .241 .73 .92 .291
France .095 .357 .81 .56 .134 .84 .82 .114 .75 .79 .102 .83 .86 .161
Italy .059 ... .75 ... .092 .78 .69 .200 .93 .74 .088 .83 .86 .186
Spain .097 ... .71 ... .182 .74 .65 .199 .70 .78 .109 .72 .96 .235
Portugal .150 .357 .65 .52 .188 .61 .69 .264 .67 .71 .162 .69 .69 .258
Greece .111 ... .78 ... .175 .81 .73 .349 .67 .70 .160 .73 .60 .362
Correlation ⫺.329 .625 ⫺.120 .435 ⫺.200 .395
Coefficient of variation .484 .364 .329 .393 .430 .336
respectively. They are fairly high for men, but they are sensibly lower for
women, which makes the estimates based on this extreme imputation case
a benchmark rather than a plausible measure for the gender wage gap.
In column 3, we impute a wage below the median to all those who are
unemployed (as opposed to nonparticipants) in 1999. The unemployed,
by definition, are receiving wage offers (if any) below their reservation
wage, while the employed have received at least one wage offer above
their reservation wage. At constant reservation wages, the unemployed
have lower potential wages than the observed wages of the employed and
are thus assigned an imputed wage value below the median. This impu-
tation leaves the median wage gap roughly unchanged with respect to the
base sample in the United States, the United Kingdom, Scandinavia, Ger-
many, Austria, and Ireland and raises it substantially elsewhere, especially
in southern Europe. Also, the proportion of “correctly” imputed obser-
vations, computed as for the previous imputation case, is now much
higher. Those who do not work because they are unemployed are thus
relatively more likely to be overrepresented in the lower half of the wage
distribution. Selection now explains 64% of the correlation between wage
and employment gaps and 32% of the cross-country variation.
In column 4, we follow standard human capital theory and assume that
all those with less than upper secondary education and fewer than 10
years of labor market experience have wage observations below the me-
dian for their gender. Those with at least higher education and at least 10
years of labor market experience are instead placed above the median. In
the four southern European countries, the gender wage gap increases
enormously with respect to the actual wage gap of column 1 and, as a
consequence, the correlation with employment gaps turns positive. It is
interesting that the proportion of correctly imputed observations is high
in Ireland, France, and southern Europe but that is not so much so in
other countries, where imputation based on unemployment works better
than imputation based on human capital components.
The imputation method of column 5 is implicitly based on the as-
sumption of assortative mating along wage attributes and consists of as-
signing wages below the median to those whose partners have total income
in the bottom quartile of the gender-specific distribution. The assumption
is that individuals married to low-productivity spouses also have low
productivity, and thus the spouse’s wage is taken as a proxy for an in-
dividual’s potential wage offer. The results are qualitatively similar to those
of column 3: the wage gap is mostly affected in southern Europe, but on
average it is less affected than in other imputation examples. It would be
natural to perform a similar exercise at the top of the distribution by
assigning a wage above the median to those whose partner earns income
in the top quartile. However, in this case, the proportion of correctly
imputed observations was too low to rely on the assumption used for
Table 4
Median Wage Gaps under Alternative Imputation Rules:
Wage Imputation Based on Observables—Probabilistic Model
Weighted
Base Sample Imputation
(1) (2) (3)
Country:
United States .339 .359 .371
United Kingdom .256 .264 .292
Finland .160 .179 .199
Denmark .086 .100 .100
Germany .191 .200 .232
Netherlands .178 .229 .235
Belgium .078 .117 .154
Austria .192 .205 .236
Ireland .232 .319 .341
France .095 .182 .186
Italy .059 ... .229
Spain .097 ... .333
Portugal .150 .272 .272
Greece .111 ... .593
Correlation ⫺.329 .291 .686
Coefficient of variation .484 .339 .427
Sources.—Michigan Panel Study of Income Dynamics and European Community
Household Panel Survey.
Note.—All wage gaps are significant at the 1% level. In specification 2 no results
are reported for Italy, Spain, and Greece as more than 50% of women in the sample
are nonemployed. Figures in the last two rows display the cross-country correlation
between the reported gender wage gap and the gender employment gap and the co-
efficient of variation of the gender wage gap. Sample description: aged 25–54, excluding
the self-employed, the military, and those in full-time education, 1999. Sample inclusion
rules by columns: (1) Employed at time of survey in 1999. (2) Impute wage ! (resp. 1)
median with probability Pˆ i (resp. 1 ⫺ Pˆ i) if nonemployed. Pˆ i is the predicted probability
of having a wage below the gender-specific base sample median, as estimated from a probit
model including two education dummies, experience, and its square. (3) Impute wage !
(resp. 1) median with probability Pˆ i (resp. 1 ⫺ Pˆ i) if nonemployed. Pˆ i as above, having
enlarged the base sample with wage observation from adjacent waves.
for workers earning less than the median for these countries. In column
3, we use as the reference median the one obtained on a wage distribution
enlarged with wage imputation from all other waves, and in this case the
fraction of missing wages is below 50% for men and women in all coun-
tries. If wage imputation is correct, this procedure delivers a reference
median that is closer to the latent median than the observed median.
Comparing column 1 to columns 2 and 3 shows that the median wage
gap on imputed wage distributions increases mildly in most countries
down to Austria but rises substantially in Ireland, France, and Portugal
and enormously in Italy, Spain, and Greece, which are the countries with
the highest employment gaps.
To broadly summarize our findings (a summary is provided in table
5), one could note that, whether one corrects for selection on unobserv-
ables (table 2) or observables (tables 3 and 4), our results are qualitatively
consistent in identifying a clear role of sample selection in countries with
United States 2,835 3,551 95.9 83.3 98.2 91.4 98.5 92.7 100.0 100.0 96.4 84.1 96.9 88.9 96.7 86.3 99.0 95.3 99.7 99.3
United Kingdom 1,998 2,458 87.2 74.3 92.4 84.9 93.3 88.0 96.9 96.8 91.2 76.2 90.5 81.0 89.9 77.1 95.3 91.1 98.8 98.2
Finland 1,464 1,697 92.6 82.5 96.9 92.9 97.2 93.6 99.4 98.8 98.8 91.2 93.2 86.7 93.2 83.3 97.4 95.3 100.0 99.6
Denmark 996 1,054 94.0 88.1 99.5 97.4 99.5 97.7 98.3 98.1 97.5 94.1 95.0 90.5 94.6 88.9 99.7 98.0 99.9 99.6
Germany 2,822 3,064 91.2 68.8 96.7 83.1 98.1 87.0 98.9 93.4 97.6 75.2 92.2 72.0 92.6 70.7 98.4 88.4 99.5 97.0
Netherlands 2,322 2,750 94.1 70.4 96.7 81.5 97.1 84.3 99.8 99.1 96.7 82.0 95.8 77.8 95.6 72.9 98.1 89.1 99.8 99.6
Belgium 1,170 1,382 90.3 70.2 94.1 77.9 94.9 81.5 98.7 98.2 95.8 80.2 91.6 77.1 92.8 73.5 95.7 86.6 99.0 97.5
Austria 1,296 1,378 95.8 69.3 98.5 78.7 98.7 81.6 99.8 97.8 99.2 72.2 95.8 70.9 96.6 71.4 98.7 82.7 99.8 95.8
Ireland 1,053 1,345 84.9 58.1 89.8 70.8 90.7 73.8 99.6 99.0 92.9 61.0 88.3 64.0 88.9 62.2 93.0 78.3 99.8 99.2
France 2,601 2,923 73.1 55.7 92.8 75.6 94.4 80.1 85.5 90.2 79.8 64.6 75.3 63.7 74.8 58.0 95.7 84.9 98.4 97.0
Italy 3,113 3,744 79.3 45.2 89.9 54.9 91.3 57.9 94.9 96.8 91.9 56.1 83.2 64.0 81.5 49.7 93.8 74.9 98.6 96.6
Spain 2,680 3,065 83.7 45.5 91.2 59.4 92.6 62.5 99.7 99.6 93.8 56.1 87.0 60.1 86.5 47.9 94.7 73.8 99.8 99.6
Portugal 2,019 2,251 91.5 67.2 95.1 75.5 95.9 78.1 99.7 99.3 95.0 73.5 94.6 82.0 92.6 70.1 98.4 90.2 99.7 99.1
Greece 1,379 1,938 87.1 38.4 94.5 49.0 95.1 52.1 99.9 99.4 95.9 47.1 89.7 60.6 88.5 42.1 96.4 71.1 100.0 99.3
Sources.—Michigan Panel Study of Income Dynamics and European Community Household Panel Survey.
Note.—Figures in cols. 1–9 represent the proportions of males and females included in the sample across imputation rules of tables 2–4. Sample description: aged 25–54,
C. Bounds
Each imputation rule requires assumptions about the position of the
nonemployed relative to the median of the potential wage distribution.
In order to show that we obtain reasonable estimates for the median wage
gap under each specification, we compute bounds following the procedure
discussed in Section IV. Table 6 reports “worst case” bounds to the po-
tential distribution for the base sample and for a subset of wage imputation
rules.
Column 1 reports bounds using the actual wage distribution to obtain
the F() terms in conditions (13) and (14). All estimates for the median
wage gap obtained with alternative imputation methods and reported in
tables 2–4 lie within the bounds to the potential distribution reported in
column 1. Note that, mechanically, the bounds for the gender-specific
18
We have performed a number of robustness tests and more disaggregate anal-
yses on the results reported in tables 2–4. First, we repeated all estimates using
a common set of age weights (obtained from the U.S. 1999 sample) for all countries.
Results using such weights were virtually identical to those obtained without
weights, and thus variation in the age structure across countries does not seem
to explain much of the observed variation in gender pay gaps. Second, for the
imputation rules reported in tables 2 and 3, we have repeated our estimates sep-
arately for three education groups (less than upper secondary education, upper
secondary education, and higher education), and we found that most of the se-
lection occurs between rather than within groups, as median wage gaps disag-
gregated by education are much less affected by sample inclusion rules than in
the aggregate. Finally, we have repeated our estimates separately for three de-
mographic groups: single individuals without children in the household, married
or cohabiting without children, and married or cohabiting with children. We found
evidence of a strong selection effect in France and southern Europe among those
who are married or cohabiting, especially when they have children, and much
less evidence of selection among single individuals without children.
Table 6
“Worst Case” Bounds to Median Wage Gaps under Alternative
Imputation Rules
1 2 3 4
Lower Upper Lower Upper Lower Upper Lower Upper
Bound Bound Bound Bound Bound Bound Bound Bound
United States .153 .513 .268 .397 .164 .504 .218 .447
United Kingdom ⫺.042 .562 .126 .373 .005 .505 .045 .460
Finland .059 .255 .147 .234 .131 .236 .073 .236
Denmark .035 .155 .074 .099 .064 .115 .040 .150
Germany ⫺.065 .521 .110 .297 .013 .399 ⫺.025 .462
Netherlands ⫺.028 .406 .088 .274 .072 .289 .040 .322
Belgium ⫺.184 .321 ⫺.061 .203 ⫺.070 .203 ⫺.110 .246
Austria ⫺.030 .417 .087 .288 .011 .365 ⫺.017 .402
Ireland ⫺.533 .815 ⫺.037 .494 ⫺.374 .680 ⫺.311 .639
France ⫺.668 .830 ⫺.089 .265 ⫺.387 .565 ⫺.430 .623
Italy ... ... ⫺.419 .377 ⫺.496 .485 ⫺.282 .360
Spain ... ... ⫺.510 .374 ⫺.718 .715 ⫺.577 .640
Portugal ⫺.396 .470 ⫺.111 .323 ⫺.208 .376 ⫺.066 .315
Greece ... ... ⫺.940 .924 ... ... ⫺.496 .566
Sources.—Michigan Panel Study of Income Dynamics and European Community Household Panel
Survey.
Note.—In specification 2 no results are reported for Italy, Spain, and Greece as more than 50% of
women in the sample are nonemployed. Similarly, this is the case in specification 3 for Greece. Sample
description: aged 25–54, excluding the self-employed, the military, and those in full-time education, 1999.
Sample inclusion rules by columns: (1) Employed at time of survey in 1999. (2) Wage imputed from
other waves when nonemployed (⫺5 , ⫹2 window). (3) Impute wage ! median when unemployed. (4)
Impute wage ! median when nonemployed and education ! upper secondary education and experience
! 10 years; impute wage 1 median when nonemployed and education ≥ higher education and experience
≥ 10 years.
median are tighter the higher the employment rate for that gender. Since
variation in male employment rates is low relative to variation in female
rates, cross-country differences in the tightness of the bounds mostly stem
from differences in women’s employment selection across countries.
Bounds for the median gender wage gap are thus much tighter for the
United States, the United Kingdom, and countries all the way down to
Austria than they are for Ireland, France, and Portugal—for which they
are so large as to be completely uninformative. Indeed, we cannot even
obtain bounds to the median wage gap for Italy, Spain, and Greece on
the base sample because less than 50% of women are employed.
A restriction typically used to tighten such bounds is that of stochastic
dominance (see Blundell et al. 2007), which assumes various forms of
positive selection into employment. As this is precisely something that
our article is assessing, we cannot use it as an identifying assumption. But
we can instead compute bounds after wage imputation, that is, using
imputed wage distributions to compute the F() terms in (13) and (14).
This procedure has the advantage of tightening the bounds without as-
suming positive sample selection ex ante. The estimated bounds are re-
ported in columns 2–4 of table 6. In column 2, the wage distribution used
is one in which missing wage observations are replaced by observed wages
in the nearest available wave (as in col. 2 of table 2). In column 3, missing
wage observations are imputed below the median if an individual is un-
employed (as in col. 3 of table 3). In column 4, they are imputed using
education and experience levels of the nonemployed (as in col. 4 of table
3). As employment rates are higher in columns 2–4 than in column 1,
bounds do become tighter. However, they still remain relatively large in
southern Europe, where employment rates remain relatively low even
after wage imputation.
VI. Conclusions
Gender wage gaps in the United States and the United Kingdom are
much higher than in other European countries, and especially so with
respect to France and southern Europe. Although at first glance this fact
may suggest evidence of a more equal pay treatment across genders in
the latter group of countries, appearances can be deceptive.
In this article, we note that gender wage gaps across countries are
negatively correlated with gender employment gaps, and we illustrate the
importance of nonrandom selection into work in understanding the ob-
served international variation in gender wage gaps. To do this, we perform
wage imputation for those not in work by simply making assumptions
on the position of the imputed wage observations with respect to the
median. Imputation is performed according to different methodologies
based on observable or unobservable characteristics of missing wage
observations.
We find higher median wage gaps on imputed rather than actual wage
distributions for most countries in the sample, meaning that, as one would
have expected, women tend on average to be more positively selected into
work than men. However, this difference is small in the United States,
the United Kingdom, and a number of central and northern European
countries, and it is sizable in France and southern Europe, that is, in
countries in which the gender employment gap is particularly high. Our
(most conservative) estimates suggest that correction for employment se-
lection explains about 45% of the observed negative correlation between
wage and employment gaps. In Italy, Spain, Portugal, and Greece, the
median wage gap on the imputed wage distribution ranges between 20
and 30 log points across specifications. These levels are closely comparable
to those of the United States and of other European countries.
Another interesting result is that we obtain qualitatively similar esti-
mates whether we impute missing wages using available wage information
from other waves in the panel or whether we use observable characteristics
of the nonemployed. This implies that employment selection mostly takes
place along a small number of measurable characteristics.
Our analysis identifies directions for future work. We argue that gender
References
Albrecht, James, Aico van Vuuren, and Susan Vroman. 2004. Decom-
posing the gender wage gap in the Netherlands with sample selection
adjustment. IZA Discussion Paper no. 1400, Institute for the Study of
Labor, Bonn.
Altonji, Joseph G., and Rebecca M. Blank. 1999. Race and gender in the
labor market. In Handbook of labor economics, vol. 3C, ed. Orley
Ashenfelter and David Card, 3141–3259. Amsterdam: North-Holland.
Arulampalam, Wiji, Alison L. Booth, and Mark L. Bryan. 2007. Is there
a glass ceiling over Europe? Exploring the gender pay gap across the
wage distribution. Industrial and Labor Relations Review 60, no. 2:
163–86.
Azmat, Ghazala, Maia Güell, and Alan Manning. 2006. Gender gaps in
unemployment rates in OECD countries. Journal of Labor Economics
24, no. 1:1–37.
Beblo, Miriam, Denis Beninger, Anja Heinze, and François Laisney. 2003.
Measuring selectivity-corrected gender wage gaps in the EU. ZEW
Discussion Paper no. 03-74, Centre for European Economic Research,
Mannheim.
Blau, Francine D., and Andrea H. Beller. 1992. Black-white earnings over
the 1970s and 1980s: Gender differences in trends. Review of Economics
and Statistics 74, no. 2:276–86.
Blau, Francine D., and Lawrence M. Kahn. 1996. Wage structure and
gender earnings differentials: An international comparison. Economica
63 (May): S29–S62.
———. 2003. Understanding international differences in the gender pay
gap. Journal of Labor Economics 21, no. 1:106–44.
———. 2006. The US gender pay gap in the 1990s: Slowing convergence?
Industrial and Labor Relations Review 60, no. 1:45–66.
Bloomfield, Peter, and W. L. Steiger. 1983. Least absolute deviations: The-
ory, applications and algorithms. Boston: Birkhauser.
Blundell, Richard, Amanda Gosling, Hidehiko Ichimura, and Costas
Meghir. 2007. Changes in the distribution of male and female wages
accounting for employment composition using bounds. Econometrica
75, no. 2:323–63.
Buchinsky, Moshe. 1998. The dynamics of changes in the female wage
distribution in the USA: A quantile regression approach. Journal of
Applied Econometrics 13, no. 1:1–30.
Chandra, Amitabh. 2003. Is the convergence in the racial wage gap il-
lusory? NBER Working Paper no. 9476, National Bureau of Economic
Research, Cambridge, MA.
Fernández, Raquel, and Alessandra Fogli. 2005. Culture: An empirical
investigation of beliefs, work, and fertility. NBER Working Paper no.
11268, National Bureau of Economic Research, Cambridge, MA.
Fortin, Nicole M. 2005. Gender role attitudes and the labour market
outcomes of women across OECD countries. Oxford Review of Eco-
nomic Policy 21, no. 3:416–38.
———. 2006. Greed, altruism, and the gender wage gap. Unpublished
manuscript, Department of Economics, University of British Columbia.
Goldin, Claudia. 1990. Understanding the gender wage gap: An economic
history of American women. Oxford: Oxford University Press.
Gronau, Reuben. 1974. Wage comparison: A selectivity bias. Journal of
Political Economy 82, no. 6:1119–43.
Heckman, James J. 1974. Shadow prices, market wages, and labor supply.
Econometrica 42, no. 4:679–94.
———. 1979. Sample selection bias as a specification error. Econometrica
47, no. 1:153–63.
———. 1980. Addendum to sample selection bias as a specification error.
In Evaluation studies review annual, vol. 5, ed. Ernst W. Stromsdorfer
and George Farkas. Beverly Hills, CA: Sage.
Johnson, William R., Yuichi Kitamura, and Derek Neal. 2000. Evaluating
a simple method for estimating black-white gaps in median wages.
American Economic Review Papers and Proceedings 90:339–43.
Juhn, Chinhui. 1992. Decline of male labor market participation: The role
of declining market opportunities. Quarterly Journal of Economics 107,
no. 1:79–122.
———. 2003. Labor market dropouts and trends in the wages of black
and white men. Industrial and Labor Relations Review 56, no. 4:643–62.
Krueger, Alan B., and Lawrence H. Summers. 1988. Efficiency wages and
the inter-industry wage structure. Econometrica 56, no. 2:259–93.