0% found this document useful (0 votes)

94 views6 pages

Empirical Project 2 Do Smaller Classes Improve Test Scores? Evidence From A Regression Discontinuity Design

This document describes an empirical project analyzing the effect of class size on test scores using a regression discontinuity design. Students are instructed to use Israeli school and test score data to: 1) Graphically analyze how class size, test scores, and student characteristics change near a school enrollment threshold; 2) Estimate regression models to quantify the discontinuities; 3) Consider how the findings compare to other studies and what additional data would be needed to make policy recommendations on class size reductions. The data and variables are also defined.

Uploaded by

Paulo Cesar RLuna

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

94 views6 pages

Empirical Project 2 Do Smaller Classes Improve Test Scores? Evidence From A Regression Discontinuity Design

Uploaded by

Paulo Cesar RLuna

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 6

Economics 1152/SUP 135 Professor Raj Chetty and Dr.

Gregory Bruich
Spring 2019 Department of Economics, Harvard University

Empirical Project 2
Do Smaller Classes Improve Test Scores? Evidence from a Regression Discontinuity Design
Posted on Thursday, February 21, 2019
Due at midnight on Thursday, March 7, 2019

In this empirical project, you will use a regression discontinuity design to estimate the causal
effect of class size on test scores. To answer some of the questions, you will need to refer to the
following papers:
1. Chetty, Raj, John N. Friedman, Nathaniel Hilger, Emmanuel Saez, Diane Whitmore Schanzenbach, and
Danny Yagan. 2011. “How Does Your Kindergarten Classroom Affect Your Earnings? Evidence from
Project STAR,” Quarterly Journal of Economics 126(4): 1593–1660.

2. Angrist, Joshua D., and Victor Lavy. 1999. “Using Maimonides’ Rule to Estimate the Effect of Class Size
on Scholastic Achievement,” Quarterly Journal of Economics 114(2): 533–575.

The Stata data file grade5.dta consists of test scores in fifth grade classes at public elementary
schools in Israel. These data were originally used in Angrist and Lavy (1999). The graphs below
were drawn using the same data.

Figure 1
Class Size as a Function of Total School Enrollment in Public Schools in Israel

Note: These figures plot class size as a function of total school enrollment for fourth grade and
fifth grade classes in pubic schools in Israel in 1991.

Instructions

Please submit your Empirical Project on Canvas. Your submission should include three files:
1. A 4-6 page replication as a word or pdf document (double spaced and including
references, graphs, and tables)
2. A do-file with your STATA code or an .R script file with your R code
3. A log file of your STATA or R output
Specific questions to address in your replication

1. Explain why a simple comparison of test scores in small classes versus large classes
would not measure the causal effect of class size. Would this simple comparison likely
be biased upwards or biased downwards relative to that true causal effect? Explain.

2. (To answer this and the next question, read Chetty et al. 2011). How did the Tennessee
STAR experiment overcome this problem? What did it find?

3. What is a binned scatter plot? Explain how it is constructed.

4. Graphical regression discontinuity analysis, focusing on the 40 student school enrollment

threshold. See Table 2a and 2b for more guidance.

a. Draw a binned scatter plot to visualize how class size changes at the 40 student
school enrollment threshold. Display a linear or quadratic regression line based
on what you see in the data.

b. Draw binned scatter plots to visualize how math and verbal test scores change at
the 40 student school enrollment threshold. Display a linear or quadratic
regression line based on what you see in the data.

c. Draw binned scatter plots to test whether (i) the percent of disadvantaged
students, (ii) the fraction of religious schools, and (iii) the fraction of female
students evolve smoothly across the 40 student school enrollment threshold.
Display a linear or quadratic regression line based on what you see in the data.

d. Produce a histogram of the number of schools by total school enrollment. Note

that you must collapse the data by school to produce this graph.

5. Regression analysis. Run the regressions that correspond to your three graphs in 4a and
4b to quantify the discontinuities that you see in the data. In estimating these regressions,
use all the observations with school enrollment less than 80. Report a 95% confidence
interval for each of these estimates. See Table 2a and 2b for more guidance.

6. Recall that any quasi experiment requires an identification assumption to make it as good
as an experiment. What is the identification assumption for regression discontinuity
design? Explain whether your graphs in 4c and 4d are consistent with that assumption.

7. (To answer this question, read Angrist and Lavy (1999)). If all schools followed the class
size rule exactly as described in Angrist and Lavy (1999), how much would you expect
class size to change at the 40 student enrollment threshold? Explain why the actual
change in class size that you see in the data is less than this.

2
8. Suppose your school superintendent is considering a reform to reduce class sizes in your
school from 40 to 35. Use your estimates above to predict the change in math and verbal
test scores that would result from this reform.

Hint: divide the RD estimate of the change in test scores by the change in number of
students per class at the threshold.

9. Now suppose you are asked for advice by another school that is considering reducing
class size from 20 to 15 students – a 5 unit reduction as above. Would you feel confident
in making the same prediction as you did above about the impacts this change will have?
Why or why not?

10. Compare your estimates in 8 with the estimates from (i) the Tennessee STAR experiment
(Chetty et al. 2011) and (ii) data from Sweden (Fredriksson et al. 2013) discussed in
lecture. Give two reasons that your estimates might differ from those of these other
studies.

11. Chetty et al. (2011) show that being assigned to a smaller class in Kindergarten raises
Kindergarten test scores, but has little impact in later grades. Does this “fade out” effect
mean that class size doesn’t really matter in the long run? Why or why not?

12. Given the evidence above, would you encourage your hometown school to reduce class
size by hiring more teachers if the goal is to maximize students’ long-term outcomes
(e.g., college attendance rates, earnings)? Explain clearly what other data you would
need to make a scientific recommendation and how you would use that data.

3
DATA DESCRIPTION, FILE: grade5.dta

The data consist of n = 2,019 fifth grade classes at 1,002 public schools in Israel in 1991. For
more details on the construction of the variables included in this data set, please see Angrist and
Lavy (1999).

Table 1
Definitions of Variables in grade5.dta

Variable Label
(1) (2)
schlcode School id code
school_enrollment Total school enrollment in fifth grade
grade Class grade
5 = fifth grade for all observations in grade5.dta
classize Number of students in the class
avgmath Average composite year-end math score in the class, on a scale
of 1 to 100, from a national elementary school test.
avgverb Average composite year-end verbal score in the class, on a scale
of 1 to 100, from a national elementary school test.
disadvantaged Percent of class coming from a disadvantaged background, as
defined by an index used by the Ministry of Education to
allocate supplementary hours of instruction and other school
resources. The index is based on fathers’ education, fathers’
continent of birth, and family size.
female Fraction of students in the class that are female
religious 1 = School is a religious public school
0 = School is a secular public school

Note: This table describes the variables included in grade5.dta.

4
Table 2a
STATA Commands
STATA command Description
*Install binscatter The first command installs binscatter, which
ssc install binscatter, replace
only has to be done once. The second
*Draw graph (command all goes on one line) command produces a binned scatter plot of
binscatter yvar school_enrollment if yvar against the total school enrollment with a
inrange(school_enrollment,20,60), rd(40.5) discrete linear best fit line, restricting the graph to
line(lfit)
observations with total school enrollment in
*Save graph [20,60]. The third line saves the graph. The
graph export figure1_linear.png, replace fourth line shows how to change the best fit
*Draw graph (command all goes on one line)
line to be quadratic by changing line(lfit) to
binscatter yvar school_enrollment if line(qfit).
inrange(school_enrollment,20,60), rd(40.5) discrete
line(qfit)

*Save graph
graph export figure1_quadratic.png, replace

*Collapse data to school level These commands show how to create a graph
collapse (mean) school_enrollment, by(schlcode )
showing the number of schools that have each
*Graph counts (command all goes on 1 line) value of school_enrollment. First, we
twoway (histogram school_enrollment if collapse the data to convert from school-grade
inrange(school_enrollment,20,60), discrete level data to school level data. Second, we
frequency), xline(40.5)
draw a graph of the counts of schools,
*Save graph restricting the graph to schools with between
graph export school_counts.png, replace 20 and 60 students enrolled. Finally, we save
*Note after collapsing the data, you have to load in
the graph.
the original data in order to run your regressions.

*Load un-collapsed data These commands show how to run a

use grade5.dta, clear
regression to quantify the discontinuity in
*Generate new variables yvar at the 40 student threshold. We first
gen above40 = 0 generate an indicator variable for
replace above40 = 1 if school_enrollment > 40 school_enrollment being above 40. We next
gen x = school_enrollment - 40
gen x_above40 = x*above40 generate a variable that equals
school_enrollment minus 40 and the
*Run regression (all goes on one line) interaction term between this variable and the
reg yvar above40 x x_above40 if
inrange(school_enrollment,0,80), cluster(schlcode)
indicator for school_enrollment being above
40. Finally, we run a regression of yvar on
these three variables, restricting the regression
to observations with school_enrollment
between 0 and 80. The coefficient on
above40 is the estimate of the discontinuity in
yvar at the threshold. We report standard
errors that are clustered by school.

5
Table 2b
R Commands
R command Description
#Install and load rdrobust The first command installs rdrobust,
install.packages('rdrobust')
library(rdrobust) which only has to be done once. The
second command produces a binned
#Subset data to observations in [20,60] scatter plot of yvar against the total
narrow <- subset(grade5, school_enrollment <= 60 & school enrollment with a linear best
school_enrollment >= 20)
fit line, restricting the graph to
#draw binned scatter plot with linear fit observations with total school
rdplot(narrow$yvar, narrow$school_enrollment, c = 40.5, p = enrollment in [20,60]. The last part
1, nbins = 20)
ggsave("figure1_linear.png")
shows how to change the best fit line
to be quadratic by changing p=1 to
# draw binned scatter plot with quadratic fit p=2.
rdplot(narrow$yvar, narrow$school_enrollment, c = 40.5, p =
2, nbins = 20)
ggsave("figure1_quadradratic.png")

#Install and load dyplr These commands show how to create

install.packages('dplyr')
library(dplyr) a graph showing the number of
schools that have each value of
#Collapse data school_enrollment. First, we collapse
by_school <- group_by(narrow, schlcode) the data to convert from school-grade
schools <- summarise(by_school, school_enrollment =
mean(school_enrollment, na.rm = TRUE)) level data to school level data.
Second, we draw a graph of the
#Draw graph counts of schools, restricting the
ggplot(schools, aes(school_enrollment)) +
geom_histogram(bins = 40) +
graph to schools with between 20 and
geom_vline(xintercept=40.5, color = "red") 60 students enrolled. Finally, we
save the graph.
#Save graph
ggsave("school_counts.png")

#For clustered standard errors These commands show how to run a

source("BM_StandardErrors.R")
regression to quantify the
#Subset data and define indicator for above enrollment > 40 discontinuity in yvar at the 40 student
narrow <- subset(grade5, school_enrollment <= 80) threshold. We first subset the data to
narrow$above40 <- 0 observations with school_enrollment
narrow$above40[which(narrow$school_enrollment > 40)] <- 1
between 0 and 80. Next we generate
#Generate centered version of enrollment an indicator variable for
narrow$x <- grade5_narrow$school_enrollment - 40 school_enrollment being above 40.
#Generate interaction term
We then generate a variable that
narrow$x_above <- narrow$above40*grade5_narrow$x equals school_enrollment minus 40
and the interaction term between this
#Run regression variable and the indicator for
mod1 <- lm(yvar~above40 + x + x_above, data = narrow)
summary(mod1) school_enrollment being above 40.
Finally, we run a regression of yvar
#Report clustered standard errors on these three variables, restricting
clustervar <- as.factor(narrow$schlcode) the regression to observations with
BMlmSE(mod1, clustervar, IK=F)
school_enrollment between 0 and 80.
The coefficient on above40 is the
estimate of the discontinuity in yvar
at the threshold. We report standard
errors that are clustered by school,
which will be reported as $se.Stata
after running the last two lines.

ADHD Assessment
No ratings yet
ADHD Assessment
6 pages
Middlemarch: Realism Explored
100% (1)
Middlemarch: Realism Explored
31 pages
California High School Test Scores: Philip Pham April 27, 2011
No ratings yet
California High School Test Scores: Philip Pham April 27, 2011
11 pages
The Impact of A Universal Class-Size Reduction Policy: Evidence From Florida's Statewide Mandate
No ratings yet
The Impact of A Universal Class-Size Reduction Policy: Evidence From Florida's Statewide Mandate
53 pages
Test Scores and School Size
No ratings yet
Test Scores and School Size
4 pages
Chapter 1 Notes
No ratings yet
Chapter 1 Notes
9 pages
Sika® ViscoCrete®-TS 100-2
0% (1)
Sika® ViscoCrete®-TS 100-2
3 pages
Oscor Blue
No ratings yet
Oscor Blue
6 pages
BS en 13335-2002 PDF
No ratings yet
BS en 13335-2002 PDF
12 pages
The Construction of Family in Selected Disney Animated Films
No ratings yet
The Construction of Family in Selected Disney Animated Films
4 pages
Problem Solving
No ratings yet
Problem Solving
16 pages
Kawai Indonesia Factory Report
No ratings yet
Kawai Indonesia Factory Report
5 pages
Chetty Et Al. (2011) ,'how Does Your Kindergarten Classroom Effect Your Earnings' PDF
No ratings yet
Chetty Et Al. (2011) ,'how Does Your Kindergarten Classroom Effect Your Earnings' PDF
89 pages
Unit 1 - What Kind of Movies Have You Been Watching Recently
No ratings yet
Unit 1 - What Kind of Movies Have You Been Watching Recently
12 pages
Associations Between Social Responsibility Disclosure and Characteristics of Companies
No ratings yet
Associations Between Social Responsibility Disclosure and Characteristics of Companies
8 pages
PicoWay Candela Specifications Brochure Resolve
100% (1)
PicoWay Candela Specifications Brochure Resolve
8 pages
Impact of Class Size on Education
No ratings yet
Impact of Class Size on Education
13 pages
Emotional Intelligence Brochure PLI
100% (1)
Emotional Intelligence Brochure PLI
2 pages
Long-Term Effects of Class Size: IZA DP No. 5879
No ratings yet
Long-Term Effects of Class Size: IZA DP No. 5879
31 pages
The Effect of Attending A Small Class in The Early Grades On College-Test Taking and Middle School Test Results: Evidence From Project Star
No ratings yet
The Effect of Attending A Small Class in The Early Grades On College-Test Taking and Middle School Test Results: Evidence From Project Star
28 pages
Teacher Experience and The Class Size Effect - Experimental Evidence
No ratings yet
Teacher Experience and The Class Size Effect - Experimental Evidence
31 pages
EMC Engineering Exam Insights
No ratings yet
EMC Engineering Exam Insights
3 pages
Teacher Effects On Educational Achievement
No ratings yet
Teacher Effects On Educational Achievement
9 pages
An Analysis of The Factors That Influence Student Performance: A Fresh Approach To An Old Debate
No ratings yet
An Analysis of The Factors That Influence Student Performance: A Fresh Approach To An Old Debate
21 pages
Class Size Effect
No ratings yet
Class Size Effect
14 pages
All About Methods of Research
No ratings yet
All About Methods of Research
20 pages
Python Datatypes
No ratings yet
Python Datatypes
6 pages
Econometric S Proj 1
No ratings yet
Econometric S Proj 1
13 pages
Revival and Reinvention of Kathak Dance
No ratings yet
Revival and Reinvention of Kathak Dance
14 pages
Chapters 1 & 2-Final - PPT Econmetrics - Smith/Watson
100% (1)
Chapters 1 & 2-Final - PPT Econmetrics - Smith/Watson
71 pages
DQ2
No ratings yet
DQ2
2 pages
Evaluate The Affected Factors On Students' Mathematics Performance in Rural Areas by Estimating An Education Production Function: As A Case Study of Passara Educational Zone, Sri Lanka
No ratings yet
Evaluate The Affected Factors On Students' Mathematics Performance in Rural Areas by Estimating An Education Production Function: As A Case Study of Passara Educational Zone, Sri Lanka
11 pages
Northern Black Polished Ware in India
100% (1)
Northern Black Polished Ware in India
19 pages
NYC School Class Size Crisis
No ratings yet
NYC School Class Size Crisis
4 pages
Math 10 SLM 18 Permutation and Combination
No ratings yet
Math 10 SLM 18 Permutation and Combination
17 pages
English MAINS Practice Shot 200
No ratings yet
English MAINS Practice Shot 200
4 pages
Measures of Central Tendency and Box and Whisker Plots
No ratings yet
Measures of Central Tendency and Box and Whisker Plots
36 pages
Written Assignment Unit 4
No ratings yet
Written Assignment Unit 4
5 pages
10 Stockwatson 1
No ratings yet
10 Stockwatson 1
65 pages
Testimony Cuomo Commission 7 26 12 Final
No ratings yet
Testimony Cuomo Commission 7 26 12 Final
3 pages
Aalto Test
No ratings yet
Aalto Test
6 pages
Manzan SW4e Ch01 02 03
No ratings yet
Manzan SW4e Ch01 02 03
70 pages
Fredriksson Et Al. QJE, 2013
No ratings yet
Fredriksson Et Al. QJE, 2013
38 pages
BSBCRT511 Task 3 Assessment Templates V3.0923
No ratings yet
BSBCRT511 Task 3 Assessment Templates V3.0923
10 pages
Economics
No ratings yet
Economics
68 pages
How Does Your Kindergarten Classroom Affect Your Earnings? Evidence From Project STAR
No ratings yet
How Does Your Kindergarten Classroom Affect Your Earnings? Evidence From Project STAR
62 pages
Generative Ai-In-The-Loop: Integrating Llms and Gpts Into The Next Generation Networks
No ratings yet
Generative Ai-In-The-Loop: Integrating Llms and Gpts Into The Next Generation Networks
9 pages
Rev. A. J. Deignan's Dream of Wah Yan's Future
No ratings yet
Rev. A. J. Deignan's Dream of Wah Yan's Future
15 pages
Summary Labour Economics
No ratings yet
Summary Labour Economics
87 pages
Irislocker
No ratings yet
Irislocker
23 pages
Elementary Education Report
No ratings yet
Elementary Education Report
7 pages
Worksheet 1
No ratings yet
Worksheet 1
5 pages
SSRN Id3719762
No ratings yet
SSRN Id3719762
92 pages
HCP Draft1
No ratings yet
HCP Draft1
7 pages
Hanushek 1999 EvidenceonCLassSize
No ratings yet
Hanushek 1999 EvidenceonCLassSize
24 pages
HW 04 220
No ratings yet
HW 04 220
5 pages
Paper Outline
No ratings yet
Paper Outline
5 pages
Problem Set III: Part 1a: Theoretical Exercise
No ratings yet
Problem Set III: Part 1a: Theoretical Exercise
2 pages
Israeli Class Size Effects Study
No ratings yet
Israeli Class Size Effects Study
16 pages
Krueger - Experimental Estimates of Education Production
No ratings yet
Krueger - Experimental Estimates of Education Production
37 pages
Empirical Approaches To Studying The Impact of Class Size
No ratings yet
Empirical Approaches To Studying The Impact of Class Size
7 pages
Applied Economics Project Option
No ratings yet
Applied Economics Project Option
5 pages
MCQ Pediatrics
No ratings yet
MCQ Pediatrics
5 pages
Uninterruptible Power Supply (UPS)
No ratings yet
Uninterruptible Power Supply (UPS)
11 pages
Testscore 918.0 + 13.9 × Smallclass, R2 0.01, Ser 74.6
No ratings yet
Testscore 918.0 + 13.9 × Smallclass, R2 0.01, Ser 74.6
1 page
Impact of Class Size on Future Success
No ratings yet
Impact of Class Size on Future Success
37 pages
Literature Review
No ratings yet
Literature Review
4 pages
Sem 4 - Writing and Editing For Media - Bammc
No ratings yet
Sem 4 - Writing and Editing For Media - Bammc
28 pages
Listof C25 Batcheswith Times&Syllabus
No ratings yet
Listof C25 Batcheswith Times&Syllabus
4 pages
Do Low Achieving Students Benefit More From Small Classes Evidence From The Tennessee Class Size
No ratings yet
Do Low Achieving Students Benefit More From Small Classes Evidence From The Tennessee Class Size
17 pages
Speaker 8 GP Leithwood Jantzi OISE Schools Size Study Presented Feb 15 2017
No ratings yet
Speaker 8 GP Leithwood Jantzi OISE Schools Size Study Presented Feb 15 2017
37 pages
Problem Set 1 - Answer Key
No ratings yet
Problem Set 1 - Answer Key
7 pages
Impact of Class Size on Early Education
No ratings yet
Impact of Class Size on Early Education
8 pages
THNG Qualifyingpaper 2017
No ratings yet
THNG Qualifyingpaper 2017
83 pages
Chapter 2 & 3-Review of Probability and Statistics
No ratings yet
Chapter 2 & 3-Review of Probability and Statistics
93 pages
The Effects of Standardized Testing
No ratings yet
The Effects of Standardized Testing
294 pages
Standardized Testing Final Draft
No ratings yet
Standardized Testing Final Draft
10 pages
102x Screening Exam Questions
No ratings yet
102x Screening Exam Questions
3 pages
Introduction To Well Planning, GTO and Drilling Terms
No ratings yet
Introduction To Well Planning, GTO and Drilling Terms
73 pages
M01 StockWatson123635 03 Econ Part01
No ratings yet
M01 StockWatson123635 03 Econ Part01
61 pages
MadisonLivingston ArgumentativeEsssayRoughDraft
No ratings yet
MadisonLivingston ArgumentativeEsssayRoughDraft
5 pages
Class Size vs. Academic Performance
No ratings yet
Class Size vs. Academic Performance
10 pages
ECON4040 Class Size Math Scores RDD Assignment
No ratings yet
ECON4040 Class Size Math Scores RDD Assignment
3 pages
Mathematics Mastery Report
No ratings yet
Mathematics Mastery Report
19 pages
Term Paper PE Anastasia
No ratings yet
Term Paper PE Anastasia
15 pages
Mechanics of Structure
No ratings yet
Mechanics of Structure
16 pages
Optimal Curriculum
No ratings yet
Optimal Curriculum
58 pages
Lecture 8 Education 3
No ratings yet
Lecture 8 Education 3
46 pages