0% found this document useful (0 votes)

23 views3 pages

Problem Set 8

The document outlines the instructions for Problem Set 2 for Stats 506 due on October 17, 2018, requiring submissions via Canvas. It includes tasks using Stata and R for data analysis, with specific file naming conventions and requirements for outputs, including formatted tables and graphs. Students are encouraged to be resourceful in their analyses and can utilize late days if necessary, with a maximum of two allowed.

Uploaded by

uditm

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

23 views3 pages

Problem Set 8

Uploaded by

uditm

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

6/17/25, 12:42 AM Problem Set 2

Problem Set 2
Stats 506, Fall 2018
Due: Wednesday October 17, 5pm

Instructions
Submit the assignment by the due date via canvas. If you intend to utilize late days, please upload partial progress
to Canvas and comment that you will utlize late days before the assignment is due. In your email please indicate
how many days you intend to use. You do not need to cc me. You may use a maximum of two late days for this
assignment.

Use Rmarkdown to create and submit a single pdf with your answers to each question along with supporting
evidence in the form of tables and graphs.

All tables and graphs should be neatly labeled and appear polished.

Questions 1 and 2 ask you to use Stata. Do all data manipulation and analyses in separate .do files named
ps2_q1.do and ps2_q2.do .

ps2_q1.do should write a comma delimited file recs2015_usage.csv with the requested point estimates and
standard errors.

Run ps2_q2.do in batch mode and produce a ps2_q2.log file with the output. Output in the log file should be
clearly labeled and referred to in your typed answer to the questions.

Question 3 asks you to analyze data in R. You should submit your code for this problem as ps2_q3.R .

You should submit a single compressed archive ( .zip ) which contains the following files:
ps2.pdf or ps2.html
ps2.Rmd
ps2_q1.do
recs2015_usage.csv
ps2_q2.do
ps2_q2.log
ps2_q3.R
All files should be executable without errors.

All files read, sourced, or referred to within scripts should be assumed to be in the same working directory ( ./ ).

Your code should be clearly written and it should be possible to assess it by reading it. Use appropriate variable
names and comments. Your style will be graded using the style rubric (./StyleRubric.html) [15 points].

Some of these exercises may require you to use commands or techniques that were not covered in class or in the
course notes. You can use the web as needed to identify appropriate approaches. Part of the purpose of these
exercises is for you to learn to be resourceful and self sufficient. Questions are welcome at all times, but please
make an attempt to locate relevant information yourself first.

You may wish to review:

the tutorial on converting between wide and long data available here
(https://stats.idre.ucla.edu/stata/modules/reshaping-data-wide-to-long/),
Richard Williams’s presentation on “Stata’s Margins Command” here
(https://www3.nd.edu/~rwilliam/stats/Margins01.pdf).

https://jbhender.github.io/Stats506/F18/PS2.html 1/3
6/17/25, 12:42 AM Problem Set 2

Question 1 [25 points]

Use Stata to estimate the following national totals for residential energy consumption:

Electricity usage in kilowatt hours

Natural gas usage, in hundreds of cubic feet
Propane usage, in gallons
Fuel oil or kerosene usage, in gallons

In your analysis, be sure to properly weight the individual observations. Use the replicate weights to compute standard
errors. At the end of your .do file, write the estimates and standard errors to a delimited file recs2015_usage.csv .

In your .Rmd read recs2015_usage.csv and produce a nicely formatted table with estimates and 95% confidence
intervals.

Question 2 [35 points]

For this question you should use the 2005-2006 NHANES ORAL Health data available here
(https://wwwn.cdc.gov/nchs/nhanes/search/datapage.aspx?Component=Examination&CycleBeginYear=2005) and the
demographic data available here (https://wwwn.cdc.gov/nchs/nhanes/search/datapage.aspx?
Component=Demographics&CycleBeginYear=2005). Your analyses for this question should be done in Stata, though you
may create plots and format tables using R within Rmarkdown.

For part (b-d), you can ignore the survey aspect of the data and analyze it as if the data were a simple random sample.

a. [5 points] Determine how to read both data sets into Stata and merge them together by the participant id SEQN.

b. [5 points] Use logistic regression to estimate the relationship between age (in months) and the probability that an
individual has a primary rather than a missing or permanent upper right 2nd bicuspid. You can recode permanent
root fragments as permanent and drop individuals for whom this tooth was not assessed. Use the fitted model to
estimate the ages at which 25, 50, and 75% of individuals lose their primary upper right 2nd bicuspid. Round these
to the nearest month. Choose a range of representative age values with one year increments by taking the floor (in
years) of the 25%-ile and the ceiling (in years) of the 75%-ile.

c. [10 points] In the regression above, control for demographics in the following way:
Add gender to the model and retain it if it improves the BIC.
Create indicators for each race/ethnicity category using the largest as the reference and collapsing ‘Other
Hispanic’ and ‘Other’. In order of group size in the sample, add each category retaining those that improve
BIC.
Add poverty income ratio to the model and retain it if it improves BIC.

In your pdf document, include a nicely formatted regression table for the final model and an explanation of the model
fitting process.

d. [10 points] Use the margins command to compute:

1. Adjusted predctions at the mean (for other values) at each of the representative ages determined in part b.

2. The marginal effects at the mean of any retained categorical variables at the same representative ages.

3. The average marginal effect of any retained categorical varialbes at the representative ages.

e. [5 points] Refit your final model from part c using svy and comment on the differences. Include a nicely formatted
regression table and cite evidence to justify your comments.

You should use the following command to set up the survey weights
(ftp://ftp.cdc.gov/pub/health_statistics/nchs/tutorial/nhanes/Continuous/descriptive_mean.do):

https://jbhender.github.io/Stats506/F18/PS2.html 2/3
6/17/25, 12:42 AM Problem Set 2

svyset sdmvpsu [pweight=wtmec2yr], strata(sdmvstra) vce(linearized)

Question 3 [30 points]

Repeat part a-d of question 2 using R. For part d, you may either use the “margins” package or code the computations
yourself.

https://jbhender.github.io/Stats506/F18/PS2.html 3/3

Stat 302 Practice Final: Brad Mcneney 2017-04-15
No ratings yet
Stat 302 Practice Final: Brad Mcneney 2017-04-15
7 pages
CS2B - Sept23 - EXAM - Clean Proof
No ratings yet
CS2B - Sept23 - EXAM - Clean Proof
5 pages
AFES English Manual
100% (7)
AFES English Manual
290 pages
HIRA Night Works
No ratings yet
HIRA Night Works
13 pages
Applied Econometrics: Lab 2: General Instruction
No ratings yet
Applied Econometrics: Lab 2: General Instruction
3 pages
Implementing Merchandise Plans
100% (4)
Implementing Merchandise Plans
19 pages
K.robert Mod 2 Problem Set
No ratings yet
K.robert Mod 2 Problem Set
4 pages
Assignment 2 - HLTH 605b - Fall 2020 (100 Marks)
No ratings yet
Assignment 2 - HLTH 605b - Fall 2020 (100 Marks)
2 pages
Multiple Regression Analysis Problem Set
No ratings yet
Multiple Regression Analysis Problem Set
5 pages
Major Assignment F21 (Friday)
No ratings yet
Major Assignment F21 (Friday)
4 pages
Homework #2: Data Analysis & Regression
No ratings yet
Homework #2: Data Analysis & Regression
2 pages
CS3943-9223 Assignment1
No ratings yet
CS3943-9223 Assignment1
2 pages
Problem Set 1
No ratings yet
Problem Set 1
2 pages
Econometrics Problem Set Guide
No ratings yet
Econometrics Problem Set Guide
4 pages
QMM1001 Applied Activity 2
No ratings yet
QMM1001 Applied Activity 2
2 pages
MCQ 15it423e
No ratings yet
MCQ 15it423e
28 pages
Stata Introduction and Worksheet
No ratings yet
Stata Introduction and Worksheet
2 pages
University of Zimbabwe: Time: 2 Hours
No ratings yet
University of Zimbabwe: Time: 2 Hours
5 pages
Bacs HW5
No ratings yet
Bacs HW5
12 pages
Assignment 4 Corrected
No ratings yet
Assignment 4 Corrected
3 pages
HealthEconIndividualAssignment 2019
No ratings yet
HealthEconIndividualAssignment 2019
2 pages
Stat5000 HW 1-2
No ratings yet
Stat5000 HW 1-2
3 pages
Econometrics Exam Guide
No ratings yet
Econometrics Exam Guide
9 pages
STT 215 Exam 1 Example
No ratings yet
STT 215 Exam 1 Example
5 pages
Problem Set 1
No ratings yet
Problem Set 1
3 pages
ESB2021 Resit With Solution
No ratings yet
ESB2021 Resit With Solution
9 pages
Example Metrics - Final Assignment - WS1920 - SH
No ratings yet
Example Metrics - Final Assignment - WS1920 - SH
9 pages
LabVIEW SVPWM for 3-Level Converters
No ratings yet
LabVIEW SVPWM for 3-Level Converters
61 pages
ECON20003 S1 2024 Sample Exam
No ratings yet
ECON20003 S1 2024 Sample Exam
27 pages
MH 3511 Midterm 2018 So LN
No ratings yet
MH 3511 Midterm 2018 So LN
5 pages
INT232 ETP Question Paper1 2023
No ratings yet
INT232 ETP Question Paper1 2023
4 pages
Statistics & Econometrics Exam 2021
No ratings yet
Statistics & Econometrics Exam 2021
8 pages
DH302 Spring2025 Assignment02-Solutions
No ratings yet
DH302 Spring2025 Assignment02-Solutions
35 pages
R PS
No ratings yet
R PS
2 pages
Pbset1 Dofile
No ratings yet
Pbset1 Dofile
3 pages
2017aug 02323 02402 Solution en
No ratings yet
2017aug 02323 02402 Solution en
43 pages
Computer Lab 3 MM
No ratings yet
Computer Lab 3 MM
38 pages
Sta108hw4 1
No ratings yet
Sta108hw4 1
5 pages
FIT2086 Assignment 3: Regression & Classification Analysis
No ratings yet
FIT2086 Assignment 3: Regression & Classification Analysis
9 pages
Thesis Paper On Net Zero Carbon
No ratings yet
Thesis Paper On Net Zero Carbon
68 pages
Assignment-2 HS 649 2024
No ratings yet
Assignment-2 HS 649 2024
3 pages
Ex 08
No ratings yet
Ex 08
10 pages
EC295 Assign2 2025
No ratings yet
EC295 Assign2 2025
5 pages
STAT501 Online - HW2R - Spring2024
No ratings yet
STAT501 Online - HW2R - Spring2024
7 pages
ECON6001 HW1 Fall2024
No ratings yet
ECON6001 HW1 Fall2024
4 pages
HW 2
No ratings yet
HW 2
12 pages
Final Fa21 Solutions
No ratings yet
Final Fa21 Solutions
40 pages
RAN Network Optimization Parameter Reference RAN6 1
No ratings yet
RAN Network Optimization Parameter Reference RAN6 1
371 pages
HWK5 SS
No ratings yet
HWK5 SS
11 pages
Applied Econometrics Problem Set Solutions
No ratings yet
Applied Econometrics Problem Set Solutions
14 pages
Exercise Solutions
No ratings yet
Exercise Solutions
30 pages
ProblemSet1 2025
No ratings yet
ProblemSet1 2025
6 pages
Assignment 1
No ratings yet
Assignment 1
2 pages
Assignment STAT5002
No ratings yet
Assignment STAT5002
5 pages
Specialty Chemical Production Analysis
No ratings yet
Specialty Chemical Production Analysis
8 pages
2023dec 02402 en
No ratings yet
2023dec 02402 en
24 pages
Stat Ana Week2 Quiz 2 - Montalbo J Nicole e
No ratings yet
Stat Ana Week2 Quiz 2 - Montalbo J Nicole e
4 pages
Engineering Data Analysis
No ratings yet
Engineering Data Analysis
5 pages
Scalability PDF
No ratings yet
Scalability PDF
21 pages
Cao Wang FTA EMA
No ratings yet
Cao Wang FTA EMA
5 pages
Compression: DMET501 - Introduction To Media Engineering
No ratings yet
Compression: DMET501 - Introduction To Media Engineering
26 pages
FELICIANO MALIWAT, Petitioner, vs. HON. COURT OF APPEALS, Former Special First Division, and The REPUBLIC OF THE PHILIPPINES, Respondents
100% (1)
FELICIANO MALIWAT, Petitioner, vs. HON. COURT OF APPEALS, Former Special First Division, and The REPUBLIC OF THE PHILIPPINES, Respondents
7 pages
Review Of: Generated On 2022-12-20
No ratings yet
Review Of: Generated On 2022-12-20
21 pages
Group 4
No ratings yet
Group 4
9 pages
CSE 312-Introduction To Statistical Tools in Research - Question Bank
No ratings yet
CSE 312-Introduction To Statistical Tools in Research - Question Bank
6 pages
Metal Casting 3
No ratings yet
Metal Casting 3
23 pages
Lab 3
No ratings yet
Lab 3
16 pages
DAV Guidelines
No ratings yet
DAV Guidelines
4 pages
Cineplex Loyalty Program Strategy
No ratings yet
Cineplex Loyalty Program Strategy
10 pages
Consumer Perception Towards Online Grocery Stores, Chennai
No ratings yet
Consumer Perception Towards Online Grocery Stores, Chennai
14 pages
SAP MM - Purchase Info Record
100% (1)
SAP MM - Purchase Info Record
6 pages
The Theoretical Framework of The Optimization of Public Transport Travel
No ratings yet
The Theoretical Framework of The Optimization of Public Transport Travel
7 pages
Eco5006 1)
No ratings yet
Eco5006 1)
9 pages
4 Startup Roles To Hire
No ratings yet
4 Startup Roles To Hire
8 pages
TEDwp Fy Youth
No ratings yet
TEDwp Fy Youth
4 pages
Shibaura D265F
No ratings yet
Shibaura D265F
3 pages
Insurance Premium Rates Guide
No ratings yet
Insurance Premium Rates Guide
6 pages
SST 202 by Eugene Akoko Telo
No ratings yet
SST 202 by Eugene Akoko Telo
4 pages
Applied Energy Systems
No ratings yet
Applied Energy Systems
2 pages
Kuwait's Growing F&B Market
No ratings yet
Kuwait's Growing F&B Market
2 pages
Harsh Kumar Chetiwal-CV
No ratings yet
Harsh Kumar Chetiwal-CV
1 page
Bagua Map
No ratings yet
Bagua Map
1 page
Property Dispute: No Forgery Found
No ratings yet
Property Dispute: No Forgery Found
1 page
NIKE Bleed Blue Integrated Campaign
No ratings yet
NIKE Bleed Blue Integrated Campaign
2 pages
Applied Electronics Paper - IV: B.E. Sixth Semester (Aeronautical Engineering) (C.B.S.)
No ratings yet
Applied Electronics Paper - IV: B.E. Sixth Semester (Aeronautical Engineering) (C.B.S.)
2 pages
Attachment 1
No ratings yet
Attachment 1
3 pages
Sample Exam 2
No ratings yet
Sample Exam 2
6 pages
Export Import and Countertrade
No ratings yet
Export Import and Countertrade
32 pages

Problem Set 8

Uploaded by

Problem Set 8

Uploaded by

6/17/25, 12:42 AM Problem Set 2

You may wish to review:

Question 1 [25 points]

Electricity usage in kilowatt hours

Question 2 [35 points]

d. [10 points] Use the margins command to compute:

svyset sdmvpsu [pweight=wtmec2yr], strata(sdmvstra) vce(linearized)

Question 3 [30 points]

You might also like