Statstutor community project www.statstutor.ac.
uk
stcp-dataset-crime_des
Crime Rate Data description
This data set gives a variety of variables by US state at two time points 10 years apart. A variety of regressions and t-
tests can be carried out with the main scale dependent being and Crime Rate (offences per million population) and t-
tests with the independent being whether or not the state is in the south. Mostly discrete variables as they measure
populations per 1000, there are some continuous variables such as those measuring expenditure.
Variable Description
CrimeRate Crime rate (number of offences per million population) Continuous
Youth Young males (number of males aged 18-24 per 1000) Discrete
Southern Southern state 1 = yes, 0 = no Binary
Education Education time (average number of years schooling up to 25) Discrete
ExpenditureYear0 Expenditure (per capita expenditure on police) skewed Continuous
LabourForce Youth labour force (males employed 18-24 per 1000) Discrete
Males Males (per 1000 females) Discrete
MoreMales More males identified per 1000 females 1 = yes, 0 = no Binary
StateSize State size (in hundred thousands) Discrete
YouthUnemployment Youth Unemployment (number of males aged 18-24 per 1000) skewed Discrete
MatureUnemploymen Mature Unemployment (number of males aged 35-39 per 1000) Discrete
t
HighYouthUnemploy High Youth Unemployment 1 = yes, 0 = no (high if Youth >3*Mature ) Binary
Wage Wage (median weekly wage) Continuous
BelowWage Below Wage (number of families below half wage per 1000) Discrete
Note: The same variables are collected 10 years later and have 10 on the end.
Can be used for :
Test Example Questions
1. Independent t-test Do states in the south have a
higher crime rate?
2. Correlation Relationship between crime Relationship between crime
rate and police expenditure rate and number of families
(Spearman’s) below half wage (Pearson’s)
3. Simple regression Can the number of males to Can number of families below
females predict the crime rate? half wage predict crime rate?
4. Multiple regression Crime rate with any
independent.
5. Chi-squared Is there a relationship between Is there a relationship between
southern states and the male high youth unemployment and
population? southern states?
6. Paired t-test Have crime rates increased in Has adult unemployment
10 years? dropped in 10 years?
7. Checking normality Is the crime rate normally Is youth unemployment
distributed? normally distributed?
8. Mann-Whitney Is police expenditure higher in Is youth unemployment higher
the south? in the south?
Data contributed by Katy Dobson, University of Leeds
Statstutor community project www.statstutor.ac.uk
stcp-dataset-crime_des
Data contributed by Katy Dobson, University of Leeds