Thanks to visit codestin.com
Credit goes to github.com

Skip to content

moalvarez/GettingAndCleaningData

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 

Repository files navigation

title author date output
Readme
Miguel O. Alvarez
April 26, 2015
html_document

================================================================== Coursera Getting and Cleaning Data Course April 2015 Miguel O. Alvarez

This is some general information/highlights on the work for this project.

I believe the code provides the results requested, generating tidy data in the wide form.

It also complies with all the steps described for the assignment (although not necessarily in that order.

The resulting data set shows all the columns from the original set, with the data grouped by the activity and the subject, and each column summarized to the mean value of each observation type by activity/subject.

The column names were prepended with the string "Mean-" to denote that the values reflect the mean of the original set of observations by activity/subject.

The general process was to first merge and tidy the data for each set (train and test).
Then merge the two tidy sets (clean_train_set and clean_test_set) into one (merged_set). Then used the merged set to generate a new frame (tidy_set) with the average (mean) of each variable by activity/subject. Finally, update the column names of the tidy_set to denote that they contain the mean of the original measurements (by activity/subject).

I have documented the steps in the code for clarity.

The script file is run_analysis.R. The file containing the tidy_data frame is tidy_data.txt.

The file containing the tidy_data frame can be read in to R with: data <- read.table(, header = TRUE)

About

Repository for files for the Getting and Cleaning Data course project

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages