Getting and Cleaning Data Course - Project

This project presents a tidy data set prepared for later data analysis. This tidy data corresponds to the mean and standard deviation measurements collected from Samsung Galaxy S II smartphone accelerometers.

Repo Files

This repository is composed of other 3 files, as described below:

tidyDataSet.txt, a txt file which contais the variables corresponding to the mean and standard deviation measurements from the smartphone accelerometers for each human activity;
CodeBook.md, a code book that describes the variables, the data, and transformations performed to clean up the raw data;
run_analysis.R, the R script which contains the steps that I used to go from the raw data to the tidy data set;
rawDataset file, which contais the raw data files used.

How the run_analysis work?

NO INPUT PARAMETER

INPUT: raw data files (X_train.txt and X_test.txt) which contains the training and test sets

OUTPUT: tidy data set as a txt file created with write.table() function

This script does the following steps:

Use the rbind function to merge training and the test sets to create the fullDataSet. Also, this function was used to merge the activities labels correspoding to training and test sets to create the activity data.
From the merged data set, use the grepl function to extracts only the measurements which presents exactly "-mean()" or "-std()" patterns in the end of its names were extrated to build the newDataSet. Example: grepl('-mean$', features$V2[i]);
From the activity data, the paste command is used to replace the number by the corresponding activity name in the new data set;
From the new data set, the colMeans function is used to summarize the variables by mean per activity in order to create a second independent tidy data set (tidyDataSet);
Use the lapply and paste functions to appropriately label the tidy data set with the descriptive variable names.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Getting and Cleaning Data Course - Project

Repo Files

How the run_analysis work?

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
rawDataset		rawDataset
CodeBook.md		CodeBook.md
README.md		README.md
run_analysis.R		run_analysis.R
tidyDataSet.txt		tidyDataSet.txt

deborahvm/GettingAndCleaningDataCourseProject

Folders and files

Latest commit

History

Repository files navigation

Getting and Cleaning Data Course - Project

Repo Files

How the run_analysis work?

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages