Coverage reports are generated using JaCoCo and Gradle. To automate generating the reports and copying their output for use in the MapReduce program, a bash script is provided.
-
Run the bash script: bash run_tests.sh. This will generate all the coverage reports and copy them into the input folder for MapReduce. (The tests are located in src/test/java/gurpreetstests of the gurpreetkaur_chabada_hw1 module; the program has two JUnit tests.)
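For orientation, the sketch below shows roughly what such a script might do, assuming the standard Gradle test and jacocoTestReport tasks; the test-name filters, report paths, and the mapreduce/input destination are placeholders, and run_tests.sh in the repository is the source of truth.

    #!/bin/bash
    set -e

    # Run each JUnit test on its own so JaCoCo produces one report per test,
    # then copy that report into the folder the MapReduce job reads as input.
    # Test names and paths below are placeholders.
    for t in TestOne TestTwo; do
        ./gradlew gurpreetkaur_chabada_hw1:test --tests "*${t}*" \
                  gurpreetkaur_chabada_hw1:jacocoTestReport
        cp gurpreetkaur_chabada_hw1/build/reports/jacoco/test/jacocoTestReport.xml \
           "mapreduce/input/${t}.xml"
    done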
-
Next, generate the jar for MapReduce. Run ./gradlew mapreduce:fatJar. This will generate the jar in mapreduce/build/libs; the jar name is mapreduce-all.jar.
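To double-check that the build produced the fat jar where the next steps expect it, you can list it and optionally inspect its contents (paths taken from the step above):

    ./gradlew mapreduce:fatJar
    ls -lh mapreduce/build/libs/mapreduce-all.jar
    # Optionally list the bundled entries to confirm dependencies were packed in
    jar tf mapreduce/build/libs/mapreduce-all.jar | head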
-
To run this jar locally, set up Hadoop and execute the following command from the bin directory of your local installation:
hadoop jar mapreduce-all.jar input/ output
(If using macOS, please run zip -d mapreduce-all.jar META-INF/LICENSE before running the above command.)
Before running the above command, make sure the input directory has been copied to your HDFS. You can do that using:
hadoop fs -mkdir -p input (creates the input directory if it does not exist)
hdfs dfs -put <path-to-local-input>/. input
Also make sure the output directory does not exist in HDFS; if it does, delete it using:
hadoop fs -rm -r output/
Copy the contents of the output directory to the local file system using: hadoop fs -copyToLocal output/ .
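Putting the HDFS steps together, a full local run from the Hadoop bin directory might look like the sketch below; the path to the local coverage reports is a placeholder, and the || true simply ignores the error if no previous output directory exists.

    # Create the HDFS input directory and upload the coverage reports
    hadoop fs -mkdir -p input
    hdfs dfs -put <path-to-local-input>/. input

    # Remove any previous output directory, then run the job
    hadoop fs -rm -r output/ || true
    hadoop jar mapreduce-all.jar input/ output

    # Fetch the results back to the local file system
    hadoop fs -copyToLocal output/ .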
View the results stored in the output/final folder. The output format is: filename -> line number, followed by the list of tests that cover that line, in descending order.
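Purely as an illustration of that shape (not actual results), an entry would look something like the line below; the class and test names are placeholders and the exact delimiters depend on the reducer's output.

    SomeClass.java -> 42    [TestTwo, TestOne]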
If your Hadoop cluster contains more than one node, uncomment lines 171 and 172 in WordCount.java and adjust them based on the number of cluster nodes. The file as provided works for a single-node configuration.
YouTube link for the EMR deployment (it is unlisted and only accessible via this link): https://youtu.be/EHoT8o9eHRI
Results of the EMR deployment are present in the mapreduce/output/aws folder.