Main Folder: CS599-UFOs-PA1
Sub Folders:
- assignment_description
- cached_data: cached location data (created when extracting location coordinates using geopy)
- d3_work: code to generate the d3 visualizations
- data: the datasets used
- merge_data: the final dataset
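
To illustrate why a folder like cached_data is useful, here is a minimal sketch of a geocoding cache. It is not the code used in this repository: the function names, the JSON cache format, and the lower-cased key are all assumptions made for the example. The actual lookup is passed in as a function so the caching logic can be read (and run) without network access.

```python
import json
from pathlib import Path
from typing import Callable, Dict, Optional, Tuple

Coords = Tuple[float, float]


def load_cache(path: Path) -> Dict[str, Optional[list]]:
    """Read a JSON cache file; return an empty cache if the file is missing."""
    if path.exists():
        return json.loads(path.read_text(encoding="utf-8"))
    return {}


def save_cache(path: Path, cache: Dict[str, Optional[list]]) -> None:
    """Write the cache back to disk as JSON (hypothetical format)."""
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(json.dumps(cache, indent=2), encoding="utf-8")


def geocode_with_cache(
    place: str,
    cache: Dict[str, Optional[list]],
    lookup: Callable[[str], Optional[Coords]],
) -> Optional[Coords]:
    """Return (latitude, longitude) for place, calling lookup only on a cache miss."""
    key = place.strip().lower()  # assumed normalisation so repeated names share one entry
    if key not in cache:
        result = lookup(place)
        # Failed lookups are stored as None so they are not retried on every run.
        cache[key] = list(result) if result is not None else None
    hit = cache[key]
    return (hit[0], hit[1]) if hit is not None else None
```

With geopy, the lookup argument could be a small wrapper around Nominatim(user_agent="...").geocode that returns (location.latitude, location.longitude), or None when nothing is found.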
Order of execution:
- Install the requirements by running: pip install -r requirement.txt
- Follow /data/initial_cleaning to get reference.csv
- Follow /data/geocoded_location to get reference_w_loc.csv
- Use /data/shapes_cleaning to try to extract a shape from the description of UFO sightings that did not report one. This step reads and overwrites reference_w_loc.csv
- Follow /data/airport_dataset to produce reference_w_airports.csv
- Follow /data/sports_dataset to produce reference_w_sports.csv
- Follow /data/meteorite_dataset to produce reference_w_meteorite.csv
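
The shape extraction step can be pictured with the following minimal sketch. It is only an illustration under stated assumptions, not the code in /data/shapes_cleaning: the shape vocabulary, the column names "shape" and "description", and both function names are hypothetical. The idea is a simple keyword match against the free-text description, applied only to rows whose shape is empty.

```python
import re
from typing import Dict, List, Optional

# Assumed vocabulary; the real list of shapes may differ.
KNOWN_SHAPES = ("triangle", "circle", "disk", "sphere", "cigar", "oval", "light")

# Match a known shape as a whole word, allowing a plural "s".
_SHAPE_RE = re.compile(r"\b(" + "|".join(KNOWN_SHAPES) + r")s?\b", re.IGNORECASE)


def shape_from_description(description: Optional[str]) -> Optional[str]:
    """Return the first known shape mentioned in the description, or None."""
    match = _SHAPE_RE.search(description or "")
    return match.group(1).lower() if match else None


def fill_missing_shapes(rows: List[Dict[str, str]]) -> List[Dict[str, str]]:
    """Fill in the shape of rows that have none; reported shapes are left untouched."""
    for row in rows:
        if not (row.get("shape") or "").strip():
            guess = shape_from_description(row.get("description"))
            if guess is not None:
                row["shape"] = guess
    return rows
```

Whole-word matching avoids false hits such as "light" inside "lightning", at the cost of missing variants like "circular" or "triangular".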