In this project, we are going to wrangle, analyze and visualize tweets data of WeRateDogs Twitter user.
The Twitter account WeRateDogs (@dog_rates) is devoted to humorously reviewing pictures of dogs doing adorable poses, often giving them scores above 10/10. It has close to 8.7 million followers as of May 2020.
We are going to perform
- Data wrangling i.e., Gather, Assess and Clean relevant tweet data of WeRateDogs from multiple data sources.
- Storing, analyzing, and visualizing the wrangled data.
- Reporting on
- Wrangling efforts
- Data Analysis and Visualization
Organization structure of this project is as follows:
|-data/ |-data/twitter_archive_enhanced.csv |-data/image_predictions.tsv |-data/tweet_json.txt | |-addons/ |-addons/twitter_api.py | |-notebooks/ |-notebooks/wrangle_act.ipynb |-notebooks/wrangle_report.ipynb |-notebooks/act_report.ipynb | |-reports/ |-reports/wrangle_report.html |-reports/act_report.html | |-README.md
- Original ratings (no retweets) that have images.
- Assess and clean atleast 8 qualities.
- Assess and clean atleast 2 tidiness issues.
- Tweet image predictions neural network is hosted on Udacity's server.
- Python3.x
Make sure you have pip install on your machine to fiddle around with different python modules. If pip is not pre-installed, run:
sudo apt-get install python3-pipSome of the python modules don't come in-built with python3. You can download individual modules by running the following commands in your terminal.
pip install --user wget
pip install --user requestsThis is a new feature yet to be added.
- Twitter API
- Udacity Data Analyst Nanodegree
- @WeRateDogs Twitter user