Write a crawler to extract game attributes (at most 300 games) and store them in Elasticsearch. Then visualize the stored data using Kibana.
First, set up Elasticsearch and Kibana.
Step 1 — clone the repo:
mkdir crawler
cd crawler
git clone https://github.com/sshmo/game_crawler.git .
Step 2 — Set Up Python
sudo apt update
sudo apt install python3-pip python3-dev
Step 3 — Create a Python Virtual Environment
sudo -H pip3 install --upgrade pip
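sudo -H pip3 install virtualenv virtualenvwrapper
# mkvirtualenv and workon are provided by virtualenvwrapper; source its virtualenvwrapper.sh in your shell first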
mkvirtualenv scrap
workon scrap
Step 4 — Install requirements
pip3 install -r requirements.txt
Step 5 — Run scrapy
scrapy crawl games -o games.jl
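Before loading the output into Elasticsearch, it can help to sanity-check games.jl. The sketch below reads the JSON Lines file (one JSON object per line) and reports records that lack an expected attribute, or a crawl that exceeds the 300-game limit. The field names in EXPECTED_FIELDS are assumptions for illustration; the real keys depend on the spider in the repo.

```python
import json

# Hypothetical field names; adjust them to the keys the spider actually emits.
EXPECTED_FIELDS = {"name", "genre", "downloads", "score", "votes", "description"}
MAX_GAMES = 300  # limit stated in the task description


def load_games(path):
    """Read a JSON Lines file: one JSON object per non-empty line."""
    games = []
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:
                games.append(json.loads(line))
    return games


def check_games(games):
    """Return a list of human-readable problems found in the crawl output."""
    problems = []
    if len(games) > MAX_GAMES:
        problems.append(f"{len(games)} games crawled, expected at most {MAX_GAMES}")
    for index, game in enumerate(games):
        missing = EXPECTED_FIELDS - set(game)
        if missing:
            problems.append(f"record {index} is missing {sorted(missing)}")
    return problems
```

Calling check_games(load_games("games.jl")) returns an empty list when every record has all expected fields and the limit is respected.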
Step 6 — Run elasticsearch_db.py
python elasticsearch_db.py games.jl
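For orientation, loading a JSON Lines file into Elasticsearch could look roughly like the sketch below. This is not necessarily how elasticsearch_db.py works; the function names are hypothetical, the index name "games" is taken from the search URL in this README, and the host is assumed to be a local, unsecured node. It relies on the bulk helper from the official elasticsearch Python package.

```python
import json


def bulk_actions(path, index="games"):
    """Yield one bulk-index action per non-empty JSON line in the crawl output."""
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            if line.strip():
                yield {"_index": index, "_source": json.loads(line)}


def index_games(path, host="http://localhost:9200"):
    """Send all actions to Elasticsearch (assumes a local node without auth)."""
    # Imported lazily so bulk_actions can be used without the package installed.
    from elasticsearch import Elasticsearch, helpers

    client = Elasticsearch(host)
    # helpers.bulk returns a tuple: (number of successes, list of errors).
    return helpers.bulk(client, bulk_actions(path))
```

Building the actions with a generator keeps memory use flat, although for 300 games a plain list would work equally well.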
Now you can see the result at: http://localhost:9200/games/_search
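The same endpoint can also be queried from Python. The sketch below uses only the standard library to POST a match_all query and return the stored documents; the function names and the default size of 10 are illustrative choices, and it assumes Elasticsearch is reachable locally without authentication.

```python
import json
from urllib.request import Request, urlopen

SEARCH_URL = "http://localhost:9200/games/_search"


def build_search_request(size=10, url=SEARCH_URL):
    """Build a POST request that asks for up to `size` documents."""
    body = {"size": size, "query": {"match_all": {}}}
    return Request(
        url,
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def fetch_games(size=10):
    """Run the search and return the stored game documents."""
    with urlopen(build_search_request(size)) as response:
        payload = json.load(response)
    # Each hit wraps the original document under the "_source" key.
    return [hit["_source"] for hit in payload["hits"]["hits"]]
```

Passing size=300 to fetch_games would retrieve every game from a full crawl in a single request.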
- The data should be extracted from the following link (two layers):
The following attributes are of interest:
- Game name
- Game genre
- Number of downloads
- Score value and number of votes
- Description
- Use Kibana for data visualization
- Relate downloads and number of votes to score using Kibana
- Visualize word frequency in the game descriptions using Kibana
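As a quick cross-check of the word-frequency visualization outside Kibana, the counts can be computed directly in Python. This is a minimal sketch: the function name is hypothetical, and the tokenization (lowercase, runs of word characters) is a simplifying assumption that will not match an Elasticsearch analyzer exactly.

```python
import re
from collections import Counter


def word_frequencies(descriptions, top=10):
    """Count words across all descriptions and return the most common ones."""
    counts = Counter()
    for text in descriptions:
        # Lowercase first so "Game" and "game" are counted together.
        counts.update(re.findall(r"\w+", text.lower()))
    return counts.most_common(top)
```

The input is a list of description strings, for example the description field of each record in games.jl, if the spider uses that key.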



