This repo is our project "SNCB Cool Train" in the course "Data Mining" at Université Libre de Bruxelles (ULB).
- Here we only use a mini file `ar41_for_ulb_mini.csv`. You can get the full raw data file `ar41_for_ulb.csv` (1.9 GB) provided by the lecturer.
- Go to https://account.mapbox.com/access-tokens/ and create an access token, then put it at `mapbox_access_token=` in the file `app.py`.
- Clone the repo:
  `git clone https://github.com/hieunm44/dm-sncb-cool-train.git`
  `cd dm-sncb-cool-train`
- Install the necessary packages:
  `pip install -r requirements.txt`
- Get weather data from https://open-meteo.com/ through API calls, then integrate it into the raw data:
  `python3 data_integration.py`
  Two files, `weather_data.csv` and `merged_data`, will then be created in the folder `data`.
- Check the file `preprocess_data.ipynb` for data preprocessing and model training. Several data files and model files will be created.
- Run the visualization app. Here we only use the file `sample_data.csv` (a portion of the full data), so that the application can run smoothly:
  `python3 app.py`
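
For readers curious about what the weather integration step involves, here is a minimal sketch of the idea. It is not the code in `data_integration.py`. It assumes Open-Meteo's historical archive endpoint and its usual `hourly` JSON layout, and it uses hypothetical field names (`timestamp`, `lat`, `lon`) and made-up sample values instead of the real dataset columns. Each sensor record is matched to the weather reading for the hour it falls in.

```python
from datetime import datetime
from urllib.parse import urlencode

# Assumption: Open-Meteo's historical archive endpoint; the project may use another one.
ARCHIVE_URL = "https://archive-api.open-meteo.com/v1/archive"


def build_weather_url(lat, lon, start_date, end_date, variables=("temperature_2m",)):
    """Build an hourly-weather request URL for one location and date range."""
    params = {
        "latitude": lat,
        "longitude": lon,
        "start_date": start_date,  # ISO dates, e.g. "2023-01-23"
        "end_date": end_date,
        "hourly": ",".join(variables),
    }
    return f"{ARCHIVE_URL}?{urlencode(params)}"


def index_hourly(payload):
    """Turn an 'hourly' response block into {hour_string: {variable: value}}."""
    hourly = payload["hourly"]
    return {
        t: {name: values[i] for name, values in hourly.items() if name != "time"}
        for i, t in enumerate(hourly["time"])
    }


def merge_weather(records, weather_by_hour, time_field="timestamp"):
    """Left-join records to weather by truncating each timestamp to the hour."""
    merged = []
    for rec in records:
        ts = datetime.fromisoformat(rec[time_field])
        hour_key = ts.replace(minute=0, second=0, microsecond=0).strftime("%Y-%m-%dT%H:%M")
        # Records with no matching hour are kept unchanged.
        merged.append({**rec, **weather_by_hour.get(hour_key, {})})
    return merged


# Made-up sample values, only to illustrate the shapes involved.
sample_payload = {
    "hourly": {
        "time": ["2023-01-23T07:00", "2023-01-23T08:00"],
        "temperature_2m": [1.5, 2.0],
    }
}
sample_records = [
    {"timestamp": "2023-01-23 07:25:08", "lat": 50.85, "lon": 4.35},
    {"timestamp": "2023-01-23 08:59:59", "lat": 50.85, "lon": 4.35},
]
merged = merge_weather(sample_records, index_hourly(sample_payload))
```

In a real run, the JSON returned for `build_weather_url(...)` would replace `sample_payload`. Matching on the truncated hour keeps the join simple, but it ignores location, so a fuller version would fetch weather per position or per grid cell.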