Steam Data Analysis

Step 1: Download the Dataset

Begin by downloading the games.json file from the Steam Games Dataset on Kaggle. This dataset provides comprehensive information about Steam games, which will serve as the foundation for analysis and modeling.

Step 2: Prepare the Dataset

Locate the downloaded games.json.zip file.
Extract the contents of the .zip archive to retrieve the games.json file.
Place the games.json file in the storage/ directory of this repository.

These steps ensure the dataset is properly organized and ready for further processing.

Step 3: Install Dependencies

Install the required Python dependencies using the following command:

pip install -r requirements.txt

This will ensure all necessary libraries are installed in your environment. Python version is 3.9

Note

To make sure the dashboard is working, you will need a Chromium-based browser installed on your system.
Install it using the command:

playwright install chromium

Commands

1. Inference

Apply the trained model to external data and generate predictions.

Command:

python run.py -mode "inference" -file "./path_to_input.csv"

Arguments:

-mode: Set to "inference" to run inference on best model. -file: Path to the input CSV file. Output:

A new CSV file with predictions added as a predict column, saved in the folder specified by infer_folder in config.yaml.

2. Update

Fetch the next batch of data, preprocess it, and retrain the model.

Command:

python run.py -mode "update"

Arguments:

-mode: Set to "update" to process the next batch and retrain the model. Output:

Updated model and metrics saved in the storage/results/ folder. Data quality and EDA reports saved in the storage/results/reports and storage/results/eda folders.

3. Summary

Generate a monitoring report summarizing data quality, model metrics, and hyperparameters.

Command:

python run.py -mode "summary"

Arguments:

-mode: Set to "summary" to generate the monitoring report. Output:

A monitoring_report.txt file saved in the report_storage folder.

Dashboard

Access dashboard by

python dashboard/app.py

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
.github/workflows		.github/workflows
dashboard		dashboard
doc		doc
monitoring		monitoring
options		options
src		src
storage/results		storage/results
tests		tests
utils		utils
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt
run.py		run.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Steam Data Analysis

Step 1: Download the Dataset

Step 2: Prepare the Dataset

Step 3: Install Dependencies

Commands

1. Inference

2. Update

3. Summary

Dashboard

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

NotTwist/Steam-MLOps

Folders and files

Latest commit

History

Repository files navigation

Steam Data Analysis

Step 1: Download the Dataset

Step 2: Prepare the Dataset

Step 3: Install Dependencies

Commands

1. Inference

2. Update

3. Summary

Dashboard

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages