Thanks to visit codestin.com
Credit goes to GitHub.com

Skip to content
#

data-cleaning-pipeline

Here are 34 public repositories matching this topic...

🤖 An automated machine learning framework for audio, text, image, video, or .CSV files (50+ featurizers and 15+ model trainers). Python 3.6 required.

  • Updated Apr 2, 2025
  • Python

Using machine learning models to predict if patients have chronic kidney disease based on a few features. The results of the models are also interpreted to make it more understandable to health practitioners.

  • Updated Jan 21, 2026
  • Jupyter Notebook

The dataset I wrangled (and analysed and visualized) is the tweet archive of Twitter user @dog_rates, also known as WeRateDogs. WeRateDogs is a Twitter account that rates people's dogs with a humorous comment about the dog.

  • Updated Nov 25, 2021
  • HTML

End-to-end ML project for predicting Indian flight prices using XGBoost with 97% accuracy and RMSE of ~768 INR. Deployed as an interactive Streamlit app with AWS SageMaker integration and modern gradient UI design.

  • Updated Feb 2, 2026
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the data-cleaning-pipeline topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the data-cleaning-pipeline topic, visit your repo's landing page and select "manage topics."

Learn more