GraphPart, a data partitioning method for ML on biological sequences
- Updated Oct 26, 2023
- Jupyter Notebook
This library allows reading and converting bounding box annotations in many popular formats
🪨 Machine learning project using logistic regression to classify sonar signals as either rocks or mines. Uses scikit-learn to train a binary classifier on sonar dataset with 60 numerical features for accurate underwater object detection.
🫀 A machine learning project using logistic regression to predict heart disease risk from clinical data. Built with Python, scikit-learn, and Jupyter notebooks. Achieves 85%+ accuracy on 303-patient dataset with 13 medical features. Complete ML pipeline from data exploration to model evaluation.
🩺 Machine learning diabetes prediction model using a Support Vector Machine (SVM) classifier. Analyzes 8 medical features (glucose, BMI, age, etc.) from the Pima Indian dataset to predict diabetes risk with 75-80% accuracy. Built with Python, scikit-learn, and pandas. Includes data preprocessing, model training, and a prediction system for diabetes.
🍾 A comprehensive machine learning project using Random Forest algorithm to predict wine quality based on physicochemical properties. Features EDA, model training, hyperparameter tuning, feature importance analysis, and detailed documentation.
📁 Repo for python_splitter Python package. This package can split Images into Train, Test, Validation folders automatically by shuffling media/images for machine learning.
An algorithm for even partitioning.
A machine learning project predicting Titanic passenger survival using data preprocessing, feature engineering, and model optimization with Logistic Regression, Random Forest, and XGBoost.
This repository contains the code and resources for our participation (team of 4 members) in the IIMS Hackathon 2024, where our team developed an award-winning image segmentation model for autonomous vehicles. Our model was recognized for its accuracy and innovation, earning us the Best Model award.
Creates a Decision Tree classifier and visualizes it graphically. The goal is that any new data fed to the classifier is assigned the right class.
Supervised ML: a Decision Tree (C5.0) classification model for the Iris flower dataset using the entropy criterion. Imports libraries and the dataset, performs EDA, applies label encoding, builds and trains the Decision Tree classifier (C5.0) with the entropy criterion, then validates and tests the model.
Code that takes an initial approach to decision trees and bagging, and attempts to let the model be trained on any dataset from Kaggle (reusing the 'connect with Kaggle' project for this).
This project applies a Random Forest Classifier to predict whether a student will Pass or Fail based on their features
This project is designed to extract sales data from a PostgreSQL database, process it, and use a Random Forest model to predict sales quantities. It also visualizes real and predicted sales for better understanding.
This repository contains an introductory notebook for logistic regression.
protclust is a Python library for protein sequence analysis that integrates MMseqs2 for fast clustering and provides tools for creating robust machine learning datasets. It offers cluster-aware data splitting to prevent sequence similarity bias in model evaluation, along with comprehensive protein embedding capabilities for feature generation.
Learning Project ML - Diabetes Prediction
This project provides tools to search for datasets on Kaggle, download and preprocess them, and perform predictions using a Linear Regression model. It includes interactive text-based user interfaces built with `curses`.
This creates an AWS chatbot that gives users an investment portfolio based on their risk tolerance level, i.e. conservative, moderate, or aggressive. The tool uses machine learning to build different portfolios for each risk level.