My EDA Helper - Boost Your Exploratory Data Analysis Process! 🚀

EDA Helper is a Python package designed to streamline your Exploratory Data Analysis (EDA) process. It provides a collection of helper functions to quickly analyze, visualize, and summarize datasets. Whether you're working with numeric, categorical, or datetime data, this package has you covered!

Credits 🙏

This package is inspired by the brilliant work of @MisbahullahSheriff. The original EDA helper functions were created by him, and I have extended and organized them for easier use. Additional functions and improvements have been added by me (@shemanto27).

Installation 📦

You can install the package via pip:

pip install my_eda_helper

For Google Colab users, install it directly in your notebook:

!pip install my_eda_helper

Usage 🛠️

1. Import the Package

import my_eda_helper as eda

2. High-Level Analysis

Missing Data

Find Missing Values:

missing_data = eda.missing_info(df)
print(missing_data)

Plot Missing Data:

eda.plot_missing_info(df)

Correlation Analysis

Numeric Features (Pearson/Spearman):

eda.correlation_heatmap(df)

Categorical Features (Cramer's V):

eda.cramersV_heatmap(df)

Pair Plots

eda.pair_plots(df)

3. Detailed Analysis

Numeric Features

Summary:

eda.num_summary(df, "Age")

Univariate Plots:

eda.num_univar_plots(df, "Fare")

Bivariate Plots:

eda.num_bivar_plots(df, "Age", "Fare")

Categorical Features

Summary:

eda.cat_summary(df, "Sex")

Univariate Plots:

eda.cat_univar_plots(df, "Embarked")

Bivariate Plots:

eda.num_cat_bivar_plots(df, "Fare", "Sex")

Hypothesis Testing

Numeric vs Numeric:

eda.num_num_hyp_testing(df, "Age", "Fare")

Numeric vs Categorical:

eda.num_cat_hyp_testing(df, "Fare", "Sex")

Categorical vs Categorical:

eda.hyp_cat_cat(df, "Sex", "Survived")

Contributing 🤝

Contributions are welcome! If you have ideas for new features, improvements, or bug fixes, please feel free to:

Fork the repository.
Create a new branch:
```
git checkout -b feature/YourFeatureName
```
Commit your changes:
```
git commit -m 'Add some feature'
```
Push to the branch:
```
git push origin feature/YourFeatureName
```
Open a pull request.

Please ensure your code follows the project's style and includes appropriate tests.

License 📄

This project is licensed under the MIT License. See the LICENSE file for details.

Support 💬

If you have any questions, suggestions, or issues, please open an issue on the GitHub repository.

Happy EDA! 🎉

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
my_eda_helper		my_eda_helper
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

My EDA Helper - Boost Your Exploratory Data Analysis Process! 🚀

Credits 🙏

Installation 📦

Usage 🛠️

1. Import the Package

2. High-Level Analysis

Missing Data

Correlation Analysis

Pair Plots

3. Detailed Analysis

Numeric Features

Categorical Features

Hypothesis Testing

Contributing 🤝

License 📄

Support 💬

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

shemanto27/eda-helper-py

Folders and files

Latest commit

History

Repository files navigation

My EDA Helper - Boost Your Exploratory Data Analysis Process! 🚀

Credits 🙏

Installation 📦

Usage 🛠️

1. Import the Package

2. High-Level Analysis

Missing Data

Correlation Analysis

Pair Plots

3. Detailed Analysis

Numeric Features

Categorical Features

Hypothesis Testing

Contributing 🤝

License 📄

Support 💬

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages