Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View indrajeetapache's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report indrajeetapache

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 250 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
indrajeetapache/README.md

πŸ‘‹ Hi, I'm Indrajit Swain

πŸš€ Senior Data Scientist at Citi Bank | πŸŽ“ PhD Scholar in Data Science | πŸ† Kaggle Expert | ✍️ Blogger on AI & Big Data

Welcome to my GitHub! I specialize in building scalable machine learning solutions, automating data pipelines, and crafting insights using Big Data technologies. With over 6 years of experience, I thrive at the intersection of data science and engineering, creating impactful solutions for real-world problems.


πŸ”§ My Expertise

🧠 Machine Learning & AI

  • Developing credit card fraud detection models using XGBoost, PyTorch, and Hugging Face.
  • Feature engineering, model optimization, and explainability for black-box models.
  • Expertise in GenAI, transformers, and model interpretability.

πŸ“Š Data Science & Analytics

  • Analyzing large-scale datasets to derive actionable insights.
  • Proficient in predictive modeling, time-series analysis, and exploratory data analysis.

🌐 Big Data Technologies

  • Experienced in distributed systems with Apache Spark, Hadoop, Hive, and Snowflake.
  • Building scalable solutions to process terabytes of data seamlessly.

πŸ› οΈ Technologies & Tools

Programming Languages

Python
Scala
SQL


Machine Learning & Deep Learning Frameworks

Scikit-learn
TensorFlow
PyTorch
Keras
XGBoost
Hugging Face


Natural Language Processing (NLP)

spaCy
NLTK
Transformers


Big Data Technologies

Apache Spark
Hadoop
Hive
Snowflake


Visualization

Tableau
Matplotlib
Seaborn


Version Control & Collaboration

Git
GitHub
Jupyter


Other Tools

NumPy
Pandas
Airflow


🎯 Kaggle Highlights

Kaggle Badge
I love competing on Kaggle, where I tackle data challenges across domains. Check out my Kaggle contributions:

  • Participated in Kaggle Competitions to solve real-world problems.
  • Created reusable notebooks for predictive modeling, EDA, and feature engineering.


πŸ“« Connect with Me


🌱 Fun Fact

When I'm not training ML models, I love exploring GenAI advancements, reading research papers for my PhD or casually, writing blogs, and mentoring aspiring data scientists!


Pinned Loading

  1. Kagglesurvey Kagglesurvey
    1
    import  pandas as pd
    2
    import matplotlib.pyplot as plt
    3
    import seaborn as sns
    4
    import numpy as np
    5
    import scipy.stats as sst