Thanks to visit codestin.com
Credit goes to Github.com

Skip to content
View janeruxi1's full-sized avatar
πŸ’­
Learning
πŸ’­
Learning
  • American Modern Insurance
  • Bellevue, WA

Block or report janeruxi1

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
janeruxi1/README.md

πŸ‘‹ Hi, I'm Xi Ru

πŸ‘©β€πŸ’» About Me

I’m a Data Scientist with a strong focus on AI-driven solutions and a background in Data Analysis. I build end-to-end systems that combine data engineering, machine learning, and Generative AI to improve process automations, model predictions and business decisions.

πŸ“ Location: Bellevue, Washington

This repository highlights selected projects that demonstrate my skills in AI-focused data science.


πŸ” Focus

  • Machine Learning & Deep Learning
  • Generative AI & LLM applications
  • Data Engineering with Spark & Databricks
  • End-to-end AI systems (data β†’ model β†’ deployment)

πŸ› οΈ Skills & Tools

  • Languages: Python, SQL, R, DAX
  • ML / AI: Scikit-learn, TensorFlow, PyTorch, LLMs, Prophet
  • Big Data: Spark, Databricks, Delta Lake
  • MLOps & Deployment: FastAPI, Docker, GitHub Actions,MLflow
  • Visualization: Matplotlib, Seaborn, Streamlit

πŸ“‚ Projects

Below are selected projects designed to mirror real-world AI data science work, from data ingestion to model diagnostics and AI-powered insights.

πŸ€– 1️. Automated Business Insights Platform (ML Evaluation + AI)

Tech: Python, FastAPI, Pandas, LLMs

  • Built an end-to-end ML analysis pipeline with automated performance summaries
  • Combined traditional metrics with AI-generated insights for decision support
  • Emphasized model evaluation, explainability, and stakeholder-ready outputs

πŸ‘‰ Repo: automated-business-insights

πŸ“ˆ 2. Time Series Forecasting & Model Diagnostics (Core DS)

Tech: Python, Statsmodels, MLflow

  • Implemented multiple forecasting models and compared performance
  • Added residual diagnostics, model fit summaries, and error analysis
  • Tracked experiments and metrics using MLflow

πŸ‘‰ Repo: time-series-forecasting-project

⚑ 3. Scalable ML Pipelines with Spark & Databricks (AI at Scale)

Tech: PySpark, Delta Lake, Databricks

  • Built scalable data pipelines supporting ML workloads
  • Implemented window functions, incremental processing, and Delta time travel
  • Designed architecture with production ML systems in mind

πŸ‘‰ Repo: [spark-databricks-pipeline]

🧠 4. Generative AI: LLM-Powered Insight Generator

Tech: Python, LangChain, OpenAI API, Streamlit

  • Uses LLMs to transform raw analytical outputs into human-readable insights, summaries, and reports.

πŸ‘‰ Repo: [llm-insight-generator]


πŸ’‘ Skills Demonstrated

  • Applied ML & Generative AI (LLMs, prompt design, evaluation)
  • Model training, performance metrics, and diagnostics
  • Big data processing with Spark & Delta Lake
  • AI-enabled APIs & backend systems
  • Clear documentation, reproducibility, and stakeholder communication

πŸ“« Contact

⭐ If you find these projects useful, feel free to star the repos!

Pinned Loading

  1. automated-business-insights automated-business-insights Public

    This repository shows end-to-end automated AI business insight generator with forecasting and Azure integration

    Python 1

  2. Weekly-AI-Prediction-Insights-via-N8N Weekly-AI-Prediction-Insights-via-N8N Public

    Python