Data Scientist based in π Hannover, Germany. I'm into everything data β pipelines, models, dashboards. My focus: Customer Analytics & Business Intelligence.
π’ Currently open to part-time Data Science roles β Hannover or remote
Data Engineering
Machine Learning
BI & Analytics
| Project | Description | Tools |
|---|---|---|
| π databricks-data-warehouse | Build a Data Warehouse in Databricks using a Medallion architecture (BronzeβSilverβGold), including ETL pipelines and dbt testing workflows. | Databricks, SQL, dbt |
| π ai-customer-growth-retention | Perform RFM customer segmentation and model churn, survival, and customer lifetime value (CLV) using transactional data to identify high-priority customer groups. | Python, Scikit-learn, Lifetimes, Pandas |
| π‘ recommender-systems-bootcamp | End-to-end recommender system pipeline covering data preprocessing, ANN-based retrieval, transformer ranking, offline evaluation, and REST API serving. | Python, PyTorch, FastAPI, FAISS |
| π€ sales_pipeline | Modern ELT data pipeline using dbt, Snowflake, and Apache Airflow to transform and orchestrate Snowflake TPCH sample datasets. | dbt, Snowflake, Apache Airflow |
| πΈ rag-chatbot | Simple RAG chatbot for flower recommendations using vector search, Vietnamese embeddings, and a lightweight Streamlit interface. | LangChain, Streamlit, Vector Database, LLM |


