π AI Driven Data Engineer with over 5 years of experience building scalable, reliable, and production-ready data systems. I specialize in ETL development, real-time processing, and cloud-native architectures.
- Languages: Python, SQL, Bash
- Big Data: PySpark, EMR, Glue, Athena
- Cloud Platforms: AWS (Lambda, Redshift, S3), GCP
- Data Tools: Airflow, Dagster, Kafka, Adjust API, Pandas
- AI Tools: LangChain, LangGraph, Model Context Protocol (MCP), LLMs (GPT, Claude, Gemini), Automation Frameworks
- Databases: MySQL, PostgreSQL, Redshift, NoSQL
- Building and orchestrating end-to-end ETL pipelines
- Streaming data from MT4/MT5 platforms for FX trading
- Automating data validation and reporting processes
- Designing data warehouses for reporting and analytics
- API integrations for marketing and user behavior platforms (e.g., Adjust, Amplitude)
- π Redshift ETL Pipeline with Dagster
- π§Ή Data Sentinel: Autonomous data quality agent. Self-checking, reasoning, fixing. Shows data engineering + AI skills
βοΈ This GitHub profile showcases selected open-source and self-built projects inspired by real production experience β with client-sensitive details removed for confidentiality.