Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View talhaumer's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report talhaumer

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
talhaumer/README.md

πŸ‘‹ Hi, I'm Talha Umer

πŸš€ AI Driven Data Engineer with over 5 years of experience building scalable, reliable, and production-ready data systems. I specialize in ETL development, real-time processing, and cloud-native architectures.

πŸ› οΈ Tech Stack

  • Languages: Python, SQL, Bash
  • Big Data: PySpark, EMR, Glue, Athena
  • Cloud Platforms: AWS (Lambda, Redshift, S3), GCP
  • Data Tools: Airflow, Dagster, Kafka, Adjust API, Pandas
  • AI Tools: LangChain, LangGraph, Model Context Protocol (MCP), LLMs (GPT, Claude, Gemini), Automation Frameworks
  • Databases: MySQL, PostgreSQL, Redshift, NoSQL

πŸ“Š Focus Areas

  • Building and orchestrating end-to-end ETL pipelines
  • Streaming data from MT4/MT5 platforms for FX trading
  • Automating data validation and reporting processes
  • Designing data warehouses for reporting and analytics
  • API integrations for marketing and user behavior platforms (e.g., Adjust, Amplitude)

πŸ“Œ Featured Projects

πŸ“« Connect With Me


βš™οΈ This GitHub profile showcases selected open-source and self-built projects inspired by real production experience β€” with client-sensitive details removed for confidentiality.

Pinned Loading

  1. talhaumer talhaumer Public

    Data Engineer | Python | ETL | AWS | Redshift | PySpark | Building robust data pipelines & analytics solutions

  2. agentic-data-sentinel agentic-data-sentinel Public

    Data Sentinel is an agentic AI platform that autonomously ensures data quality, anomaly detection, and insight generation across modern data warehouses.

    Python

  3. dagster_redshift_etl dagster_redshift_etl Public

    A production-ready ETL pipeline using Dagster to extract data from MySQL, transform it with Python, and load it into Amazon Redshift. Includes modular query management, robust logging, scheduling, …

    Python

  4. travel_concierge travel_concierge Public

    Python