jmdu99/jmdu99

Hi there 👋, I'm Jose

Freelance Data Engineer: turning messy data into clarity for projects with real impact.
I work with purpose-driven teams to build data systems they can trust.

🛠 What I Do

  • 📊 Centralise scattered data into a single source of truth
  • ⚙️ Automate cleaning & validation for always-ready data
  • 🚀 Design efficient ETL/ELT pipelines (Airflow, dbt, Spark…)
  • 📈 Build solid foundations for BI, ML & GenAI
  • ⏱ Create real-time dataflows when speed matters

💻 Tech Stack

Core Skills & Tooling

Python, SQL, Bash, Git, GitHub, Poetry, Pylint, Pandas, NumPy

Ingestion, Orchestration & Processing

Apache Airflow, Cloud Composer (GCP), MWAA (AWS), dbt, Fivetran, Airbyte, Prefect, Apache Spark, PySpark, Apache Beam, Dataflow (GCP), Dataproc (GCP), Spark Structured Streaming, Apache Kafka, Google Pub/Sub, Apache NiFi, Web scraping

Data Platforms & Storage

Amazon S3, Google Cloud Storage, Parquet, BigQuery, Snowflake, Amazon Redshift, Amazon Athena, PostgreSQL, MongoDB, Cassandra, ClickHouse

Cloud & DevOps

Amazon EC2, Google Compute Engine, Terraform (IaC), Docker, Docker Compose, GitHub Actions (CI/CD), IAM / RBAC

ML, NLP & Knowledge Graphs

Generative AI, Large Language Models, OpenAI API, LangChain (RAG), Hugging Face Transformers, NLTK, spaCy, scikit-learn, PyTorch, TensorFlow, SPARQL, AWS SageMaker

Analytics & Visualization

Matplotlib, Seaborn, Plotly, Amazon QuickSight, Apache Superset

🎯 About Me

Since 2021, I've worked in data across tech, banking, and large-scale systems (Amazon, Slido/Cisco).
In 2025, I went freelance to focus on projects with real impact, from healthtech and edtech to any sector that values purpose as much as results.
I also donate 10% of my earnings to the GiveWell Top Charities Fund.

🗂 Portfolio & Contact

💼 Portfolio request → LinkedIn
📩 Let’s connect and discuss how to make your data work better.

πŸ† GitHub Trophies

trophy
