Stars
A comprehensive data engineering pipeline for Milan telecom data, featuring batch and streaming ETL with Apache Spark, Kafka for real-time ingestion, MinIO for S3-compatible storage, and Airflow fo…
An end-to-end, containerized data pipeline for near-real-time user event analytics using Kafka, ClickHouse, Airflow, and PySpark. Made to learn some common data engineering practices.
AI-powered FPL Scout using historical and live data with ML models to predict player performance and optimize team selection.
data-catering / data-caterer
Forked from pflooky/data-catererTest data management tool for any data source, batch or real-time. Generate, validate and clean up data all in one tool.
Apache Spark - A unified analytics engine for large-scale data processing
codsalah / Dimensional-Modeling-Practiical-guide
Forked from EbEmad/Dimensional-Modeling-Practiical-guideTranslate full books and large texts with LLM autonomously
building a modern data warehouse , including ETL processes, data modeling, and analytics.
A FREE pragmatic DevOps learning to kickstart your DevOps career and knowledge in the Cloud Native era following the Agile MVP style! ⭐ (2025 plans for DevOps, Cloud, Platform, SRE, SWE)
This repository is managed by LeetPush extension: https://github.com/husamahmud/LeetPush
Design and optimize a star schema in Hive on HDFS, ingesting data in CSV, Avro, and Parquet. Benchmark performance with compression algorithms and enable OLAP/OLTP querying using Spark, Hive, and T…
A simple user friendly command line tool to download YouTube videos and playlists with fewer steps.
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Chrome extension to capture and push LeetCode solutions to your GitHub repository.
Designed and implemented a medallion architecture on Azure Databricks, using Unity Catalog for data governance and Azure Data Lake Gen2 for storage. Created schemas with Spark SQL, ingested data f…