Stallians
🎯 Focusing


This is a repo with links to everything you'd ever want to learn about data engineering

Jupyter Notebook 38,574 7,421 Updated Oct 29, 2025

This is a repo with links to everything you'd ever want to learn about data engineering

Jupyter Notebook 771 173 Updated Dec 18, 2024

Self-serve BI to 10x your data team ⚡️

TypeScript 5,333 645 Updated Nov 12, 2025

Learn Apache Spark in Scala, Python (PySpark) and R (SparkR) by building your own cluster with a JupyterLab interface on Docker. ⚡

Jupyter Notebook 499 197 Updated Nov 7, 2025

Container Management and Kubernetes on the Desktop

TypeScript 6,788 338 Updated Nov 12, 2025

Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026. Join the course here 👇🏼

Jupyter Notebook 33,462 7,109 Updated Oct 15, 2025

Roadmap to becoming a data engineer in 2021

12,721 1,363 Updated Jan 25, 2022

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 326,289 53,138 Updated Nov 3, 2025

An Awesome List of Open-Source Data Engineering Projects

2,876 509 Updated Oct 4, 2024

Near real time ETL to populate a dashboard.

Python 73 46 Updated Sep 9, 2025

PySpark Spotify ETL

Python 17 2 Updated Aug 19, 2021

Beginner data engineering project - batch edition

HTML 549 185 Updated Jan 22, 2025

A full data warehouse infrastructure with ETL pipelines running inside docker on Apache Airflow for data orchestration, AWS Redshift for cloud data warehouse and Metabase to serve the needs of data…

Python 139 30 Updated Apr 18, 2020

Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.

Jupyter Notebook 6,807 745 Updated Nov 12, 2025

The next-generation couch surfing platform. Free forever. Community‑led. Non‑profit. Modern. Chuck us a star :)

TypeScript 496 89 Updated Nov 12, 2025

Everything you need to know to get the job.

Java 64,654 12,935 Updated May 12, 2025

A topic-centric list of HQ open datasets.

70,407 10,901 Updated Nov 5, 2025

Learn how to design, develop, deploy and iterate on production-grade ML applications.

Jupyter Notebook 44,331 6,906 Updated Aug 18, 2024

100 Days of ML Coding

48,710 11,192 Updated Dec 29, 2023

📊 Path to a free self-taught education in Data Science!

20,579 3,894 Updated May 13, 2025

A list of semi to fully remote-friendly companies (jobs) in tech.

JavaScript 39,487 3,868 Updated Oct 27, 2025

The Python micro framework for building web applications.

Python 70,766 16,619 Updated Oct 14, 2025

GitHub repo for uploading demo files from YouTube videos and LinkedIn

Jupyter Notebook 339 352 Updated Apr 18, 2021

A progressive webapp template.

JavaScript 167 50 Updated Aug 24, 2022

Machine Learning University: Accelerated Tabular Data Class

Jupyter Notebook 1,032 311 Updated Oct 12, 2024

DroneKit-Python library for communicating with Drones via MAVLink.

Python 1,821 1,496 Updated May 30, 2024

A complete daily plan for studying to become a machine learning engineer.

28,646 6,219 Updated Jun 11, 2024

💫 Beautiful spinners for terminal, IPython and Jupyter

Python 3,000 151 Updated Jun 16, 2024

A Python library built to empower developers to build applications and systems with self-contained computer vision capabilities

Python 8,841 2,205 Updated Aug 3, 2024