Stars
The Dark Arts of Advanced and Unsafe Rust Programming
A static site generator for data apps, dashboards, reports, and more. Observable Framework combines JavaScript on the front-end for interactive graphics with any language on the back-end for data a…
A cross platform way to express data transformation, relational algebra, standardized record expression and plans.
Real-time analytics on Postgres tables
An SDK for working with LLMs and AI Agents from Apache Airflow, based on Pydantic AI
Boilerplates for running DLT on AWS Lambda to create well-structured datasets from unstructured JSON without breaking a sweat
Orchestrate everything - from scripts to data, infra, AI, and business - as code, with UI and AI Copilot. Simple. Fast. Scalable.
Embeddable stream processing engine based on Apache DataFusion
Open Source AI Platform - AI Chat with advanced features that works with every LLM
Malloy is a modern open source language for describing data relationships and transformations.
Educational project on how to build an ETL (Extract, Transform, Load) data pipeline, orchestrated with Airflow.
The Prometheus monitoring system and time series database.
Self-Hosting Guide. Learn all about locally hosting (on premises & private web servers) and managing software applications by yourself or your organization. Including Cloud, LLMs, WireGuard, Automa…
ZenML 🙏: MLOps for Reliable AI: from Classical ML to Agents. https://zenml.io.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
This is a repo with links to everything you'd ever want to learn about data engineering
Code for the Kaggle Ensembling Guide Article on MLWave
An ongoing list of pandas quirks
A community based Python library for quantitative economics
ggplot2: elegant graphics for data analysis
R's data.table package extends data.frame:
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
Companion webpage to the book "Mathematics For Machine Learning"
a multi-system chiptune tracker compatible with DefleMask modules