Stars
Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends
Real-time monitor and web admin for Celery distributed task queue
The 500 AI Agents Projects is a curated collection of AI agent use cases across various industries. It showcases practical applications and provides links to open-source projects for implementationβ¦
PgQueuer is a Python library leveraging PostgreSQL for efficient job queuing.
Business intelligence as code: build fast, interactive data visualizations in SQL and markdown
Apache Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metadata. Runs and scales everywhere python does.
Extremely fast Query Engine for DataFrames, written in Rust
trying out DuckLake from DuckDB with Postgres
DuckLake is an integrated data lake and catalog format
Frouros: an open-source Python library for drift detection in machine learning systems.
Terraform Visual is an interactive way of visualizing your Terraform plan
Example π Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using π§ Amazon SageMaker.
Data validation using Python type hints
Probabilistic Hierarchical forecasting π with statistical and econometric methods.
π Python-powered shell. Full-featured and cross-platform.
A PDF viewer that seamlessly integrates with any JavaScript project
Find, verify, and analyze leaked credentials
Forecasting: principles and practice in python
Template repo for kickstarting recipes for regression use case
Distributed Task Queue (development branch)