Stars
The official Python SDK for Sentry.io
Persistent, stale-free, local and cross-machine caching for Python functions.
🦔 PostHog provides open-source web & product analytics, session recording, feature flagging and A/B testing that you can self-host. Get started - free.
🔧 High-performance Python rate limiting library with multiple algorithms (Fixed Window, Sliding Window, Token Bucket, Leaky Bucket & GCRA) and storage backends (Redis, In-Memory).
📄🧠 PageIndex: Document Index for Reasoning-based RAG
This repository delivers end-to-end, code-first tutorials covering every layer of production-grade GenAI agents, guiding you from spark to scale with proven patterns and reusable blueprints for re…
🪢 Langfuse Python SDK - Instrument your LLM app with decorators or low-level SDK and get detailed tracing/observability. Works with any LLM or framework
Data validation made beautiful and powerful
Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastructure.
A tiny, useful Python lib of string, file, and object utilities
Buckaroo - The data table UI for Notebooks. Quickly explore dataframes, scroll through dataframes, search, sort, view summary stats and histograms. Works with Pandas, Polars, Jupyter, Marimo, VSCod…
Document intelligence framework for Python - Extract text, metadata, and structured data from PDFs, images, Office documents, and more. Built on Pandoc, PDFium, and Tesseract.
A proof of concept to scrape papers from journals
High accuracy RAG for answering questions from scientific documents with citations
What are the principles we can use to build LLM-powered software that is actually good enough to put in the hands of production customers?
High level asynchronous concurrency and networking framework that works on top of either Trio or asyncio
The fastest, lightest, and easiest-to-integrate AI gateway on the market. Fully open-sourced.
OpenTelemetry Instrumentation for AI Observability
☄️ Python's nested data operator (and CLI), for all your declarative restructuring needs. Got data? Glom it! ☄️
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.