InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now. Learn more →
Top 23 Python Database Projects
-
quivr
Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: PGVector, Faiss. Any Files. Anyway you want.
-
InfluxDB
InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
-
SQLMap Project
-
pandas-ai
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
-
vanna
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using Agentic Retrieval 🔄.
Project mention: Beyond the Diff: How Deep Context Analysis Caught a Critical Bug in a 20K-Star Open Source Project | dev.to | 2025-10-20A developer submitted PR #951 to Vanna.ai, a popular open-source text-to-SQL tool with 20,000+ stars. The change added Databricks integration—156 lines of well-documented code supporting two connection engines (SQL warehouse and ODBC).
-
DB-GPT
AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents
Project mention: Launch HN: Gecko Security (YC F24) – AI That Finds Vulnerabilities in Code | news.ycombinator.com | 2025-08-01Yes, that's exactly what we do. Some examples: https://github.com/eosphoros-ai/DB-GPT/pull/2650, https://github.com/dagster-io/dagster/pull/30002
We just need to follow responsible disclosure first by notifying the maintainers, working with them on a fix, and making it public once it is resolved.
-
gel
Gel supercharges Postgres with a modern data model, graph queries, Auth & AI solutions, and much more. (by geldata)
[2] https://github.com/geldata/gel/issues/4228#issuecomment-1208...
-
-
Stream
Stream - Scalable APIs for Chat, Feeds, Moderation, & Video. Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
-
Project mention: CLI to manage your SQL database schemas and migrations | news.ycombinator.com | 2025-11-01
Tangential, but anyone can suggest their favorite SQL client? Many years ago on Windows I enjoyed HeidiSQL, and while you can kind of use it with wine, it doesn't make a stable impression to me.
Recently I found mycli[1], which seems slightly better than the official mariadb cli client, but still a bit cumbersome.
[1] https://github.com/dbcli/mycli
-
Project mention: XAN: A Modern CSV-Centric Data Manipulation Toolkit for the Terminal | news.ycombinator.com | 2025-03-27
I used to use q for this sort of thing. Not sure if there are better choices now as it have been a few years.
https://harelba.github.io/q/
-
-
https://github.com/ibis-project/ibis and
-
FinanceDatabase
This is a database of 300.000+ symbols containing Equities, ETFs, Funds, Indices, Currencies, Cryptocurrencies and Money Markets.
-
-
dataset
Easy-to-use data handling for SQL data stores with support for implicit table creation, bulk loading, and transactions.
-
I was looking for a simple, no bloat sqlite explorer that runs in a container and ended up deploying sqlite-web[1] in read-only mode. Very impressed, no external dependencies, works great. A CLI tool is nice, but then still have to ssh into the remote server to access the db file.
[1] https://github.com/coleifer/sqlite-web
-
-
We use a BSL for our product (https://morphik.ai) and usually stay away from calling it anything. We'd just say "repo is public at: https://github.com/morphik-org/morphik-core". I like the term fair source, though.
Is it correct to assume that software than eventually becomes open under something like Apache or MIT is fair source? Or is there more subtlety to it?
-
pgadmin4
pgAdmin is the most popular and feature rich Open Source administration and development platform for PostgreSQL, the most advanced Open Source database in the world.
Project mention: Asynchronous SQLAlchemy 2: A simple step-by-step guide to configuration, models, relationships, and migrations using Alembic | dev.to | 2025-06-03Go to the official PGAdmin website and download the latest version.
-
baserow
Open-source no-code platform and the best alternative to Airtable. Build databases, applications, automations, and dashboards — all in one secure, self-hostable environment. Empower your team to create powerful tools without writing a single line of code.
Have any thoughts on how it compares to https://baserow.io/ ?
-
-
tencent-ml-images
Largest multi-label image database; ResNet-101 model; 80.73% top-1 acc on ImageNet
-
PyPika
PyPika is a python SQL query builder that exposes the full richness of the SQL language using a syntax that reflects the resulting query. PyPika excels at all sorts of SQL queries but is especially useful for data analysis.
-
GarminDB
Download and parse data from Garmin Connect or a Garmin watch, FitBit CSV, and MS Health CSV files into and analyze data in Sqlite serverless databases with Jupyter notebooks.
Project mention: Garmin Beats Apple to Market with Satellite-Connected Smartwatch | news.ycombinator.com | 2025-09-07If you've got a Garmin device check out GarminDB [0]. Garmin actually exposes an API that you can access with your credentials and get the raw activity, heart rate, etc data.
[0]: https://github.com/tcgoetz/GarminDB
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Python Database discussion
Python Database related posts
-
Ask HN: What Are You Working On? (Nov 2025
-
Turning PySpark into a Universal DataFrame API
-
CLI to manage your SQL database schemas and migrations
-
Beyond the Diff: How Deep Context Analysis Caught a Critical Bug in a 20K-Star Open Source Project
-
All Data and AI Weekly #210: 6 Oct 2025
-
From Knowledge Graph Generation to RAG for Stablecoin Regulatory Intelligence
-
Database of Databases
-
A note from our sponsor - InfluxDB
www.influxdata.com | 15 Nov 2025
Index
What are some of the best open-source Database projects in Python? This list will help you:
| # | Project | Stars |
|---|---|---|
| 1 | quivr | 38,596 |
| 2 | SQLMap | 35,820 |
| 3 | pandas-ai | 22,534 |
| 4 | vanna | 21,588 |
| 5 | DB-GPT | 17,603 |
| 6 | gel | 13,936 |
| 7 | pgcli | 12,811 |
| 8 | mycli | 11,758 |
| 9 | q | 10,331 |
| 10 | TinyDB | 7,366 |
| 11 | ibis | 6,211 |
| 12 | FinanceDatabase | 6,044 |
| 13 | superduper | 5,228 |
| 14 | dataset | 4,827 |
| 15 | sqlite-web | 3,923 |
| 16 | dataherald | 3,574 |
| 17 | morphik-core | 3,367 |
| 18 | pgadmin4 | 3,291 |
| 19 | baserow | 3,238 |
| 20 | arctic | 3,080 |
| 21 | tencent-ml-images | 3,077 |
| 22 | PyPika | 2,750 |
| 23 | GarminDB | 2,743 |