Thanks to visit codestin.com
Credit goes to www.libhunt.com

Python SQL

Open-source Python projects categorized as SQL

Top 23 Python SQL Projects

  1. devops-exercises

    Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP, DNS, Elastic, Network, Virtualization. DevOps Interview Questions

    Project mention: A collection of exercises and examples for learning DevOps concepts | news.ycombinator.com | 2025-06-29
  2. Stream

    Stream - Scalable APIs for Chat, Feeds, Moderation, & Video. Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.

    Stream logo
  3. pandas-ai

    Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.

    Project mention: Pandas AI | news.ycombinator.com | 2025-07-18
  4. vanna

    🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using Agentic Retrieval 🔄.

    Project mention: Beyond the Diff: How Deep Context Analysis Caught a Critical Bug in a 20K-Star Open Source Project | dev.to | 2025-10-20

    A developer submitted PR #951 to Vanna.ai, a popular open-source text-to-SQL tool with 20,000+ stars. The change added Databricks integration—156 lines of well-documented code supporting two connection engines (SQL warehouse and ODBC).

  5. sqlmodel

    SQL databases in Python, designed for simplicity, compatibility, and robustness.

    Project mention: Architecture Patterns with Python | news.ycombinator.com | 2025-03-28

    an SQLModel entity backed by a database table doesn't validate its fields on creation, which is the point of Pydantic.

    https://github.com/fastapi/sqlmodel/issues/52#issuecomment-1...

  6. SQLAlchemy

    The Database Toolkit for Python

    Project mention: How to Make Websites That Will Require Lots of Your Time and Energy | news.ycombinator.com | 2025-07-28

    at the very least, if you are really writing lots of INSERTs by hand I bet you are either not quoting properly or you are writing queries with 15 placeholders and someday you'll put one in the wrong place.

    ORMs and related toolkits have come a long way since they were called the "Vietnam of Computer Science". I am a big fan of JooQ in Java

    https://www.jooq.org/

    and SQLAlchemy in Python

    https://www.sqlalchemy.org/

    Note both of these support both an object <-> SQL mapper (usually with generated objects) that covers the case of my code sample above, and a DSL for SQL inside the host language which is delightful if you want to do code generation to make query builders and stuff like that. I work on a very complex search interface which builds out joins, subqueries, recursive CTEs, you name it, and the code is pretty easy to maintain.

  7. datasette

    An open source multi-tool for exploring and publishing data

    Project mention: The current state of LLM-driven development | news.ycombinator.com | 2025-08-10

    I've been using LLM-assistance for my larger open source projects - https://github.com/simonw/datasette https://github.com/simonw/llm and https://github.com/simonw/sqlite-utils - for a couple of years now.

    Also literally hundreds of smaller plugins and libraries and CLI tools, see https://github.com/simonw?tab=repositories (now at 880 repos) and https://pypi.org/user/simonw/ (340 published packages).

    Unlike my tools.simonwillison.net stuff the vast majority of those products are covered by automated tests and usually have comprehensive documentation too.

  8. q

    q - Run SQL directly on delimited files and multi-file sqlite databases (by harelba)

    Project mention: XAN: A Modern CSV-Centric Data Manipulation Toolkit for the Terminal | news.ycombinator.com | 2025-03-27

    I used to use q for this sort of thing. Not sure if there are better choices now as it have been a few years.

    https://harelba.github.io/q/

  9. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

    InfluxDB logo
  10. modin

    Modin: Scale your Pandas workflows by changing a single line of code

  11. sqlfluff

    A modular SQL linter and auto-formatter with support for multiple dialects and templated code.

  12. countries-states-cities-database

    🌍 Discover our global repository of countries, states, and cities! 🏙️ Get comprehensive data in JSON, SQL, PSQL, SQLSERVER, MONGODB, SQLITE, XML, YAML, and CSV formats. Access ISO2, ISO3 codes, country code, capital, native language, timezones (for countries), and more. #countries #states #cities

  13. sqlglot

    Python SQL Parser and Transpiler

    Project mention: Text2SQL is dead – long live text2SQL | news.ycombinator.com | 2025-10-29

    Agreed, and it's an amazingly well-maintained GitHub repo: https://github.com/tobymao/sqlglot

    Big kudos to Toby and the team.

  14. Mage

    🧙 The modern replacement for Airflow. Mage is an open-source data pipeline tool for transforming and integrating data. https://github.com/mage-ai/mage-ai

    Project mention: The Data Awakens: My First Pipeline with Mage AI | dev.to | 2025-09-11

    That’s where Mage AI stood out. From the very first try to run it , it feels really easy and straight forward .

  15. ibis

    the portable Python dataframe library

    Project mention: Notes from PyData Berlin 2025 | dev.to | 2025-09-10

    https://github.com/ibis-project/ibis and

  16. Flask-AppBuilder

    Simple and rapid application development framework, built on top of Flask. includes detailed security, auto CRUD generation for your models, google charts and much more. Demo (login with guest/welcome) - http://flaskappbuilder.pythonanywhere.com/

  17. dataset

    Easy-to-use data handling for SQL data stores with support for implicit table creation, bulk loading, and transactions.

  18. alembic

    A database migrations tool for SQLAlchemy.

  19. dataherald

    Interact with your SQL database, Natural Language to SQL using LLMs

  20. ethereum-etl

    Python scripts for ETL (extract, transform and load) jobs for Ethereum blocks, transactions, ERC20 / ERC721 tokens, transfers, receipts, logs, contracts, internal transactions. Data is available in Google BigQuery https://goo.gl/oY5BCQ

  21. pg_activity

    pg_activity is a top like application for PostgreSQL server activity monitoring.

    Project mention: Pg_activity: Htop for PostgreSQL | news.ycombinator.com | 2025-05-13
  22. django-sql-explorer

    SQL reporting that Just Works. Fast, simple, and confusion-free. Write and share queries in a delightful SQL editor, with AI assistance.

  23. PyPika

    PyPika is a python SQL query builder that exposes the full richness of the SQL language using a syntax that reflects the resulting query. PyPika excels at all sorts of SQL queries but is especially useful for data analysis.

  24. sqlmesh

    Scalable and efficient data transformation framework - backwards compatible with dbt.

  25. fugue

    A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.

  26. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python SQL discussion

Log in or Post with

Python SQL related posts

Index

What are some of the best open-source SQL projects in Python? This list will help you:

# Project Stars
1 devops-exercises 79,855
2 pandas-ai 22,534
3 vanna 21,588
4 sqlmodel 17,122
5 SQLAlchemy 11,108
6 datasette 10,519
7 q 10,331
8 modin 10,325
9 sqlfluff 9,299
10 countries-states-cities-database 8,938
11 sqlglot 8,568
12 Mage 8,517
13 ibis 6,211
14 Flask-AppBuilder 4,921
15 dataset 4,827
16 alembic 3,747
17 dataherald 3,574
18 ethereum-etl 3,103
19 pg_activity 2,929
20 django-sql-explorer 2,853
21 PyPika 2,750
22 sqlmesh 2,708
23 fugue 2,122

Sponsored
Stream - Scalable APIs for Chat, Feeds, Moderation, & Video.
Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
getstream.io

Did you know that Python is
the 2nd most popular programming language
based on number of references?