Top 23 Python vector-database Projects
-
llama_index
Project mention: How to Build a RAG Solution with Llama Index, ChromaDB, and Ollama | dev.to | 2025-11-04
Step 2: Set up LlamaIndex and Chroma DB
-
mem0
Universal memory layer for AI Agents; Announcing OpenMemory MCP - local and secure memory management.
-
txtai
💡 All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows
Project mention: The AI-Native GraphDB + GraphRAG + Graph Memory Landscape & Market Catalog | dev.to | 2025-10-26
GitHub: https://github.com/neuml/txtai
-
memvid
Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic search. No database needed.
Project mention: Friday Links #30 — JavaScript Updates, Tools, and Inspiration | dev.to | 2025-10-17
-
deeplake
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
Project mention: What I Learned Comparing Zilliz Cloud and Deep Lake for Scalable Vector Search | dev.to | 2025-06-09
As I scaled up a semantic search engine for multi-modal content, I found myself at a fork in the road. Should I lean into a purpose-built vector database like Zilliz Cloud, or embrace a more flexible data lake approach with Deep Lake? These tools promise vector search at scale, but they come from fundamentally different architectural philosophies.
-
cognee
Project mention: The AI-Native GraphDB + GraphRAG + Graph Memory Landscape & Market Catalog | dev.to | 2025-10-26
URLs: https://github.com/topoteretes/cognee (hosted at cognee.ai / Cogwit)
-
deep-searcher
Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.
Project mention: Deep Searcher, Open source deep researcher on your private data | news.ycombinator.com | 2025-02-21
GitHub: https://github.com/zilliztech/deep-searcher
-
airweave
Project mention: Launch HN: Airweave (YC X25) – Let agents search any app | news.ycombinator.com | 2025-09-30
-
LEANN
RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.
Project mention: First lightweight local semantic search MCP for Claude Code | news.ycombinator.com | 2025-08-15
@Berkeley SkyLab, we’re the first to bring semantic search to Claude Code with a fully local index in a novel, lightweight structure. Check it out at LEANN (https://github.com/yichuan-w/LEANN).
-
raptor
The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
3.2. RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval (Stanford Univ, 2024)
-
pixeltable
Pixeltable — Data Infrastructure providing a declarative, incremental approach for multimodal AI workloads.
Project mention: Stop Gluing Data Infrastructure Tools: Build Multimodal AI Workloads and Application with One Declarative Python SDK | dev.to | 2025-07-06
Star us on GitHub: https://github.com/pixeltable/pixeltable
-
NeumAI
Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.
-
mcp-memory-service
Universal MCP memory service with semantic search, multi-client support, and autonomous consolidation for Claude Desktop, VS Code, and 13+ AI applications
Project mention: Supercharging Productivity with Cursor AI: A React Developer's Guide to MCP Servers and JSON Prompts | dev.to | 2025-04-17
Key Takeaway: cursor10x-mcp and Repomix excel for speed and context. MCP Memory Service is great for quick wins, and Pieces organizes prompts. But tools alone don’t cut it; prompts are the real magic.
-
langchain-chatbot
AI Chatbot for analyzing/extracting information from data in conversational format.
Python vector-database related posts
-
Search Types in Cognee
-
Cognee: Building the Next Generation of Memory for AI Agents (OSS)
-
Launch HN: Airweave (YC X25) – Let agents search any app
-
Show HN: Vectorless RAG
-
Show HN: Airweave – Let agents search any app
-
Ingest (almost) any non-PDF document in a vector database, effortlessly
-
13 GitHub Projects that Supercharge Your AI and Development Journey 🚀
Index
What are some of the best open-source vector-database projects in Python? This list will help you:
| # | Project | Stars |
|---|---|---|
| 1 | llama_index | 45,183 |
| 2 | mem0 | 42,849 |
| 3 | txtai | 11,819 |
| 4 | memvid | 10,372 |
| 5 | deeplake | 8,894 |
| 6 | cognee | 8,223 |
| 7 | deep-searcher | 7,138 |
| 8 | airweave | 5,116 |
| 9 | LEANN | 4,367 |
| 10 | raptor | 1,374 |
| 11 | pymilvus | 1,296 |
| 12 | pixeltable | 1,229 |
| 13 | SeaGOAT | 1,218 |
| 14 | qdrant-client | 1,132 |
| 15 | NeumAI | 861 |
| 16 | rag-demystified | 854 |
| 17 | mcp-memory-service | 852 |
| 18 | llmflows | 704 |
| 19 | vectordb | 631 |
| 20 | langchain-chatbot | 433 |
| 21 | GradCache | 409 |
| 22 | redis-vl-python | 341 |
| 23 | vector-db-benchmark | 340 |