Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
Top 23 Python vector-database Projects
-
llama_index
Project mention: How to Build a RAG Solution with Llama Index, ChromaDB, and Ollama | dev.to | 2025-11-04
Step 2: Set up LlamaIndex and Chroma DB
-
mem0
Universal memory layer for AI Agents; Announcing OpenMemory MCP - local and secure memory management.
-
txtai
💡 All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows
Project mention: The AI-Native GraphDB + GraphRAG + Graph Memory Landscape & Market Catalog | dev.to | 2025-10-26
GitHub: https://github.com/neuml/txtai
-
memvid
Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic search. No database needed.
Project mention: Friday Links #30 — JavaScript Updates, Tools, and Inspiration | dev.to | 2025-10-17
-
deeplake
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
Project mention: What I Learned Comparing Zilliz Cloud and Deep Lake for Scalable Vector Search | dev.to | 2025-06-09
As I scaled up a semantic search engine for multi-modal content, I found myself at a fork in the road. Should I lean into a purpose-built vector database like Zilliz Cloud, or embrace a more flexible data lake approach with Deep Lake? These tools promise vector search at scale—but they come from fundamentally different architectural philosophies.
-
cognee
Project mention: The AI-Native GraphDB + GraphRAG + Graph Memory Landscape & Market Catalog | dev.to | 2025-10-26
URLs: https://github.com/topoteretes/cognee (hosted at cognee.ai / Cogwit)
-
deep-searcher
Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.
Project mention: Deep Searcher, Open source deep researcher on your private data | news.ycombinator.com | 2025-02-21
GitHub: https://github.com/zilliztech/deep-searcher
-
airweave
Project mention: Launch HN: Airweave (YC X25) – Let agents search any app | news.ycombinator.com | 2025-09-30
-
LEANN
RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.
Project mention: First lightweight local semantic search MCP for Claude Code | news.ycombinator.com | 2025-08-15
@Berkeley SkyLab, we’re the first to bring semantic search to Claude Code with a fully local index in a novel, lightweight structure — check it out at LEANN (https://github.com/yichuan-w/LEANN).
-
raptor
The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
3.2. RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval (Stanford Univ, 2024)
-
pixeltable
Pixeltable — Data Infrastructure providing a declarative, incremental approach for multimodal AI workloads.
Project mention: Stop Gluing Data Infrastructure Tools: Build Multimodal AI Workloads and Application with One Declarative Python SDK | dev.to | 2025-07-06
Star us on GitHub: https://github.com/pixeltable/pixeltable
-
NeumAI
Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.
-
mcp-memory-service
Universal MCP memory service with semantic search, multi-client support, and autonomous consolidation for Claude Desktop, VS Code, and 13+ AI applications
Project mention: Supercharging Productivity with Cursor AI: A React Developer's Guide to MCP Servers and JSON Prompts | dev.to | 2025-04-17
Key Takeaway: cursor10x-mcp and Repomix excel for speed and context. MCP Memory Service is great for quick wins, and Pieces organizes prompts. But tools alone don’t cut it—prompts are the real magic.
-
langchain-chatbot
AI Chatbot for analyzing/extracting information from data in conversational format.
Python vector-database related posts
-
Search Types in Cognee
-
Cognee: Building the Next Generation of Memory for AI Agents (OSS)
-
Launch HN: Airweave (YC X25) – Let agents search any app
-
Show HN: Vectorless RAG
-
Show HN: Airweave – Let agents search any app
-
Ingest (almost) any non-PDF document in a vector database, effortlessly
-
13 GitHub Projects that Supercharge Your AI and Development Journey 🚀
-
A note from our sponsor - Stream
getstream.io | 16 Nov 2025
Index
What are some of the best open-source vector-database projects in Python? This list will help you:
| # | Project | Stars |
|---|---|---|
| 1 | llama_index | 45,183 |
| 2 | mem0 | 42,849 |
| 3 | txtai | 11,819 |
| 4 | memvid | 10,372 |
| 5 | deeplake | 8,894 |
| 6 | cognee | 8,614 |
| 7 | deep-searcher | 7,138 |
| 8 | airweave | 5,116 |
| 9 | LEANN | 4,367 |
| 10 | raptor | 1,374 |
| 11 | pymilvus | 1,296 |
| 12 | pixeltable | 1,229 |
| 13 | SeaGOAT | 1,218 |
| 14 | qdrant-client | 1,139 |
| 15 | NeumAI | 861 |
| 16 | rag-demystified | 854 |
| 17 | mcp-memory-service | 852 |
| 18 | llmflows | 704 |
| 19 | vectordb | 631 |
| 20 | langchain-chatbot | 433 |
| 21 | GradCache | 409 |
| 22 | redis-vl-python | 341 |
| 23 | vector-db-benchmark | 340 |