Thanks to visit codestin.com
Credit goes to www.libhunt.com

Python vector-database

Open-source Python projects categorized as vector-database

Top 23 Python vector-database Projects

vector-database
  1. llama_index

    LlamaIndex is the leading framework for building LLM-powered agents over your data.

    Project mention: How to Build a RAG Solution with Llama Index, ChromaDB, and Ollama | dev.to | 2025-11-04

    Step 2: Set up LlamaIndex and Chroma DB

  2. Stream

    Stream - Scalable APIs for Chat, Feeds, Moderation, & Video. Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.

    Stream logo
  3. mem0

    Universal memory layer for AI Agents; Announcing OpenMemory MCP - local and secure memory management.

    Project mention: Write an Agent | news.ycombinator.com | 2025-11-06
  4. txtai

    💡 All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows

    Project mention: The AI-Native GraphDB + GraphRAG + Graph Memory Landscape & Market Catalog | dev.to | 2025-10-26

    GitHub: https://github.com/neuml/txtai

  5. memvid

    Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic search. No database needed.

    Project mention: Friday Links #30 — JavaScript Updates, Tools, and Inspiration | dev.to | 2025-10-17

    memvid - Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic search. No database needed.

  6. deeplake

    Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai

    Project mention: What I Learned Comparing Zilliz Cloud and Deep Lake for Scalable Vector Search | dev.to | 2025-06-09

    As I scaled up a semantic search engine for multi-modal content, I found myself at a fork in the road. Should I lean into a purpose-built vector database like Zilliz Cloud, or embrace a more flexible data lake approach with Deep Lake? These tools promise vector search at scale—but they come from fundamentally different architectural philosophies.

  7. cognee

    Memory for AI Agents in 6 lines of code

    Project mention: The AI-Native GraphDB + GraphRAG + Graph Memory Landscape & Market Catalog | dev.to | 2025-10-26

    URLs: https://github.com/topoteretes/cognee (hosted at cognee.ai / Cogwit)

  8. deep-searcher

    Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.

    Project mention: Deep Searcher, Open source deep researcher on your private data | news.ycombinator.com | 2025-02-21

    github https://github.com/zilliztech/deep-searcher

  9. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

    InfluxDB logo
  10. airweave

    Context retrieval for AI agents across apps and databases

    Project mention: Launch HN: Airweave (YC X25) – Let agents search any app | news.ycombinator.com | 2025-09-30
  11. LEANN

    RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.

    Project mention: First lightweight local semantic search MCP for Claude Code | news.ycombinator.com | 2025-08-15

    @Berkeley SkyLab, we’re the first to bring semantic search to Claude Code with a fully local index in a novel, lightweight structure — check it out at LEANN(https://github.com/yichuan-w/LEANN).

  12. raptor

    The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval

    Project mention: Graph RAG의 모든 것 | dev.to | 2025-04-20

    3.2. RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval (Stanford Univ, 2024)

  13. pymilvus

    Python SDK for Milvus Vector Database

  14. pixeltable

    Pixeltable — Data Infrastructure providing a declarative, incremental approach for multimodal AI workloads.

    Project mention: Stop Gluing Data Infrastructure Tools: Build Multimodal AI Workloads and Application with One Declarative Python SDK | dev.to | 2025-07-06

    Star us on GitHub: https://github.com/pixeltable/pixeltable

  15. SeaGOAT

    local-first semantic code search engine

  16. qdrant-client

    Python client for Qdrant vector search engine

  17. NeumAI

    Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.

  18. rag-demystified

    An LLM-powered advanced RAG pipeline built from scratch

  19. mcp-memory-service

    Universal MCP memory service with semantic search, multi-client support, and autonomous consolidation for Claude Desktop, VS Code, and 13+ AI applications

    Project mention: Supercharging Productivity with Cursor AI: A React Developer's Guide to MCP Servers and JSON Prompts | dev.to | 2025-04-17

    Key Takeaway: cursor10x-mcp and Repomix excel for speed and context. MCP Memory Service is great for quick wins, and Pieces organizes prompts. But tools alone don’t cut it—prompts are the real magic.

  20. llmflows

    LLMFlows - Simple, Explicit and Transparent LLM Apps

  21. vectordb

    A Python vector database you just need - no more, no less. (by jina-ai)

  22. langchain-chatbot

    AI Chatbot for analyzing/extracting information from data in conversational format.

  23. GradCache

    Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint

  24. redis-vl-python

    Redis Vector Library (RedisVL) -- the AI-native Python client for Redis.

  25. vector-db-benchmark

    Framework for benchmarking vector search engines

  26. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python vector-database discussion

Log in or Post with

Python vector-database related posts

  • Search Types in Cognee

    1 project | dev.to | 20 Oct 2025
  • Cognee: Building the Next Generation of Memory for AI Agents (OSS)

    1 project | dev.to | 17 Oct 2025
  • Launch HN: Airweave (YC X25) – Let agents search any app

    1 project | news.ycombinator.com | 30 Sep 2025
  • Show HN: Vectorless RAG

    6 projects | news.ycombinator.com | 27 Aug 2025
  • Show HN: Airweave – Let agents search any app

    1 project | news.ycombinator.com | 12 May 2025
  • Ingest (almost) any non-PDF document in a vector database, effortlessly

    4 projects | dev.to | 25 Apr 2025
  • 13 GitHub Projects that Supercharge Your AI and Development Journey 🚀

    11 projects | dev.to | 3 Mar 2025
  • A note from our sponsor - Stream
    getstream.io | 15 Nov 2025
    Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure. Learn more →

Index

What are some of the best open-source vector-database projects in Python? This list will help you:

# Project Stars
1 llama_index 45,183
2 mem0 42,849
3 txtai 11,819
4 memvid 10,372
5 deeplake 8,894
6 cognee 8,223
7 deep-searcher 7,138
8 airweave 5,116
9 LEANN 4,367
10 raptor 1,374
11 pymilvus 1,296
12 pixeltable 1,229
13 SeaGOAT 1,218
14 qdrant-client 1,132
15 NeumAI 861
16 rag-demystified 854
17 mcp-memory-service 852
18 llmflows 704
19 vectordb 631
20 langchain-chatbot 433
21 GradCache 409
22 redis-vl-python 341
23 vector-db-benchmark 340

Sponsored
Stream - Scalable APIs for Chat, Feeds, Moderation, & Video.
Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
getstream.io

Did you know that Python is
the 2nd most popular programming language
based on number of references?