document-search

Here are 37 public repositories matching this topic...

deepsense-ai / ragbits

Building blocks for rapid development of GenAI applications

optimization evaluation agents prompts document-search rag guardrails llms vector-stores

Updated Oct 25, 2025
Python

neuml / paperai

📄 🤖 AI for medical and scientific papers

python search nlp machine-learning ai artificial-intelligence medical scientific-papers document-search txtai

Updated Jul 9, 2025
Python

redis-developer / redis-arXiv-search

Vector search demo with the arXiv paper dataset, RedisVL, HuggingFace, OpenAI, Cohere, FastAPI, React, and Redis.

react nlp redis machine-learning openai arxiv document-retrieval cohere document-search vector-search huggingface vector-database arxiv-papers

Updated Apr 15, 2025
Python

robindekoster / chatgpt-custom-knowledge-chatbot

This open source chatbot project lets you create a chatbot that uses your own data to answer questions, thanks to the power of the OpenAI GPT-3.5 model.

python machine-learning ai chatbot python3 openai gpt knowledge-base document-search contextual-chatbot chatgpt chatgpt-api openai-chatgpt llama-index

Updated Jul 13, 2023
Python

capjamesg / jamesql

An in-memory NoSQL database implemented in Python.

python nosql nosql-database web-search document-search

Updated Feb 10, 2025
Python

kcubeterm / achoz

Search through all your personal data efficiently, like a web search.

search-engine crawler websearch filesearch document-search

Updated Jan 31, 2023
Python

neuml / cord19q

COVID-19 Open Research Dataset (CORD-19) Analysis

python search nlp machine-learning medical scientific-papers document-search covid-19

Updated Nov 20, 2022
Python

laxmanclo / pany

PostgreSQL-native semantic search engine with multi-modal capabilities. Add AI-powered search to your existing database without separate vector databases, vendor fees, or complex setup. Features text + image search using CLIP embeddings, native SQL joins, and 10-minute Docker deployment.

Updated Jul 4, 2025
Python

aimaster-dev / chatbot-using-rag-and-langchain

Chat with your PDFs using AI! This Streamlit app uses RAG, LangChain, FAISS, and OpenAI to let you ask questions and get answers with page and file references.

Updated May 29, 2025
Python

co-dev0909 / chatbot-using-rag-and-langchain

Chat with your PDFs using AI! This Streamlit app uses RAG, LangChain, FAISS, and OpenAI to let you ask questions and get answers with page and file references.

Updated Jul 14, 2025
Python

lethalbit / bookwurm

dead simple document index and search, nothing fancy

document-search document-indexing

Updated Mar 28, 2024
Python

aimaster-dev / SmartRAG

SmartRAG is a terminal-based RAG system using LangGraph. It processes queries by retrieving relevant content from markdown or PDFs, then responds using OpenAI GPT. Supports webpage-to-PDF conversion, vector DB search, and modular flow control.

Updated Jun 17, 2025
Python

GoodGuyAdy / QueryBaseAI

AI-powered hybrid search engine combining keyword, vector, and LLM-based contextual search using RAG with support for AI21, OpenAI or any other LLM.

elasticsearch django ai django-rest-framework openai document-search rag vector-search milvus llm

Updated May 3, 2025
Python

Qyokizzzz / simhash

An extended version of simhash that supports fingerprint extraction from documents and images.

fingerprint simhash image-search image-deduplication document-search

Updated Aug 22, 2022
Python

tomlin7 / AI-research-assistant

Semantic document search system with pgvector and PGAI

postgres machine-learning natural-language-processing ai sentiment-analysis text-similarity postgresql assistant text-summarization summarization semantic-search sentence-embeddings document-search research-assistant sentence-transformers pgvector ollama pgai

Updated Nov 9, 2024
Python

jbmiller10 / semantik

Semantik is a self-hosted semantic search engine for your documents.

search docker ai self-hosted embeddings nas document-search rag vector-search

Updated Oct 22, 2025
Python

shreyansh-kothari / PDF-Querying-using-TF-IDF-from-Scratch

Given a set of PDFs and a query, finds the most relevant PDF using TF-IDF. TF-IDF is implemented from scratch, without any library.

python glob pdf-converter python3 tf-idf querying pdfminer document-search pdf-search

Updated Oct 15, 2019
Python

kunjankanani / Document_Query_Search

Retrieval-Augmented Generation (RAG) enhances pre-trained large language models (LLMs) by integrating them with external data sources. The technique combines the generative power of LLMs with the precision of specialized data search mechanisms.

document-search rag llm retrieval-augmented-generation document-query-search

Updated Jul 16, 2024
Python

salameaz / pdf-process-rag

A Python-based application that extracts and processes PDF content using a Retrieval-Augmented Generation (RAG) approach. Leverage vector embeddings to enable efficient querying of both text-based and scanned PDFs, and interact with your documents using a large language model.

python nlp machine-learning document-search rag pdf-processing streamlit vector-embeddings retrieval-augmented-generation

Updated Jul 11, 2025
Python

Jivl00 / KIV_IR

Semester project for the Information Retrieval course

information-retrieval web-crawler inverted-index tf-idf stemming lemmatization vector-model czech-language boolean-model document-search pyqt5-desktop-application

Updated May 29, 2025
Python
