Parsing-free RAG supported by VLMs
Vector search demo with the arXiv paper dataset, RedisVL, HuggingFace, OpenAI, Cohere, FastAPI, React, and Redis.
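The core of such a vector-search demo is: embed documents, embed the query, rank by cosine similarity. A minimal dependency-free sketch of that idea, using a toy hashed bag-of-words embedder as a stand-in for the real HuggingFace/OpenAI/Cohere models and Redis index the demo wires together (the `embed`/`search` helpers here are illustrative, not the repo's API):

```python
import math

def embed(text, dim=64):
    """Toy hashed bag-of-words embedding (placeholder for a real model)."""
    vec = [0.0] * dim
    for token in text.lower().split():
        vec[hash(token) % dim] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]

def cosine(a, b):
    # Vectors are already L2-normalized, so the dot product is the cosine.
    return sum(x * y for x, y in zip(a, b))

def search(query, docs, k=3):
    """Rank docs by cosine similarity to the query and return the top k."""
    qv = embed(query)
    scored = [(cosine(qv, embed(d)), d) for d in docs]
    return [d for _, d in sorted(scored, reverse=True)[:k]]
```

A production system would precompute and persist the document embeddings (e.g. in Redis) rather than re-embedding on every query.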
🐊 Snappy's unique approach unifies vision-language late interaction with structured OCR for region-level knowledge retrieval. Like the project? Drop a star! ⭐
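"Late interaction" here refers to ColBERT-style scoring: instead of collapsing query and document into single vectors, every query token embedding is matched against its best document token embedding (MaxSim), and the per-token maxima are summed. A hedged sketch of that scoring rule, with plain lists standing in for the vision-language encoder outputs:

```python
def maxsim_score(query_vecs, doc_vecs):
    """Late-interaction (MaxSim) score: for each query token embedding,
    take the max dot product with any document token embedding, then sum."""
    def dot(a, b):
        return sum(x * y for x, y in zip(a, b))
    return sum(max(dot(q, d) for d in doc_vecs) for q in query_vecs)
```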
Vietnamese long form question answering system with documents retrieval.
[VLSP 2025] ViDRILL is a Vietnamese document retrieval system built for VLSP 2025. It combines dense and sparse retrieval, reranking, and optional LLM-based query rewriting and reasoning to support high-accuracy information retrieval and future LLM-enhanced pipelines.
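One common way hybrid retrievers merge a dense and a sparse ranking before reranking is reciprocal rank fusion (RRF). This is a sketch of that fusion step under the assumption the system uses something RRF-like (the paper/repo may fuse differently); `k=60` is the constant usually quoted for RRF:

```python
def rrf_fuse(rankings, k=60):
    """Fuse several ranked lists of doc ids with reciprocal rank fusion.

    Each document scores sum(1 / (k + rank)) over the lists it appears in,
    so items ranked highly by multiple retrievers float to the top.
    """
    scores = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=scores.get, reverse=True)
```

Usage: `rrf_fuse([dense_ranking, sparse_ranking])` returns a single fused ranking that a cross-encoder reranker can then rescore.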
Implementation of ECIR 2022 Paper: How Can Graph Neural Networks Help Document Retrieval: A Case Study on CORD19 with Concept Map Generation
Retrieves the top 10 documents from the Wikipedia corpus for a user-supplied free-text query
Document Querying with LLMs - Google PaLM API: Semantic Search With LLM Embeddings
A Python-based tool for context-based search across text documents using OpenAI embeddings and Chroma vector storage. This system enables efficient querying of document collections by generating vector embeddings, storing them persistently, and retrieving relevant results based on textual queries.
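The embed → store persistently → query-by-similarity loop such tools follow can be sketched without the OpenAI or Chroma dependencies. Here a toy character-trigram embedder stands in for OpenAI embeddings and an in-memory list for Chroma's persistent collection; the `VectorStore` class and its `add`/`query` methods are illustrative names, not the tool's actual interface:

```python
import math

def trigram_embed(text, dim=128):
    """Toy character-trigram embedding (placeholder for OpenAI embeddings)."""
    vec = [0.0] * dim
    t = text.lower()
    for i in range(len(t) - 2):
        vec[hash(t[i:i + 3]) % dim] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]

class VectorStore:
    """In-memory stand-in for a persistent vector collection."""

    def __init__(self):
        self._items = []  # (doc_id, embedding) pairs

    def add(self, doc_id, text):
        self._items.append((doc_id, trigram_embed(text)))

    def query(self, text, n_results=3):
        qv = trigram_embed(text)
        ranked = sorted(self._items,
                        key=lambda item: sum(a * b for a, b in zip(qv, item[1])),
                        reverse=True)
        return [doc_id for doc_id, _ in ranked[:n_results]]
```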
The Intelligent "ASKDOC" project combines the power of Langchain, Azure, OpenAI models, and Python to deliver an intelligent question-answering system that scans your PDF documents and answers queries based on their contents. It can be queried in natural human language.
A comprehensive multimodal OCR application that supports both image and video document processing using state-of-the-art vision-language models. This application provides an intuitive Gradio interface for extracting text, converting documents to markdown, and performing advanced document analysis.
Code and dataset for the paper "Redefining Absent Keyphrases and their Effect on Retrieval Effectiveness"
"LangChat Explorer: Your intuitive document companion. Effortlessly explore vast information with natural language conversations. Simplify queries, gain insights, and embark on a seamless journey of knowledge discovery. Unleash the power of language with LangChat Explorer."
An experimental document-focused Vision-Language Model application that provides advanced document analysis, text extraction, and multimodal understanding capabilities. This application features a streamlined Gradio interface for processing both images and videos using state-of-the-art vision-language models specialized in document understanding.
Neural text summarization for document retrieval
CodeXpert: A cutting-edge AI-powered code analysis tool leveraging CodeLlama, FAISS, and HuggingFace for efficient code understanding, explanation, and optimization. 🚀✨
A two-stage information retrieval model using a baseline TF-IDF model and a refined BM25 model.
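The second-stage scorer in such a pipeline is typically Okapi BM25, which refines TF-IDF with term-frequency saturation (`k1`) and document-length normalization (`b`). A compact self-contained sketch with the standard parameter defaults (no stemming or stopword removal, so purely illustrative):

```python
import math
from collections import Counter

def bm25_scores(query, docs, k1=1.5, b=0.75):
    """Okapi BM25 score of each document against a whitespace-tokenized query."""
    toks = [d.lower().split() for d in docs]
    avgdl = sum(len(t) for t in toks) / len(toks)
    N = len(docs)
    scores = []
    for t in toks:
        tf = Counter(t)
        s = 0.0
        for term in query.lower().split():
            df = sum(1 for d in toks if term in d)  # document frequency
            if df == 0:
                continue
            idf = math.log(1 + (N - df + 0.5) / (df + 0.5))
            denom = tf[term] + k1 * (1 - b + b * len(t) / avgdl)
            s += idf * tf[term] * (k1 + 1) / denom
        scores.append(s)
    return scores
```

In a two-stage setup, the cheap TF-IDF pass would shortlist candidates and this scorer would re-rank only the shortlist.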
Doc-VLMs-v2-Localization is a demo app for the Camel-Doc-OCR-062825 model, fine-tuned from Qwen2.5-VL-7B-Instruct for advanced document retrieval, extraction, and analysis. It enhances document understanding and also integrates other notable Hugging Face models.