SmartRAG is a terminal-based Retrieval-Augmented Generation (RAG) system built using LangGraph. It routes user queries through a custom flow that includes message history, query transformation, and document retrieval from a vector store.
🔗 GitHub: https://github.com/aimaster-dev/SmartRAG
- LangGraph-powered RAG pipeline
- Smart routing of user queries
- PDF and Markdown ingestion support
- Optional webpage-to-PDF and PDF-to-Markdown conversion
- OpenAI GPT integration for natural language responses
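To illustrate the ingestion features above, here is a minimal sketch of a Markdown loader that splits files into overlapping chunks for retrieval. The function names and chunk sizes are illustrative assumptions, not SmartRAG's actual parameters:

```python
from pathlib import Path

def chunk_text(text: str, size: int = 500, overlap: int = 50) -> list[str]:
    """Split text into overlapping character chunks for retrieval."""
    chunks = []
    step = size - overlap
    for start in range(0, max(len(text), 1), step):
        chunk = text[start:start + size]
        if chunk:
            chunks.append(chunk)
    return chunks

def load_markdown_dir(data_dir: str) -> dict[str, list[str]]:
    """Read every .md file under data_dir and chunk it (illustrative)."""
    docs = {}
    for path in Path(data_dir).glob("**/*.md"):
        docs[path.name] = chunk_text(path.read_text(encoding="utf-8"))
    return docs
```

The overlap keeps sentences that straddle a chunk boundary retrievable from both sides, a common default in RAG pipelines.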
```
SmartRAG/
├── architecture/      # LangGraph RAG workflow logic
├── data/              # Processed Markdown or PDF content
├── modules/           # Core logic for query handling & doc processing
├── main.py            # Entry point
└── processDocs.py     # Document preprocessing script
```
Follow the steps below to get SmartRAG up and running:
```
git clone https://github.com/aimaster-dev/SmartRAG.git
cd SmartRAG
python3.12 -m venv venv
source venv/bin/activate   # Windows: venv\Scripts\activate
pip install -r requirements.txt
choco install wkhtmltopdf  # for HTML-to-PDF conversion (Windows only)
```

Copy and edit the .env file:

```
cp .env.example .env
```

Edit .env to include:
```
OPENAI_API_KEY=your_openai_key
URLS=url1,url2              # Optional: URLs to fetch as PDF
GET_WEB_PAGES_TO_PDF=True
CONVERT_PDF_TO_MD=True
INTERMEDIATE_PDF_DIR=./pdfs
DATA_DIR=./data
```

Then preprocess the documents:

```
python modules/processDocs.py
```
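If you want to sanity-check the configuration from code, a minimal stdlib-only `.env` reader could look like this. SmartRAG itself may load the file differently (e.g. via python-dotenv); this parser is an illustrative sketch:

```python
import os

def load_env(path: str = ".env") -> dict[str, str]:
    """Parse simple KEY=VALUE lines, ignoring blank lines and
    anything after an inline '#' comment."""
    values = {}
    with open(path, encoding="utf-8") as fh:
        for line in fh:
            line = line.split("#", 1)[0].strip()
            if not line or "=" not in line:
                continue
            key, _, value = line.partition("=")
            values[key.strip()] = value.strip()
    return values

# Example: export the parsed values so downstream scripts see them.
# for key, value in load_env().items():
#     os.environ.setdefault(key, value)
```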
⚠️ Make sure to update the `.env` parameters based on your use case.
Run the app:

```
python main.py
```

- User query is passed into a LangGraph workflow.
- Message history is cached and contextually enriched.
- If needed, input is transformed for better retrieval.
- Documents are pulled from a vector store using similarity search.
- GPT model generates a context-aware answer.
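The flow above can be sketched as a plain-Python pipeline. The real project wires these steps as LangGraph nodes and uses a genuine vector store and GPT model; the bag-of-words similarity and stubbed answer step below are illustrative stand-ins:

```python
from collections import Counter
from math import sqrt

def cosine_sim(a: str, b: str) -> float:
    """Bag-of-words cosine similarity (stand-in for embedding search)."""
    va, vb = Counter(a.lower().split()), Counter(b.lower().split())
    dot = sum(va[w] * vb[w] for w in va)
    norm = (sqrt(sum(v * v for v in va.values()))
            * sqrt(sum(v * v for v in vb.values())))
    return dot / norm if norm else 0.0

def transform_query(query: str, history: list[str]) -> str:
    """Enrich the query with recent message history for better retrieval."""
    return " ".join(history[-2:] + [query])

def retrieve(query: str, store: list[str], k: int = 2) -> list[str]:
    """Similarity search over an in-memory document store."""
    return sorted(store, key=lambda doc: cosine_sim(query, doc), reverse=True)[:k]

def answer(query: str, history: list[str], store: list[str]) -> str:
    """Full flow: transform -> retrieve -> (stubbed) generate."""
    enriched = transform_query(query, history)
    context = retrieve(enriched, store)
    history.append(query)  # cache the message history
    # A real system would call a GPT model here with the retrieved context.
    return f"Answer based on: {context}"

store = ["LangGraph builds stateful agent graphs",
         "Vector stores enable similarity search",
         "wkhtmltopdf converts web pages to PDF"]
print(answer("how does similarity search work", [], store))
```

Swapping the toy similarity for real embeddings and the stub for an LLM call gives the same shape as the LangGraph workflow described above.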
We welcome contributions!
- Fork the repo
- Create a feature branch
- Submit a pull request
Got a big idea? Open an issue to discuss it first.
For questions, feedback, or collaboration ideas — feel free to open an issue or reach out through GitHub!