AI Engineer | Data Engineer | Python Developer | Application Architect
🌐 Portfolio | 💼 LinkedIn
A Large Language Model (LLM) is an advanced AI system trained to understand and generate human-like text.
It learns from massive public datasets such as Wikipedia, books, and websites.
Key capabilities:
- Answering questions
- Summarizing content
- Translating languages
- Engaging in human-like conversation
Examples: ChatGPT, Google Gemini, Claude, LLaMA
While LLMs are powerful, they come with limitations in business environments:
- No access to internal documents, company policies, or private databases
- Cannot provide accurate, up-to-date answers about internal or proprietary information
- Lack of context about your organization limits their usefulness in real-world applications
Retrieval-Augmented Generation (RAG) bridges the gap between LLMs and enterprise knowledge:
- Retrieves relevant internal data in real-time
- Augments the LLM's responses with accurate, business-specific context
- Delivers precise and trustworthy answers based on your own content
Result: Smarter, enterprise-ready AI that truly understands your business
This project implements a Retrieval-Augmented Generation (RAG) system using local PDF documents as the knowledge base. Built with LangChain, ChromaDB, and a local LLM via Ollama, this setup enables question-answering over your own documents using efficient vector search and contextual responses from LLMs.
- PDF Loader: Extracts text from PDF files in your local `data/` directory.
- Text Chunking: Splits text into overlapping chunks using `RecursiveCharacterTextSplitter` to preserve context (see the sketch below this list).
- Vector Store (ChromaDB): Stores and indexes embeddings for fast similarity search.
- Local LLM via Ollama: Generates contextual answers using lightweight models like Mistral, all on your local machine.
- Testing Suite: Validates system accuracy using test queries.
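The ingestion side of the pipeline might look roughly like the sketch below. This is illustrative, not the actual code: it assumes the `langchain-community` and `langchain-text-splitters` packages, and the chunk sizes are example values; the real logic lives in populate_database.py.

```python
# Illustrative sketch of the load-and-chunk step (the real code is in populate_database.py).
from langchain_community.document_loaders import PyPDFDirectoryLoader
from langchain_text_splitters import RecursiveCharacterTextSplitter

def load_and_chunk(data_path: str = "data"):
    # Load every PDF in the data/ directory into LangChain Document objects.
    documents = PyPDFDirectoryLoader(data_path).load()
    # Split into overlapping chunks so each chunk keeps some surrounding context.
    splitter = RecursiveCharacterTextSplitter(
        chunk_size=800,     # characters per chunk (illustrative value)
        chunk_overlap=80,   # overlap between consecutive chunks
        length_function=len,
    )
    return splitter.split_documents(documents)
```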
```
rag_local_pdfs/
│
├── chroma/                     # Vector store; created when populate_database.py runs
├── data/                       # Local PDFs for ingestion
├── media/                      # Concept images, documents
├── get_embedding_function.py   # Defines embedding logic
├── populate_database.py        # Loads, chunks, embeds, and stores PDFs
├── query_data.py               # Queries ChromaDB and returns LLM response
├── requirements.txt            # Python dependencies
└── README.md
```
```bash
cd rag_local_pdfs
python -m venv venv
source venv/bin/activate   # on Windows use venv\Scripts\activate
pip install -r requirements.txt
```
Put your PDF files in the `data/` directory for ingestion.
Install Ollama from https://ollama.com, then run the command below to download the model:

```bash
ollama run llama3.2
```
Make sure Ollama is running in the background: opening http://127.0.0.1:11434/ in your browser should show "Ollama is running".
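Optionally, a quick Python check (not part of this repo) can confirm the server is reachable:

```python
# Optional sanity check: confirm the local Ollama server responds.
import urllib.request

with urllib.request.urlopen("http://127.0.0.1:11434/") as resp:
    print(resp.read().decode())   # expected output: "Ollama is running"
```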
```bash
python populate_database.py
```
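Conceptually, this step embeds the chunks with a locally served embedding model and persists them in ChromaDB. The sketch below is illustrative: the embedding model name is an assumption (any Ollama-served embedding model works), and the real logic lives in get_embedding_function.py and populate_database.py.

```python
# Illustrative sketch of the populate step (the real code is in populate_database.py).
# The embedding model name is an assumption; see get_embedding_function.py for the actual choice.
from langchain_community.embeddings import OllamaEmbeddings
from langchain_community.vectorstores import Chroma

def get_embedding_function():
    # Any embedding model served by Ollama works; "nomic-embed-text" is just an example.
    return OllamaEmbeddings(model="nomic-embed-text")

def add_to_chroma(chunks, persist_dir: str = "chroma"):
    # Embed each chunk and index it in the persistent Chroma store.
    db = Chroma(persist_directory=persist_dir, embedding_function=get_embedding_function())
    db.add_documents(chunks)
    return db
```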
```bash
python query_data.py "What is the summary of the document?"
```
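Under the hood, the query step retrieves the chunks most similar to the question and asks the local model to answer from that context. A minimal sketch, assuming LangChain's Ollama wrapper and the repo's get_embedding_function module (the prompt wording and `k` value are illustrative; the real code is in query_data.py):

```python
# Minimal sketch of the query flow (the real code is in query_data.py).
from langchain_community.llms import Ollama
from langchain_community.vectorstores import Chroma

from get_embedding_function import get_embedding_function  # repo module

def query_rag(question: str, persist_dir: str = "chroma") -> str:
    db = Chroma(persist_directory=persist_dir, embedding_function=get_embedding_function())
    # Retrieve the top-k chunks most similar to the question to ground the answer.
    results = db.similarity_search(question, k=5)
    context = "\n\n---\n\n".join(doc.page_content for doc in results)
    prompt = (
        "Answer the question using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}"
    )
    # The model name must match one already pulled with `ollama run` / `ollama pull`.
    return Ollama(model="llama3.2").invoke(prompt)
```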
**Local LLM Support**
Uses Ollama for running models like:
- llama3.2
- gemma
Change the model name in query_data.py if you prefer a different LLM (see the example below).
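For example, assuming query_data.py instantiates the model through LangChain's Ollama wrapper, switching models is a one-line change (the model must already be pulled locally):

```python
from langchain_community.llms import Ollama

llm = Ollama(model="gemma")   # e.g. instead of Ollama(model="llama3.2")
```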
- GUI or Streamlit interface
- Multi-file PDF summarization
- Real-time chat over documents
- RAG + Agent framework (e.g., LangGraph or CrewAI)