Understanding Retrieval-Augmented Generation (RAG)

Retrieval-Augmented Generation (RAG) is a hybrid AI framework that combines retrieval of relevant information with generation of coherent answers, improving on traditional AI models by grounding responses in specific documents. It involves data preparation, query processing, retrieval, generation, and delivery of answers, and it applies to fields such as customer support and research. RAG can give accurate, scalable answers, but its effectiveness depends on the quality of the knowledge base, and it can incur notable computational costs on large datasets.

Uploaded by

Rohit Srivastava


What is RAG? A Real-Life Analogy

► Imagine a super-smart librarian in a vast library.

► Someone asks: “What are the benefits of a Mediterranean diet?”
► Librarian searches the catalog, pulls relevant books, and summarizes key points.
► RAG does this digitally: searches data and generates answers.
What is Retrieval-Augmented Generation (RAG)?

► A hybrid AI framework combining:

► Retrieval: Finds relevant info from a dataset.
► Generation: Crafts coherent answers using AI.
► Used in chatbots, search tools, and research assistants.
► Helps keep answers accurate and up to date.
Why Do We Need RAG?

► Traditional AI models have limitations:

► Limited Knowledge: Fixed at training time, so it can be outdated.
► Hallucination: May generate incorrect info.
► Lack of Specificity: Struggle with domain-specific data.
► RAG addresses these by grounding answers in specific documents.
How Does RAG Work?

1. Data Preparation: Build a knowledge base (documents, PDFs, etc.).
2. Query Processing: User asks a question (e.g., “How many vacation days?”).
3. Retrieval: Find relevant document chunks.
4. Generation: AI crafts a coherent answer.
5. Delivery: Return answer with source references.
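The five steps above can be sketched as one function that wires the stages together. This is only an illustration under loud assumptions: `answer_question`, `retrieve`, and `generate` are hypothetical names, the knowledge base is two hand-written chunks, and the retriever and generator are trivial stand-ins (word overlap and an echo of the context) for a vector search and a language model.

```python
# Step 1 (data preparation): a tiny hand-made knowledge base of (source, text) chunks.
KNOWLEDGE_BASE = [
    ("Employee Handbook, Section 4.2", "Employees get 20 vacation days per year."),
    ("Employee Handbook, Section 5.1", "Expense reports are due by the 5th of each month."),
]

def retrieve(query, k=1):
    """Stand-in retriever: rank chunks by how many words they share with the query."""
    query_words = set(query.lower().replace("?", "").split())

    def overlap(chunk):
        chunk_words = set(chunk[1].lower().replace(".", "").split())
        return len(query_words & chunk_words)

    return sorted(KNOWLEDGE_BASE, key=overlap, reverse=True)[:k]

def generate(prompt):
    """Stand-in generator: echoes the first context line.
    A real system would send the prompt to a language model here."""
    return prompt.splitlines()[1]

def answer_question(query, retrieve_fn, generate_fn):
    # Step 2: the query arrives as plain text.
    chunks = retrieve_fn(query)                           # Step 3: retrieval
    context = "\n".join(text for _, text in chunks)
    prompt = f"Context:\n{context}\nQuestion: {query}"   # augmentation
    answer = generate_fn(prompt)                          # Step 4: generation
    # Step 5: deliver the answer together with its sources.
    return {"answer": answer, "sources": [source for source, _ in chunks]}

result = answer_question("How many vacation days?", retrieve, generate)
print(result)
```

Passing the retriever and generator in as arguments keeps the data flow visible and lets either stage be swapped for a real component later.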
Step 1: Data Preparation

► Collect Documents: E.g., employee handbook, research papers.
► Chunking: Split into small pieces (100-500 words).
► Embeddings: Convert chunks to numerical vectors using models like BERT.
► Indexing: Store embeddings in a vector index or database (e.g., FAISS, Pinecone).
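A minimal sketch of chunking, embedding, and indexing, using only the standard library. Everything named here is an assumption for illustration: `chunk_words`, `toy_embed`, and `build_index` are hypothetical helpers, the hashed bag-of-words vector stands in for a learned embedding model such as BERT, and the plain Python list stands in for a vector store such as FAISS or Pinecone.

```python
import hashlib
import math

def chunk_words(text, max_words=100):
    """Split text into consecutive chunks of at most max_words words."""
    words = text.split()
    return [" ".join(words[i:i + max_words]) for i in range(0, len(words), max_words)]

def toy_embed(text, dim=64):
    """Toy embedding: hashed bag-of-words, scaled to unit length.
    A stand-in for a learned model; it captures word overlap, not meaning."""
    vec = [0.0] * dim
    for word in text.lower().split():
        token = word.strip(".,;:!?\"'()")
        if not token:
            continue
        # md5 gives a hash that is stable across runs (unlike built-in hash()).
        bucket = int(hashlib.md5(token.encode("utf-8")).hexdigest(), 16) % dim
        vec[bucket] += 1.0
    norm = math.sqrt(sum(v * v for v in vec))
    return [v / norm for v in vec] if norm else vec

def build_index(documents, max_words=100):
    """Chunk every document and store each chunk with its vector and source."""
    index = []
    for source, text in documents.items():
        for n, chunk in enumerate(chunk_words(text, max_words)):
            index.append({
                "source": source,
                "chunk_id": n,
                "text": chunk,
                "vector": toy_embed(chunk),
            })
    return index

# A 250-word placeholder document splits into chunks of 100, 100, and 50 words.
handbook = " ".join(f"word{i}" for i in range(250))
index = build_index({"Employee Handbook": handbook}, max_words=100)
print(len(index), [len(entry["text"].split()) for entry in index])
```

Keeping the source name next to each vector is what later makes it possible to return source references with the answer.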
Steps 2-3: Query Processing and Retrieval

► Query Embedding: Convert user question to a vector.

► Similarity Search: Compare query to document embeddings using cosine similarity.
► Retrieve Top-K Chunks: E.g., top 5 most relevant document pieces.
► Example: Query “How many vacation days?” retrieves “20 days per year” chunk.
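Cosine similarity and top-K selection can be sketched in a few lines. This is an illustration, not a production retriever: `embed`, `cosine_similarity`, and `top_k` are hypothetical names, the example chunks are invented, and the sparse word-count vector is an assumption standing in for a learned query/document embedding. Real systems usually delegate the search to a vector index rather than scoring every chunk.

```python
import math
from collections import Counter

PUNCTUATION = ".,;:!?\"'()"

def embed(text):
    """Toy embedding: sparse word-count vector (stand-in for a learned model)."""
    tokens = (word.strip(PUNCTUATION).lower() for word in text.split())
    return Counter(token for token in tokens if token)

def cosine_similarity(a, b):
    """cos(a, b) = (a . b) / (|a| * |b|); returns 0.0 if either vector is empty."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)

def top_k(query, chunks, k=5):
    """Score every chunk against the query and keep the k best (brute force)."""
    query_vec = embed(query)
    scored = [(cosine_similarity(query_vec, embed(chunk)), chunk) for chunk in chunks]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)[:k]

chunks = [
    "Employees receive 20 vacation days per year.",
    "Expense reports are due by the 5th of each month.",
    "The office is closed on public holidays.",
]
results = top_k("How many vacation days?", chunks, k=2)
for score, chunk in results:
    print(f"{score:.3f}  {chunk}")
```

The brute-force loop scores every chunk, so its cost grows with the size of the knowledge base; this is the computational cost that a dedicated vector index is meant to reduce.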
Steps 4-5: Generation and Delivery

► Augmentation: Combine query and retrieved chunks into a prompt.
► Generation: Use a language model (e.g., GPT, LLaMA) to create an answer.
► Example Answer: “Employees get 20 vacation days per year.”
► Delivery: Return answer with source (e.g., “Employee Handbook, Section 4.2”).
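The augmentation and delivery steps might look like the sketch below. The prompt wording, the `Chunk` class, and the `build_prompt` and `answer_with_sources` helpers are all assumptions made for illustration, and `fake_llm` is a placeholder that returns a canned reply where a real system would call a language model such as GPT or LLaMA.

```python
from dataclasses import dataclass

@dataclass
class Chunk:
    source: str
    text: str

def build_prompt(query, chunks):
    """Augmentation: put the retrieved chunks and the question into one prompt."""
    context = "\n".join(
        f"[{i}] {chunk.text} (source: {chunk.source})"
        for i, chunk in enumerate(chunks, start=1)
    )
    return (
        "Answer the question using only the context below. "
        "If the context does not contain the answer, say so.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {query}\nAnswer:"
    )

def fake_llm(prompt):
    """Placeholder for a real model call; always returns the same canned reply."""
    return "Employees get 20 vacation days per year."

def answer_with_sources(query, chunks, llm=fake_llm):
    """Generation and delivery: return the answer together with its sources."""
    prompt = build_prompt(query, chunks)
    return {"answer": llm(prompt), "sources": [chunk.source for chunk in chunks]}

retrieved = [Chunk("Employee Handbook, Section 4.2", "Vacation: 20 days per year.")]
prompt = build_prompt("How many vacation days?", retrieved)
response = answer_with_sources("How many vacation days?", retrieved)
print(prompt)
print(response)
```

Telling the model to rely only on the supplied context is one common way to keep the answer grounded, and carrying the source names through to the response is what makes the answer traceable.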
Benefits and Limitations

Benefits:
► Accurate, fact-based answers.
► Flexible for any domain.
► Scalable with large datasets.
► Transparent with source references.

Limitations:
► Depends on quality of knowledge base.
► Retrieval errors can affect answers.
► Computational cost for large datasets.
Real-World Applications

► Customer Support: Chatbots answering from manuals or FAQs.
► Enterprise Search: Querying internal policies or specs.
► Research: Summarizing papers with citations.
► Legal/Healthcare: Retrieving case law or medical literature.
Questions?

Thank you! Any questions?
