Systematically Improving RAG Applications

A comprehensive course teaching data-driven approaches to building and improving Retrieval-Augmented Generation (RAG) systems. This repository contains course materials, code examples, and a companion book.

🎓 Take the Course

All of this material is supported by the Systematically Improving RAG Course.

Click here to get 20% off →

Course Overview

This course teaches you how to systematically improve RAG applications through:

Data-driven evaluation and metrics
Embedding fine-tuning and optimization
Query understanding and routing
Structured data integration
Production deployment strategies

The RAG Flywheel

The core philosophy centers around the "RAG Flywheel" - a continuous improvement cycle that emphasizes:

Measure: Establish benchmarks and evaluation metrics
Analyze: Understand failure modes and user patterns
Improve: Apply targeted optimizations
Iterate: Continuous refinement based on real-world usage

Repository Structure

.
├── cohort_1/        # First cohort materials (6 weeks)
├── cohort_2/        # Second cohort materials (weeks 0-6)
├── latest/          # Current course version with latest updates
│   ├── week0/       # Getting started with Jupyter, LanceDB, and evals
│   ├── week1/       # RAG evaluation foundations
│   ├── week2/       # Embedding fine-tuning
│   ├── week4/       # Query understanding and routing
│   ├── week5/       # Structured data and metadata
│   ├── week6/       # Tool selection and product integration
│   ├── case_study/  # Comprehensive WildChat project
│   └── extra_kura/  # Advanced notebooks on clustering and classifiers
├── docs/            # MkDocs documentation source
│   ├── workshops/   # Detailed chapter guides (0-7) aligned with course weeks
│   ├── talks/       # Industry expert presentations and case studies
│   ├── office-hours/# Q&A summaries from cohorts 2 and 3
│   ├── assets/      # Images and diagrams for documentation
│   └── misc/        # Additional learning resources
├── data/            # CSV files from industry talks
├── md/              # Markdown conversions of notebooks
└── mkdocs.yml       # Documentation configuration

Course Structure: Weekly Curriculum & Book Chapters

The course follows a 6-week structure where each week corresponds to specific workshop chapters in the companion book:

Week 1: Starting the Flywheel

Book Coverage: Chapter 0 (Introduction) + Chapter 1 (Starting the Flywheel with Data)
Topics:
- Shifting from static implementations to continuously improving products
- Overcoming the cold-start problem through synthetic data generation
- Establishing meaningful metrics aligned with business goals
- RAG as a recommendation engine wrapped around language models

Week 2: From Evaluation to Enhancement

Book Coverage: Chapter 2 (From Evaluation to Product Enhancement)
Topics:
- Transforming evaluation insights into concrete improvements
- Fine-tuning embeddings with Cohere and open-source models
- Re-ranking strategies and targeted capability development

Week 3: User Experience Design

Book Coverage: Chapter 3 (UX - 3 parts)
- Part 1: Design Principles
- Part 2: Feedback Collection
- Part 3: Iterative Improvement
Topics:
- Building interfaces that delight users and gather feedback
- Creating virtuous cycles of improvement
- Continuous refinement based on user interaction

Week 4: Query Understanding & Topic Modeling

Book Coverage: Chapter 4 (Topic Modeling - 2 parts)
- Part 1: Analysis - Segmenting users and queries
- Part 2: Prioritization - High-value opportunities
Topics:
- Query classification with BERTopic
- Pattern discovery in user queries
- Creating improvement roadmaps based on usage patterns

Week 5: Multimodal & Structured Data

Book Coverage: Chapter 5 (Multimodal - 2 parts)
- Part 1: Understanding different content types
- Part 2: Implementation strategies
Topics:
- Working with documents, images, tables, and structured data
- Metadata filtering and Text-to-SQL integration
- PDF parsing and multimodal embeddings

Week 6: Architecture & Product Integration

Book Coverage: Chapter 6 (Architecture - 3 parts)
- Part 1: Intelligent routing to specialized components
- Part 2: Building and integrating specialized tools
- Part 3: Creating unified product experiences
Topics:
- Tool evaluation and selection
- Performance optimization strategies
- Streaming implementations and production deployment

Capstone Project

A comprehensive project using the WildChat dataset that covers:

Data exploration and understanding
Vector database integration (ChromaDB, LanceDB, Turbopuffer)
Synthetic question generation
Summarization strategies
Complete test suite implementation

Technologies Used

LLM APIs: OpenAI, Anthropic, Cohere
Vector Databases: LanceDB, ChromaDB, Turbopuffer
ML/AI Frameworks: Sentence-transformers, BERTopic, Transformers
Evaluation Tools: Braintrust, Pydantic-evals
Monitoring: Logfire, production monitoring strategies
Data Processing: Pandas, NumPy, BeautifulSoup, SQLModel
Visualization: Matplotlib, Seaborn, Streamlit
CLI Framework: Typer + Rich for interactive command-line tools
Document Processing: Docling for PDF parsing and analysis

Course Book & Documentation

The /docs directory contains a comprehensive book built with MkDocs that serves as the primary learning resource:

Book Structure

Introduction & Core Concepts: The RAG Flywheel philosophy and product-first thinking
Workshop Chapters (0-6): Detailed guides that map directly to each course week
Office Hours: Q&A summaries from Cohorts 2 and 3 with real-world implementation insights
Industry Talks: Expert presentations including:
- RAG Anti-patterns in the Wild
- Semantic Search Over the Web
- Understanding Embedding Performance
- Online Evals and Production Monitoring
- RAG Without APIs (Browser-based approaches)

Key Themes in the Book

Product-First Thinking: Treating RAG as an evolving product, not a static implementation
Data-Driven Improvement: Using metrics, evaluations, and user feedback to guide development
Systematic Approach: Moving from ad-hoc tweaking to structured improvement processes
User-Centered Design: Focusing on user value and experience, not just technical capabilities
Continuous Learning: Building systems that improve with every interaction

To build and view the documentation:

# Serve documentation locally (live reload)
mkdocs serve

# Build static documentation
mkdocs build

Getting Started

Important: Use the `latest/` Directory

⚠️ Always work in the latest/ directory for the most current course content.

The cohort_1/ and cohort_2/ directories contain materials from previous course iterations and are kept for reference only. All new development and course work should be done in latest/.

Prerequisites

Python 3.11 (required - the project uses specific features from this version)
uv package manager (recommended) or pip

Installation

Clone the repository
Navigate to the latest/ directory:
```
cd latest/
```

Install dependencies:

# Using uv (recommended)
uv install

# Or using pip
pip install -e .

Start with week0/ for the most up-to-date content
Follow the notebooks in sequential order within each week
Reference the corresponding book chapters in /docs for deeper understanding

Code Quality

Before committing changes, run:

# Format and fix code issues
uv run ruff check --fix --unsafe-fixes .
uv run ruff format .

Philosophy

This course emphasizes:

Systematic Improvement: Data-driven approaches over guesswork
Product Thinking: Building RAG systems that solve real problems
Practical Application: Real-world datasets and examples
Evaluation-First: Measure before and after every change
Continuous Learning: The field evolves rapidly; the flywheel helps you adapt

Additional Resources

Industry talk transcripts in /data/
Office hours recordings summaries in /docs/office_hours/
Advanced notebooks in /latest/extra_kura/ for clustering and classification topics
Complete case study implementation in /latest/case_study/

License

This is educational material for the "Systematically Improving RAG Applications" course.

📧 Free Email Course

Want to learn more about RAG? Take our free email course and get the latest news and information about RAG techniques and best practices.

Sign up for the free RAG Crash Course →{ .md-button .md-button--primary }

🚀 Ready to Systematically Improve Your RAG Applications?

Join thousands of developers who are building better AI applications with our proven methodology.

--8<-- "snippets/enrollment-button.md" --8<--

Learn evaluation strategies, user experience design, and architectural patterns that actually work in production.

Name		Name	Last commit message	Last commit date
Latest commit History 189 Commits
.claude/agents		.claude/agents
.cursor/rules		.cursor/rules
.github/workflows		.github/workflows
cohort_1		cohort_1
cohort_2		cohort_2
docs		docs
images		images
intro-slides		intro-slides
latest		latest
md		md
scripts		scripts
.gitignore		.gitignore
.prettierignore		.prettierignore
AGENT.md		AGENT.md
AGENTS.md		AGENTS.md
CLAUDE.md		CLAUDE.md
README.md		README.md
build_book.sh		build_book.sh
mkdocs.yml		mkdocs.yml
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Systematically Improving RAG Applications

🎓 Take the Course

Course Overview

The RAG Flywheel

Repository Structure

Course Structure: Weekly Curriculum & Book Chapters

Week 1: Starting the Flywheel

Week 2: From Evaluation to Enhancement

Week 3: User Experience Design

Week 4: Query Understanding & Topic Modeling

Week 5: Multimodal & Structured Data

Week 6: Architecture & Product Integration

Capstone Project

Technologies Used

Course Book & Documentation

Book Structure

Key Themes in the Book

Getting Started

Important: Use the `latest/` Directory

Prerequisites

Installation

Code Quality

Philosophy

Additional Resources

License

📧 Free Email Course

🚀 Ready to Systematically Improve Your RAG Applications?

About

Uh oh!

Releases

Packages

Contributors 5

Languages

567-labs/systematically-improving-rag

Folders and files

Latest commit

History

Repository files navigation

Systematically Improving RAG Applications

🎓 Take the Course

Course Overview

The RAG Flywheel

Repository Structure

Course Structure: Weekly Curriculum & Book Chapters

Week 1: Starting the Flywheel

Week 2: From Evaluation to Enhancement

Week 3: User Experience Design

Week 4: Query Understanding & Topic Modeling

Week 5: Multimodal & Structured Data

Week 6: Architecture & Product Integration

Capstone Project

Technologies Used

Course Book & Documentation

Book Structure

Key Themes in the Book

Getting Started

Important: Use the latest/ Directory

Prerequisites

Installation

Code Quality

Philosophy

Additional Resources

License

📧 Free Email Course

🚀 Ready to Systematically Improve Your RAG Applications?

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 5

Languages

Important: Use the `latest/` Directory

Packages