Real World MCP Tutorial: Augmenting Model Context from Existing Data Sources

Hands-on learning and demo material: you run a small Olympic data stack in Docker, expose it through Google MCP Toolbox for Databases, then drive the same tools from a Python agent (OpenRouter) and a demo notebook—so you can see the same ideas in the UI, in logs, and in code.

Follow this order

1. Start Docker

From the repo root:

docker compose up --build

That brings up PostgreSQL and MongoDB, runs a one-shot seed (downloads public CSV slices from Hugging Face, loads both databases, then exits), and starts Toolbox on http://127.0.0.1:5050 (host 5050 → container 5000). Copy .env.example to .env if you want local overrides.

Detached (no log stream on your terminal): use docker compose up --build -d. Containers keep running in the background; follow logs only when you want, e.g. docker compose logs -f toolbox or docker compose logs -f for all services. docker compose ps shows status.

Postgres: localhost:5432 — default user / password / database: olympics.
MongoDB: localhost:27017 — database olympics.
Toolbox: set TOOLBOX_BASE_URL=http://127.0.0.1:5050 for Python (adjust if you change the published port in docker-compose.yml).

Re-seeding is skipped if data already exists. Clean slate: docker compose down -v then docker compose up --build again.

2. Optional: check the databases

With the stack up and seeded:

bash scripts/tests/test_databases.sh

For a full reset plus sample queries: bash scripts/tests/verify_stack.sh.

3. Python environment

Python 3.12+ recommended. From the repo root, create a virtual environment, activate it (so the next steps use this interpreter), and install the project in editable mode so import mcp_tutorial works like any other package (no PYTHONPATH):

python3.12 -m venv .venv
# macOS / Linux:
source .venv/bin/activate
# Windows cmd:        .venv\Scripts\activate.bat
# Windows PowerShell: .venv\Scripts\Activate.ps1
pip install -U pip
pip install -e .

That reads dependencies from pyproject.toml and registers the mcp_tutorial package in editable mode.

For Jupyter later, register a kernel from this same venv, e.g. python -m ipykernel install --user --name=mcp-tutorial --display-name="MCP tutorial".

Leave the venv activated (same shell, or point your IDE terminal at it) for steps 4 and 5.

4. Run the agent demo (terminal)

With the venv from step 3 still active, set OPENROUTER_API_KEY (see .env.example) if you have not already, then:

python scripts/run_agent_demo.py

This runs two cross-database questions against OpenRouter with Toolbox’s combined toolset. Watch INFO lines for when each chat request and each tool starts and finishes—handy if something feels slow.

5. Walk through the demo notebook

Open notebooks/demo_mcp_toolbox.ipynb with the same venv as the Jupyter kernel (step 3), Docker still running (step 1), and OPENROUTER_API_KEY set (e.g. in repo-root .env). Run cells in order from the top.

That notebook is the teaching path alongside the CLI demo:

Compose + YAML — short explanation of how Toolbox is wired in docker-compose.yml and how merged files under toolbox/config/tools/ define tools and the combined toolset.
Tool-calling loop in cells — imports, mapping Toolbox tools to OpenAI tools=, OpenRouter client, one completion at a time, then a full loop—implemented in the notebook as small steps (same ideas as run_chat_with_tools in mcp_tutorial.agent, not a re-export of that function).
Two cross-database runs — Example A (relational facts first, bios second) and Example B (biography search first, Postgres cross-check). Prompts for A/B are defined next to those cells; Example B uses a notebook-only simplified prompt (one random sport phrase, no hit-index nudge). The terminal demo still uses mcp_tutorial.prompts / build_prompt_b() for its second prompt.

For a quicker notebook that only calls run_chat_with_tools, see notebooks/simple_tool_calls.ipynb.

What you are looking at

Piece	Role
`pyproject.toml`	Declares the installable `mcp_tutorial` package (under `src/mcp_tutorial/`) and its dependencies.
`docker-compose.yml`	Postgres, Mongo, seed, Toolbox (inline comments describe each service).
`toolbox/config/tools/`	Merged YAML: sources, SQL/Mongo tools, toolsets (`combined`, `postgres_only`, `mongo_only`), prompts. Numbered filenames set merge order.
`src/mcp_tutorial/`	Tutorial package `mcp_tutorial`: `run_chat_with_tools`, `Settings`. Installed with `pip install -e .` in step 3.
`scripts/run_agent_demo.py`	Terminal walkthrough of two prompts.
`notebooks/demo_mcp_toolbox.ipynb`	Demo notebook: Compose/YAML context, step-by-step OpenRouter + Toolbox loop in cells, two cross-database examples.
`notebooks/simple_tool_calls.ipynb`	Minimal notebook: loads settings and calls `run_chat_with_tools` for the same two prompts.
`scripts/seed_databases.py`	Logic the seed container uses to fill the DBs.
`db/postgres/`, `db/mongo/`	Schema and init helpers for the containers.

Data (high level): Postgres holds core Olympic tables (countries, games, athletes, events); Mongo holds biography-style text and wide event rows—both fed from the public dataset SVeldman/126-years-olympic-results during seed.

Toolbox in the browser: http://127.0.0.1:5050/ui — try tools and parameters by hand. MCP (Streamable HTTP): http://127.0.0.1:5050/mcp — e.g. MCP Inspector quickstart (npx @modelcontextprotocol/inspector). Python uses the base URL only (no /mcp suffix).

Cross-database prompts the repo is built around: relational slice (medals, NOC, year, optional sport fragment) then Mongo biographies by athlete_id; or Mongo text search on biographies then Postgres checks for the same athlete.

Other OpenAI-compatible APIs: set OPENROUTER_BASE_URL and OPENROUTER_MODEL instead of OpenRouter defaults (see .env.example).

After edits to scripts/seed_databases.py, rebuild the seed image when re-running seed: docker compose run --build --rm seed.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Real World MCP Tutorial: Augmenting Model Context from Existing Data Sources

Follow this order

1. Start Docker

2. Optional: check the databases

3. Python environment

4. Run the agent demo (terminal)

5. Walk through the demo notebook

What you are looking at

About

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
data		data
db		db
docker		docker
notebooks		notebooks
scripts		scripts
src/mcp_tutorial		src/mcp_tutorial
toolbox/config/tools		toolbox/config/tools
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
docker-compose.yml		docker-compose.yml
pyproject.toml		pyproject.toml

Folders and files

Latest commit

History

Repository files navigation

Real World MCP Tutorial: Augmenting Model Context from Existing Data Sources

Follow this order

1. Start Docker

2. Optional: check the databases

3. Python environment

4. Run the agent demo (terminal)

5. Walk through the demo notebook

What you are looking at

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Contributors

Uh oh!

Languages