Self-hosted retrieval-augmented generation stack for searching and chatting over your personal Telegram history. The project ships with ingestion, hybrid retrieval, and an authenticated UI; everything runs locally via Docker Compose.
- Index DMs, groups, channels, and Saved Messages with Telethon and store content in Postgres
- Hybrid Vespa retrieval (vector + BM25 + recency) with optional VoyageAI rerank
- Chat workflow that compresses context, answers strictly from your data, and returns citations
- Astro + React interface with login, filters, live search results, and model selection
- Docker-first deployment with automatic Vespa package activation on startup
```
[indexer] Telethon daemon or one-shot -> chunk -> cache -> OpenAI embed -> Vespa upsert
  |-- Postgres (sync state, embedding cache, chunks)
  `-- Telethon session persisted on Docker volume
[api]   FastAPI -> /auth, /models, /search, /chat (LLM answer + citations)
[ui]    Astro + React -> login, filters, hybrid search, chat
[vespa] Hybrid retrieval + recency boosting (auto-deployed on startup)
```
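The actual rank profile lives in `vespa/application`; purely as an illustration of how the three hybrid signals can be blended, here is a sketch in Python. The weights, the BM25 squashing, and the 30-day half-life are all hypothetical, not the values the project uses:

```python
import math
import time
from typing import Optional

def hybrid_score(
    vector_sim: float,            # cosine similarity from the embedding match, in [0, 1]
    bm25: float,                  # raw BM25 score from the text match
    message_ts: float,            # message timestamp (Unix seconds)
    now: Optional[float] = None,
    half_life_days: float = 30.0, # hypothetical recency half-life
    w_vec: float = 0.6,           # hypothetical signal weights
    w_bm25: float = 0.3,
    w_recency: float = 0.1,
) -> float:
    """Blend semantic, lexical, and recency signals into a single score."""
    now = time.time() if now is None else now
    age_days = max(0.0, (now - message_ts) / 86400.0)
    recency = 0.5 ** (age_days / half_life_days)  # exponential decay with age
    bm25_norm = bm25 / (bm25 + 10.0)              # squash unbounded BM25 into [0, 1)
    return w_vec * vector_sim + w_bm25 * bm25_norm + w_recency * recency
```

With equal text and vector relevance, a fresh message outranks a year-old one thanks to the recency term, which is the behavior the `[vespa]` component provides.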
- Docker and Docker Compose
- Telegram API ID + hash and a phone number for login
- OpenAI API key (for embeddings/LLM)
- Optional: VoyageAI API key for reranking
- Clone and configure the repository:

  ```bash
  git clone <repo> telegram-rag
  cd telegram-rag
  cp .env.example .env
  ```

- Edit `.env` with your credentials. Required values include `APP_USER`, `APP_USER_HASH_BCRYPT`, `TELEGRAM_API_ID`, `TELEGRAM_API_HASH`, and `OPENAI_API_KEY`. Optional keys such as `VOYAGE_API_KEY` enable reranking.
- Install local tooling (recommended for contributors):

  ```bash
  make install        # api + indexer dependencies
  make ui-install     # ui dependencies
  pre-commit install  # enable formatting/lint hooks
  ```
To generate a bcrypt hash for the login password:

```bash
python - <<'PY'
import bcrypt; print(bcrypt.hashpw(b"your-password", bcrypt.gensalt()).decode())
PY
```

Then start the stack and the development UI:

```bash
docker compose up -d api vespa postgres indexer vespa-deploy
(cd ui && npm run dev)
```

- UI: http://localhost:4321 (development server)
- API: http://localhost:8000 (health probe at `/healthz`)
- Vespa status: http://localhost:19071/ApplicationStatus
```bash
docker compose up -d --build
```

The `vespa-deploy` service waits for Vespa to become healthy and then pushes `vespa/application` automatically.
- Near-live daemon: `python main.py` (no `--once`) attaches Telethon event handlers, replays the last few minutes on startup/reconnect, and streams new messages into Vespa.
- Initial backfill: progress is checkpointed per chat in `/sessions/backfill_state.json` (override with `--backfill-state-path`). The daemon resumes from the last stored `message_id` automatically.
- Hourly sweep: by default the daemon re-scans the last 7 days every 60 minutes to catch late edits; tune with `--hourly-sweep-days` and `--hourly-sweep-interval-minutes`.
- Telegram deletions are intentionally ignored: once ingested, messages remain searchable to preserve historical context.
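The checkpoint file format is internal to the indexer; as an illustration only, per-chat resume state could be managed roughly like this (the `BackfillState` helper and its field layout are hypothetical, not the project's actual implementation):

```python
import json
from pathlib import Path

class BackfillState:
    """Persist per-chat backfill progress so an interrupted run can resume."""

    def __init__(self, path: str = "/sessions/backfill_state.json") -> None:
        self.path = Path(path)
        self.state: dict = {}
        if self.path.exists():
            self.state = json.loads(self.path.read_text())

    def last_message_id(self, chat_id: str) -> int:
        # 0 means "never backfilled": start from the beginning of the chat.
        return self.state.get(chat_id, 0)

    def record(self, chat_id: str, message_id: int) -> None:
        # Keep the highest message_id seen so replays never move backwards.
        self.state[chat_id] = max(self.state.get(chat_id, 0), message_id)

    def flush(self) -> None:
        # Persist the JSON snapshot (called every N messages by the daemon).
        self.path.parent.mkdir(parents=True, exist_ok=True)
        self.path.write_text(json.dumps(self.state))
```

Because only the highest `message_id` per chat is stored, a restarted backfill simply continues from where the last `flush()` left off.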
- First full sync:

  ```bash
  docker compose run --rm indexer python main.py --once
  ```

- Target specific chats/dates (example):

  ```bash
  docker compose run --rm indexer python main.py --once \
    --chats '<Saved Messages>' --days 30 --limit-messages 50
  ```

After the backfill finishes, run the daemon without `--once` to stay current.
Common daemon tuning flags:
- `--daemon-lookback-minutes` (default 5): replay window on startup/reconnect.
- `--lookback-message-limit` (default 250): cap per-chat catch-up volume.
- `--backfill-checkpoint-interval` (default 50): persist JSON progress every N messages.
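The flags and defaults above can be sketched as an `argparse` fragment; this mirrors the documented defaults but is not a copy of the indexer's actual CLI setup:

```python
import argparse

def build_parser() -> argparse.ArgumentParser:
    """Daemon tuning flags with the defaults documented above (illustrative)."""
    parser = argparse.ArgumentParser(prog="main.py")
    parser.add_argument("--once", action="store_true",
                        help="run a single backfill pass instead of the daemon")
    parser.add_argument("--daemon-lookback-minutes", type=int, default=5,
                        help="replay window on startup/reconnect")
    parser.add_argument("--lookback-message-limit", type=int, default=250,
                        help="cap per-chat catch-up volume")
    parser.add_argument("--backfill-checkpoint-interval", type=int, default=50,
                        help="persist JSON progress every N messages")
    return parser
```

For example, `python main.py --daemon-lookback-minutes 10` would widen the replay window while keeping the other defaults.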
- Login at `/login` using the credentials from `.env`; an HTTP-only session cookie is issued.
- Core endpoints: `/models`, `/search`, and `/chat`. See `api/` for request/response schemas.
- Model labels shown in the UI map to OpenAI IDs defined in the environment variables (e.g., `gpt-5`, `gpt-5-mini`).
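As a hedged sketch of calling the API from a script: the payload fields below are hypothetical (the real request/response schemas live in `api/`), and the session cookie value would come from a prior login request:

```python
import json
import urllib.request

def build_search_request(query: str, session_cookie: str,
                         base_url: str = "http://localhost:8000") -> urllib.request.Request:
    """Build an authenticated POST to /search (payload field names are illustrative)."""
    payload = {"query": query, "limit": 10}  # hypothetical schema; see api/
    return urllib.request.Request(
        f"{base_url}/search",
        data=json.dumps(payload).encode(),
        headers={
            "Content-Type": "application/json",
            "Cookie": f"session={session_cookie}",  # HTTP-only cookie from login
        },
        method="POST",
    )

# Sending it: urllib.request.urlopen(build_search_request("vespa deploy", cookie))
```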
- Format and lint before committing: `make precommit` or `pre-commit run --all-files`
- Python tests (api + indexer): `make test-python`
- UI unit tests: `make test-ui`
- UI end-to-end tests: `make test-ui-e2e`
- Optional smoke checks against a running stack: `./scripts/smoke_tests.sh`
```
api/                FastAPI service (auth, models, search, chat)
indexer/            Telethon ingestion, chunking, embeddings, Vespa upserts
ui/                 Astro + React front-end
vespa/application/  Vespa schemas and services (auto-deployed)
scripts/            Helpers: deploy-vespa.sh, wait_for_health.sh, smoke_tests.sh
```
- Cannot log in: confirm `APP_USER` and `APP_USER_HASH_BCRYPT`; check the system clock for cookie expiry.
- Indexer stalled: ensure the Telethon `.session` file exists on the Docker volume and inspect `docker compose logs indexer` for rate limiting.
- Empty search results: verify the Vespa deployment (`docker compose logs vespa-deploy`) and that embeddings populated successfully.
- Rerank skipped: set `VOYAGE_API_KEY` and `RERANK_ENABLED=true`.
- Single-user authentication backed by bcrypt; cookies are HTTP-only.
- Secrets stay in `.env` and are never committed. Copy `.env.example` as a starting point.
- Telethon sessions and Postgres data live on Docker volumes you control.
- If exposing the stack, front it with TLS termination and consider IP allow-listing.
- `./scripts/wait_for_health.sh`: wait until the API and Vespa report healthy
- `./scripts/deploy-vespa.sh`: redeploy the Vespa package manually
- `./scripts/smoke_tests.sh`: basic functional checks against a running stack
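The wait logic in `wait_for_health.sh` amounts to polling until a health probe succeeds. A minimal Python equivalent of that retry loop, where the probe function and timings are assumptions rather than the script's actual behavior:

```python
import time
from typing import Callable

def wait_until_healthy(probe: Callable[[], bool],
                       timeout_s: float = 120.0,
                       interval_s: float = 2.0,
                       sleep: Callable[[float], None] = time.sleep,
                       clock: Callable[[], float] = time.monotonic) -> bool:
    """Poll `probe` until it returns True or `timeout_s` elapses."""
    deadline = clock() + timeout_s
    while clock() < deadline:
        if probe():
            return True
        sleep(interval_s)  # back off between attempts
    return False

# A real probe might GET http://localhost:8000/healthz (or the Vespa
# ApplicationStatus URL) and check for an HTTP 200 response.
```

Injecting `sleep` and `clock` keeps the loop testable without real waiting, which is also why the probe is passed in rather than hard-coded.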
For additional implementation details, browse the relevant module directories noted above.