Context Optimization

Two things matter for a long-running Claude session: keeping the context window lean so tokens go to your code, and handling the moments when it fills up anyway.

With 1M context windows (API subscribers on Team and Enterprise get this on all models; Max plan users can opt into 1M per-row on Opus rows — see Smart Model Routing), compaction is rare — most sessions complete well within the available context. Pilot Shell's strategies focus on staying lean, and making compaction and parallel work painless when they happen.

Keeping context lean

Strategy	Savings	How
RTK proxy	60–90%	Rewrites dev tool output (`git status`, `npm test`, etc.) to remove noise before it enters the context window
Semble code search	~98%	Returns only the matched chunks instead of dumping whole files — Semble's own benchmark shows ~98% fewer tokens than `grep + read` at 94% recall
Conditional rule loading	Variable	Coding standards load only for matching file types — Python rules don't load when editing TypeScript
Progressive skill disclosure	~90%	Skill frontmatter (~100 tokens) loads always; full SKILL.md loads only on activation; linked files load on demand
Scoped MCP tools	Variable	MCP tool schemas are lazy-loaded via `ToolSearch` — only fetched when needed, not preloaded
Routing hooks	Variable	PreToolUse hooks block `curl`/`wget`/built-in `WebFetch` and redirect to the dedicated web-fetch MCP, so large pages don't dump into context

Status line display

The status line shows context usage as a visual progress bar:

Opus 4.7 [1M] | █████░▓ 60% | ...

Claude Code reserves ~16.5% of the context window as a compaction buffer, triggering auto-compaction at ~83.5% raw usage. Pilot Shell rescales this to an effective 0–100% range so the bar fills naturally to 100% right before compaction fires. A ▓ indicator shows the reserved zone. The monitor warns at ~80% effective (informational) and ~90%+ effective (caution).

When compaction fires

On 200K windows, compaction happens more often. Pilot Shell preserves state automatically across the three lifecycle events:

PreCompact → Compact → SessionStart(compact)

PreCompact — pre_compact.py captures active plan, task list, recent decisions, and key context to Pilot Shell Console memory.
Compact — Claude Code summarizes conversation history while preserving recent tool calls and flow.
SessionStart(compact) — post_compact_restore.py re-injects the active plan path, task state, and key decisions. Work resumes seamlessly.

Memory observations (decisions, discoveries, bugfixes) persist independently in SQLite — they survive compaction regardless of hooks.

Don't rush the current task

Context limits are not an emergency — auto-compaction preserves everything and resumes cleanly. Finish the current task with full quality. The only thing that matters is the output, not the context percentage.

Running parallel sessions

Multiple Pilot Shell sessions can run on the same project without interference. Each session has its own context window, task list, and plan state. The Console dashboard tracks every active session so you can jump between them.

Keeping context lean​

Status line display​

When compaction fires​

Running parallel sessions​

Keeping context lean

Status line display

When compaction fires

Running parallel sessions