CodingBuddy is a terminal-native coding agent written in Rust. It combines chat, planning, tool execution, patch workflows, indexing, and long-running autopilot loops in one cross-platform CLI.
- Agent intelligence: Adaptive complexity (Simple/Medium/Complex), planning protocols, error recovery, stuck detection
- Agent profiles: Task-specialized tool filtering (build/explore/plan) — reduces decision space for weaker models
- Doom loop detection: Detects and breaks repeated identical tool calls with corrective guidance
- Model-tier prompts: Separate system prompts optimized for `deepseek-chat` (action-biased) vs `deepseek-reasoner` (thinking-leveraging)
- Bootstrap context: Automatic project awareness (file tree, git status, repo map, dependency analysis) on first turn
- Per-turn retrieval: Vector + BM25 code search with RRF fusion — fires every turn, not just the first
- Privacy scanning: 3-layer secret detection (path/content/builtin patterns) with redaction on tool outputs
- Ghost text: Local ML-powered inline completions in the TUI (Tab to accept, Alt+Right for one word)
- Multi-provider: DeepSeek default, any OpenAI-compatible endpoint (GLM-5, Qwen, Ollama, OpenRouter, etc.) via setup wizard or `--model` flag
- Model routing: Automatically routes complex tasks to `deepseek-reasoner`, simple tasks to `deepseek-chat`
- Step snapshots: Per-tool-call file snapshots with content hashing for fine-grained undo
- 14 lifecycle hooks: SessionStart through TaskCompleted — extend behavior at every stage
- Interactive TUI: Vim mode, @file autocomplete, !bash prefix, syntax highlighting, keyboard shortcuts
- Skills & subagents: Forked execution, worktree isolation, custom agent definitions
- MCP integration: JSON-RPC stdio/http transports, prompts as slash commands
- Permission engine: 7 modes, glob allowlist/denylist, team-managed policy overlays
- Session persistence: JSONL event log + SQLite projections, deterministic replay
- LLM compaction: Structured LLM-based conversation compaction preserving goals, progress, and findings
- Local ML (opt-in): Candle-powered embeddings and code completion, HNSW vector index — runs fully offline
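The per-turn retrieval feature fuses vector and BM25 result lists with Reciprocal Rank Fusion (RRF). The following is a minimal std-only sketch of RRF, not CodingBuddy's actual implementation; the function name and the conventional `k = 60` constant are illustrative:

```rust
use std::collections::HashMap;

/// Fuse several ranked result lists with Reciprocal Rank Fusion:
/// score(d) = sum over lists of 1 / (k + rank_of_d_in_list), 1-based ranks.
fn rrf_fuse(rankings: &[Vec<&str>], k: f64) -> Vec<(String, f64)> {
    let mut scores: HashMap<String, f64> = HashMap::new();
    for list in rankings {
        for (rank, doc) in list.iter().enumerate() {
            *scores.entry(doc.to_string()).or_insert(0.0) += 1.0 / (k + (rank + 1) as f64);
        }
    }
    let mut fused: Vec<(String, f64)> = scores.into_iter().collect();
    fused.sort_by(|a, b| b.1.partial_cmp(&a.1).unwrap());
    fused
}

fn main() {
    let vector_hits = vec!["auth.rs", "session.rs", "token.rs"];
    let bm25_hits = vec!["token.rs", "auth.rs", "config.rs"];
    let fused = rrf_fuse(&[vector_hits, bm25_hits], 60.0);
    // "auth.rs" places 1st and 2nd across the two lists, so it fuses highest.
    assert_eq!(fused[0].0, "auth.rs");
}
```

RRF needs only ranks, not comparable scores, which is why it is a common choice for combining a vector index with BM25.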
CodingBuddy is a Rust workspace organized into focused crates:
| Crate | Role |
|---|---|
| `codingbuddy-cli` | CLI dispatch, argument parsing, 24 subcommand handlers |
| `codingbuddy-agent` | Agent engine, tool-use loop, complexity classifier, prompt construction, team mode |
| `codingbuddy-core` | Shared types (AppConfig, ChatRequest, StreamChunk, EventEnvelope), config loading, multi-provider config |
| `codingbuddy-llm` | LLM client (LlmClient trait), streaming, prompt cache |
| `codingbuddy-tools` | Tool definitions (enriched descriptions), plugin manager, shell runner, sandbox wrapping |
| `codingbuddy-policy` | Permission engine (denylist/allowlist), approval gates, output scanner, ManagedSettings |
| `codingbuddy-hooks` | 14 lifecycle events, HookRuntime, once/disabled fields, PermissionDecision |
| `codingbuddy-local-ml` | Local ML via Candle: embeddings, completion, chunking, vector index, hybrid retrieval, privacy router |
| `codingbuddy-store` | Session persistence (JSONL event log + SQLite projections) |
| `codingbuddy-memory` | Long-term memory, shadow commits, checkpoints, step snapshots |
| `codingbuddy-index` | Full-text code index (Tantivy), RAG retrieval with citations |
| `codingbuddy-mcp` | MCP server management (JSON-RPC stdio/http transports) |
| `codingbuddy-ui` | TUI rendering (ratatui/crossterm), autocomplete, vim mode, ML ghost text |
| `codingbuddy-diff` | Unified diff parsing, patch staging, and git-apply |
| `codingbuddy-context` | Context enrichment, dependency analysis (petgraph), file relevance scoring |
| `codingbuddy-skills` | Skill discovery, forked execution, frontmatter parsing |
| `codingbuddy-subagent` | Background tasks, worktree isolation, custom agent definitions |
| `codingbuddy-observe` | Structured logging |
| `codingbuddy-jsonrpc` | JSON-RPC server for IDE integration |
| `codingbuddy-chrome` | Chrome native host bridge |
| `codingbuddy-errors` | Error types |
| `codingbuddy-testkit` | Test utilities |
The default execution mode is the tool-use loop (think → act → observe):

```
User → LLM (with tools) → Tool calls → Results → LLM → ... → Final response
```
- The LLM freely decides which tools to call (file read/write, shell, search, etc.)
- Tools execute locally with policy-gated approval for write operations
- Checkpoints are created before destructive tool calls
- The loop continues until the LLM responds without tool calls (task complete)
- Thinking mode (`deepseek-reasoner`) can be enabled for complex reasoning
- Adaptive complexity: Simple/Medium/Complex classification with thinking budget escalation (8K→64K)
- Agent profiles: task-type tool filtering (build/explore/plan) reduces the model's decision space
- Doom loop detection: rolling hash window detects 3+ identical tool calls and injects corrective guidance
- Model-tier prompts: `deepseek-chat` gets action-biased instructions, `deepseek-reasoner` gets thinking-leveraging ones
- Bootstrap context: automatic project awareness (tree, git status, repo map, manifests) on first turn
- Per-turn retrieval: vector + BM25 search with RRF, fires every turn with remaining-budget awareness
- LLM compaction: structured LLM-based summary (Goal/Completed/In Progress/Key Facts/Findings/Modified Files) with code-based fallback
- Step snapshots: before/after file state captured per tool call with SHA-256 hashing and revert support
- Error recovery: automatic guidance injection on failures, stuck detection after repeated errors
- Model routing: Complex and escalated tasks route to `deepseek-reasoner` automatically
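The doom loop detector above can be sketched with a rolling window of hashed tool calls. This is an illustrative std-only sketch, not the crate's actual code; the window size and the `DoomLoopDetector` name are assumptions, while the 3-repeat threshold comes from the description:

```rust
use std::collections::hash_map::DefaultHasher;
use std::collections::VecDeque;
use std::hash::{Hash, Hasher};

/// Keeps a rolling window of hashed tool calls and flags a "doom loop"
/// when the same call (name + arguments) repeats `threshold` times.
struct DoomLoopDetector {
    window: VecDeque<u64>,
    window_size: usize,
    threshold: usize,
}

impl DoomLoopDetector {
    fn new(window_size: usize, threshold: usize) -> Self {
        Self { window: VecDeque::new(), window_size, threshold }
    }

    /// Record a tool call; returns true when corrective guidance should be injected.
    fn record(&mut self, tool_name: &str, args_json: &str) -> bool {
        let mut h = DefaultHasher::new();
        (tool_name, args_json).hash(&mut h);
        let call_hash = h.finish();

        self.window.push_back(call_hash);
        if self.window.len() > self.window_size {
            self.window.pop_front();
        }
        // Count occurrences of this exact call within the rolling window.
        self.window.iter().filter(|&&x| x == call_hash).count() >= self.threshold
    }
}

fn main() {
    let mut det = DoomLoopDetector::new(8, 3);
    assert!(!det.record("read_file", r#"{"path":"src/main.rs"}"#));
    assert!(!det.record("read_file", r#"{"path":"src/main.rs"}"#));
    // Third identical call trips the detector.
    assert!(det.record("read_file", r#"{"path":"src/main.rs"}"#));
    // A call with different arguments does not.
    assert!(!det.record("read_file", r#"{"path":"src/lib.rs"}"#));
}
```

Hashing name plus serialized arguments means only exact repeats count; a model that varies its arguments between calls is still making progress and is left alone.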
macOS/Linux:

```sh
curl -fsSL https://raw.githubusercontent.com/aloutndoye/deepseek-cli/main/scripts/install.sh | bash -s -- --version latest
```

Windows (PowerShell):

```powershell
$script = Join-Path $env:TEMP "deepseek-install.ps1"
Invoke-WebRequest https://raw.githubusercontent.com/aloutndoye/deepseek-cli/main/scripts/install.ps1 -OutFile $script
& $script -Version latest
```

Build from source:

```sh
cargo build --release --bin codingbuddy
./target/release/codingbuddy --help
```

Set your API key and start chatting:

```sh
export DEEPSEEK_API_KEY="<your-api-key>"
codingbuddy chat
```

Non-interactive examples:

```sh
codingbuddy ask "Summarize this repository"
codingbuddy plan "Implement feature X and list risks"
codingbuddy autopilot "Execute plan and verify tests" --hours 2
```

Credential behavior:
- If the API key is missing in interactive TTY mode, CodingBuddy prompts for it before first model use.
- You can persist the key for the current workspace in `.codingbuddy/settings.local.json`.
- In non-interactive or JSON mode, missing credentials fail fast.
- `codingbuddy config show` redacts `llm.api_key`.
Run `codingbuddy --help` for full details. The most used commands are:

- `codingbuddy chat`: interactive chat session (TUI when enabled and running in a TTY)
- `codingbuddy ask "<prompt>"`: one-shot response
- `codingbuddy plan "<prompt>"`: generate a plan without running it
- `codingbuddy run [session-id]`: continue execution for a session
- `codingbuddy autopilot "<prompt>"`: bounded unattended loop
- `codingbuddy review --diff|--staged|--pr <n>`: code review workflows
- `codingbuddy exec "<command>"`: policy-enforced shell execution
- `codingbuddy diff`, `codingbuddy apply`, `codingbuddy rewind`: patch/checkpoint lifecycle
- `codingbuddy status`, `codingbuddy context`, `codingbuddy usage`: runtime and cost visibility
- `codingbuddy index build|update|status|watch|query`: local code index operations
- `codingbuddy permissions show|set|dry-run`: safety policy inspection and tuning
- `codingbuddy replay run|list`: deterministic replay tooling
- `codingbuddy search "<query>"`: web search with provenance metadata
- `codingbuddy setup`: interactive setup wizard (provider, API key, local ML, privacy)
Useful global flags:
- `--json`: machine-readable output
- `--permission-mode ask|auto|plan`: per-run permission mode override
Plan and execute:

```sh
codingbuddy plan "Refactor auth middleware and keep behavior unchanged"
codingbuddy run
```

Autopilot with live status:

```sh
codingbuddy autopilot "Fix flaky tests in crates/codingbuddy-cli" --hours 1
codingbuddy autopilot status --follow
```

Review staged changes:

```sh
codingbuddy review --staged --focus correctness
```

Publish strict review findings to a PR:

```sh
codingbuddy review --pr 123 --publish --max-comments 20
```

Generate shell completions:

```sh
codingbuddy completions --shell zsh > ~/.zsh/completions/_codingbuddy
```
1. `~/.codingbuddy/settings.json` (user — all projects)
2. `.codingbuddy/settings.json` (project — shared with team)
3. `.codingbuddy/settings.local.json` (local overrides — gitignore this)
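The layered "later wins" behavior amounts to a recursive deep merge: nested tables merge key by key, and leaf values from the later layer replace earlier ones. A minimal std-only sketch, with a toy `Value` type standing in for parsed JSON (the actual implementation lives in `codingbuddy-core` and this is only an illustration):

```rust
use std::collections::BTreeMap;

/// A tiny nested config value: either a leaf string or a table of sub-keys.
#[derive(Clone, Debug, PartialEq)]
enum Value {
    Str(String),
    Table(BTreeMap<String, Value>),
}

/// Deep-merge `overlay` into `base`: tables merge recursively,
/// leaves (or type mismatches) from the later layer win.
fn merge(base: &mut BTreeMap<String, Value>, overlay: BTreeMap<String, Value>) {
    for (key, val) in overlay {
        match (base.remove(&key), val) {
            (Some(Value::Table(mut b)), Value::Table(o)) => {
                merge(&mut b, o);
                base.insert(key, Value::Table(b));
            }
            (_, v) => {
                base.insert(key, v);
            }
        }
    }
}

fn main() {
    let leaf = |s: &str| Value::Str(s.to_string());
    // User layer sets two llm keys; the project layer overrides one of them.
    let mut cfg = BTreeMap::new();
    merge(&mut cfg, BTreeMap::from([(
        "llm".to_string(),
        Value::Table(BTreeMap::from([
            ("base_model".to_string(), leaf("deepseek-chat")),
            ("context_window_tokens".to_string(), leaf("128000")),
        ])),
    )]));
    merge(&mut cfg, BTreeMap::from([(
        "llm".to_string(),
        Value::Table(BTreeMap::from([("base_model".to_string(), leaf("deepseek-reasoner"))])),
    )]));

    if let Some(Value::Table(llm)) = cfg.get("llm") {
        assert_eq!(llm.get("base_model"), Some(&leaf("deepseek-reasoner")));
        // The sibling key from the earlier layer survives the merge.
        assert_eq!(llm.get("context_window_tokens"), Some(&leaf("128000")));
    }
}
```

The key property is that a local override file only needs to contain the keys it changes; everything else falls through from the project and user layers.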
```sh
codingbuddy config show   # View merged config (keys redacted)
codingbuddy config edit   # Open in editor
```

Key settings:
```json
{
  "llm": {
    "base_model": "deepseek-chat",
    "max_think_model": "deepseek-reasoner",
    "context_window_tokens": 128000,
    "api_key_env": "DEEPSEEK_API_KEY"
  },
  "agent_loop": {
    "tool_loop_max_turns": 50,
    "context_bootstrap_enabled": true
  },
  "policy": {
    "approve_edits": "ask",
    "approve_bash": "ask",
    "allowlist": ["cargo *", "npm test", "git status"]
  },
  "local_ml": {
    "enabled": false
  }
}
```

See docs/CONFIG_REFERENCE.md for all options.
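Allowlist entries like `"cargo *"` are shell-command globs. A std-only sketch of how such patterns might be checked; the single-wildcard matcher, the `decide` helper, and the deny-before-allow precedence are illustrative assumptions, not CodingBuddy's actual policy engine:

```rust
/// Match a command against a policy pattern where `*` matches any
/// remaining text (a simplified glob, enough for entries like "cargo *").
fn glob_match(pattern: &str, command: &str) -> bool {
    match pattern.split_once('*') {
        // No wildcard: require an exact match.
        None => pattern == command,
        // One wildcard: require the literal prefix (and suffix, if any).
        Some((prefix, suffix)) => {
            command.starts_with(prefix)
                && command.len() >= prefix.len() + suffix.len()
                && command.ends_with(suffix)
        }
    }
}

/// Denylist is checked first, then the allowlist; everything else asks.
fn decide(command: &str, allow: &[&str], deny: &[&str]) -> &'static str {
    if deny.iter().any(|p| glob_match(p, command)) {
        "deny"
    } else if allow.iter().any(|p| glob_match(p, command)) {
        "allow"
    } else {
        "ask"
    }
}

fn main() {
    let allow = ["cargo *", "npm test", "git status"];
    let deny = ["* --force"];
    assert_eq!(decide("cargo build --release", &allow, &deny), "allow");
    assert_eq!(decide("git status", &allow, &deny), "allow");
    assert_eq!(decide("rm -rf /", &allow, &deny), "ask");
    assert_eq!(decide("git push --force", &allow, &deny), "deny");
}
```

Defaulting unmatched commands to "ask" keeps the policy fail-safe: anything not explicitly trusted or forbidden requires interactive approval.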
CodingBuddy ships an optional local intelligence layer: hybrid code retrieval, privacy scanning, and ghost text completions. No models are bundled; they download from HuggingFace on first use.
```sh
# Enable (mock backends, no download needed)
echo '{"local_ml": {"enabled": true}}' > .codingbuddy/settings.json

# Enable with real ML models (requires a --features local-ml build)
cargo build --release --bin codingbuddy --features local-ml
```

| Feature | Mock Mode | Full ML Mode |
|---|---|---|
| Code retrieval | Hash-based matching | Semantic (jina-code-v2, ~270MB) |
| Ghost text | Empty | Candle completion (~700MB–1.5GB) |
| Privacy scanning | Fully functional | Fully functional |
| Vector index | BruteForce O(n) | HNSW via Usearch |
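The three detection layers (path patterns, content patterns, builtin patterns) can be pictured as successive filters over tool output. This std-only sketch is purely illustrative: the specific paths, key names, and token prefixes below are toy examples, not the real pattern sets from the privacy scanner:

```rust
/// A simplified sketch of layered secret detection with redaction:
/// 1) sensitive file paths, 2) secret-looking key=value content,
/// 3) builtin well-known credential prefixes.
fn redact(path: &str, text: &str) -> String {
    // Layer 1: path patterns, treating whole files as sensitive.
    if path.ends_with(".pem") || path.ends_with(".env") || path.contains("id_rsa") {
        return "[REDACTED: sensitive file]".to_string();
    }
    text.lines()
        .map(|line| {
            let lower = line.to_lowercase();
            // Layer 2: content patterns, assignments to secret-looking keys.
            let secret_key = ["api_key", "password", "secret", "token"]
                .iter()
                .any(|k| lower.contains(k) && line.contains('='));
            // Layer 3: builtin patterns, well-known credential prefixes.
            let known_prefix = ["AKIA", "ghp_", "sk-"].iter().any(|p| line.contains(p));
            if secret_key || known_prefix {
                "[REDACTED]".to_string()
            } else {
                line.to_string()
            }
        })
        .collect::<Vec<_>>()
        .join("\n")
}

fn main() {
    assert_eq!(redact("deploy/key.pem", "anything"), "[REDACTED: sensitive file]");
    let out = redact("app.cfg", "host=localhost\napi_key=abc123");
    assert_eq!(out, "host=localhost\n[REDACTED]");
}
```

Redacting per line rather than per file keeps the surrounding tool output useful to the model while the secret itself never reaches the provider.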
```sh
codingbuddy privacy scan            # Find secrets in your project
codingbuddy privacy redact-preview  # Preview what gets redacted
codingbuddy index build             # Build hybrid index
codingbuddy index --hybrid doctor   # Diagnose index issues
```

See docs/LOCAL_ML_GUIDE.md for the full guide.
```sh
cargo fmt --all -- --check
cargo clippy --workspace --all-targets -- -D warnings
cargo test --workspace --all-targets    # 974 tests
cargo build --release --bin codingbuddy
```

- User Guide — complete guide: chat modes, keyboard shortcuts, slash commands, tools, intelligence layers, permissions, hooks, skills, subagents, MCP, local ML, sessions, autopilot, code review, configuration, troubleshooting
- Local ML Guide — setup, models, retrieval, privacy, ghost text
- Configuration Reference — all settings with types and defaults
- Operations Playbook — incident response, failure modes, rollback
- Release Guide — release process, artifacts, installers