MU

Your codebase, understood.

"Where's the auth code?" "What breaks if I change this?" "Why does this file exist?"
MU answers in seconds.

MU builds a semantic graph of your codebase and provides fast queries, semantic search, and diff analysis. Feed your entire codebase to an AI in seconds, not hours.

The Problem

You know the drill:

"Paste the codebase into Claude" → token limit exceeded
"Where's the authentication code?" → grep, grep, grep...
"What depends on this?" → ¯\_(ツ)_/¯
"Read the documentation" → what documentation?

LLMs choke on large codebases. Context windows are precious. 90% of code is boilerplate. You're feeding syntax when you need semantics.

The Solution

MU parses your codebase into a semantic graph database. Fast queries. Semantic search. Intelligent context extraction.

Input:  66,493 lines of Python
Output: 2,173 tokens (mu compress)
Result: LLM correctly answers architectural questions

Feature Reality Check

Tested on real codebases. Here's what actually works:

Feature	Rating	Notes
`mu c` (compress)	★★★★★	Best feature - structured compression with sigils
`mu q` (SQL mode)	★★★★	Full SQL works great
`mu search`	★★★★	Fast semantic search (~115ms), good relevance
`mu path`	★★★★	Shows dependency paths between nodes
`mu ancestors`	★★★	Good for functions, weaker for classes
`mu impact`	★★★	Shows downstream effects
`mu sus`	★★★	Finds untested security code
`mu export`	★★★	Mermaid/D2/JSON export
`mu q` (terse)	★★	Terse syntax is unreliable - use SQL instead
`mu grok`	★★	Slower than search, less useful
`mu omg`	★	Unstructured output - use `mu c` instead

Validated Results

Codebase	Files	Lines	MU Nodes	Export LOC	Bootstrap
Next.js 14 App (TypeScript)	1,112	406k	4,471	16k	63s
.NET 8 Microservices (C#)	796	147k	6,843	22k	117s

Real test: Fed MU output to Claude, asked architectural questions — correctly explained service patterns, identified dependencies, found complexity hotspots.

Installation

# Build from source (requires Rust 1.70+)
git clone https://github.com/0ximu/mu.git
cd mu
cargo build --release

# Binary is at: ./target/release/mu
# Optionally, copy to PATH:
sudo cp target/release/mu /usr/local/bin/

Binary releases are also available for standalone deployment. See Releases for platform-specific binaries.

Quick Start

# 1. Bootstrap with embeddings (recommended)
mu bootstrap --embed              # Builds graph + generates embeddings

# 2. Compress for LLMs (the killer feature)
mu c > codebase.txt               # Feed entire codebase to Claude/GPT

# 3. Semantic search (fast, ~115ms)
mu search "webhook processing"    # 90% relevance scores

# 4. SQL queries (use full SQL, not terse syntax)
mu query "SELECT name, complexity FROM functions WHERE complexity > 30 ORDER BY complexity DESC"

# 5. Dependency analysis
mu path AuthService UserRepo      # How are these connected?
mu impact TransactionService      # What breaks if I change this?

That's it. Your codebase is now understood.

CLI Commands

Core Commands

mu bootstrap                      # Initialize and build graph database
mu bootstrap --embed              # Build graph + generate embeddings (recommended)
mu status                         # Show project status
mu doctor                         # Run health checks on MU installation

Compress (The Killer Feature)

Feed your entire codebase to an LLM in seconds. MU compresses your code into a hierarchical, star-ranked format that preserves semantic structure while minimizing tokens.

mu compress                       # Compress codebase to stdout
mu compress > codebase.txt        # Save for LLM consumption
mu c                              # Alias

# Detail levels
mu compress --detail low          # Minimal: just structure
mu compress --detail medium       # Default: structure + hot paths + core entities
mu compress --detail high         # Full: everything including relationship clusters

# Output to file
mu compress -o context.mu         # Write directly to file

Why this is the best feature:

Sigil notation: ! modules, $ classes, # functions
Complexity scores: c=14 shows cyclomatic complexity
Call counts: calls=11 shows how often it's called
Importance stars: ★★★ for high-connectivity nodes
Hierarchical structure that mirrors actual code organization

Example output:

# MU v2.0 - Compressed Codebase
# 42 modules, 15 classes, 128 functions, 245 edges

## Domain Overview
### Core Entities
$ AuthService  [★★★]
  @attrs [user_repo, token_manager]
  → Database (uses), UserRepo (calls)
  ← LoginHandler, ApiMiddleware

## Hot Paths (complexity > 20 or calls > 5)
  # process_request  c=35  calls=12 ★★
    | src/handlers/api.rs

This preserves semantic structure for LLMs, not just random token soup.

Graph Analysis

mu deps <node>                    # Show dependencies of a node
mu deps <node> -r                 # Show reverse dependencies (dependents)
mu path <from> <to>               # Find path between nodes (★★★★ actually useful)
mu impact <node>                  # Find downstream impact (what breaks if this changes)
mu ancestors <node>               # Find upstream (works best for functions)
mu cycles                         # Detect circular dependencies

Search & Discovery

mu search "query"                 # Semantic search - fast (~115ms), good relevance
mu patterns                       # Detect code patterns
mu read <file>                    # Read and display a file with MU context

Tip: Run mu bootstrap --embed to enable semantic search. It actually works well - "webhook processing" returns 90% match to WebhookService.cs.

MUQL Queries

Use full SQL syntax. The terse syntax (fn c>50) is unreliable.

# Full SQL syntax (recommended - this works)
mu query "SELECT * FROM functions WHERE complexity > 10"
mu query "SELECT name, complexity FROM functions ORDER BY complexity DESC LIMIT 20"
mu query "SELECT name FROM functions WHERE name LIKE '%Create%'"

# Aggregations work too
mu query "SELECT file_path, COUNT(*) FROM functions GROUP BY file_path ORDER BY 2 DESC"

# Interactive REPL
mu query -i

Note: Terse syntax (mu q "fn c>50") often returns no results. Stick with full SQL.

Semantic Diff & History

mu diff main HEAD                 # Semantic diff between git refs
mu diff HEAD~5 HEAD               # Last 5 commits
mu history <node>                 # Show change history for a node

Embeddings

mu embed                          # Generate embeddings (incremental)
mu embed --force                  # Regenerate all embeddings
mu embed --status                 # Show embedding coverage status

Export Formats

For diagram generation and data interchange:

mu export -F json                 # JSON graph export
mu export -F mermaid              # Mermaid diagram
mu export -F d2                   # D2 diagram
mu export -F json -l 100          # Limit to 100 nodes

Vibes

Fun aliases that do real work:

mu yolo <node>     # Impact analysis with personality
                   # → "Low impact. Go ahead, YOLO!"

mu sus             # Find suspicious patterns
                   # → Security-sensitive code without tests
                   # → High complexity functions

mu wtf <file>      # Git archaeology
                   # → Original author, evolution history
                   # → Co-changed files, related issues

mu vibe            # Naming convention check
                   # → snake_case vs camelCase consistency

mu zen             # Clear caches, reset state

Skip mu omg - it outputs unstructured token soup. Use mu c instead.

Configuration

Create .murc.toml in your project:

[mu]
exclude = ["vendor/", "node_modules/", ".git/", "__pycache__/"]

Supported Languages

Language	Status
Python	Full support
TypeScript	Full support
JavaScript	Full support
Go	Full support
Rust	Full support
Java	Full support
C#	Full support

Node Identifiers

MU uses prefixed identifiers to distinguish node types:

Prefix	Type	Example
`mod:`	Module/File	`mod:src/lib/utils.ts`
`cls:`	Class	`cls:src/models/User.ts:User`
`fn:`	Function	`fn:src/api/auth.ts:login`
`ext:`	External dependency	`ext:react`

When using commands like mu deps, you can use:

Short name: mu deps UserService (fuzzy match)
Full ID: mu deps mod:src/services/UserService.ts (exact match)

Language Notes

Language	Import Resolution
TypeScript/JS	File-based with path alias support (`@/`, `~/`)
Python	Module-based with relative imports
C#	Namespace-based (e.g., `using System.Net`)
Go	Package-based
Java	Package-based
Rust	Module-based with `use` statements

TypeScript Path Aliases: MU automatically reads tsconfig.json or jsconfig.json to resolve path aliases like @/lib/utils.

How It Works

Source Code → Scanner → Parser → Graph Builder → DuckDB
                │          │          │
            manifest    AST       nodes &
                       data       edges

Scanner: Walks filesystem, detects languages, filters noise
Parser: Tree-sitter extracts AST (classes, functions, imports)
Graph Builder: Creates nodes and edges for code relationships
DuckDB: Stores the semantic graph for fast SQL queries

Embeddings (MU-SIGMA-V2)

MU includes a bundled BERT model trained on code for semantic search:

Dimension: 384
Max sequence: 512 tokens
Pooling: Mean
No API keys required: Runs locally

Development

# Build
cargo build --release

# Run tests
cargo test

# Format
cargo fmt

# Lint
cargo clippy

Known Limitations

Terse Query Syntax

The terse MUQL syntax (fn c>50, deps Auth d2) is unreliable and often returns no results. Use full SQL syntax instead.

Test Coverage Detection

The mu sus command detects test coverage by looking for test files in the scanned directory tree. If your tests live in a sibling directory (e.g., src/ and tests/ at the same level), MU won't find them.

Workaround: Run mu bootstrap from the parent directory, or configure test paths in .murc.toml:

[scanner]
# Include sibling test directory
include_paths = ["../tests"]

Ancestors for Classes

mu ancestors works well for functions but is less useful for classes/services. Target specific functions instead.

.NET Solutions

For .NET projects with a solution file at root and code in src/, run MU from the src/ directory:

cd src && mu bootstrap --embed

Monorepo Support

MU works best when run from the directory containing your source code. For monorepos, bootstrap each project separately or run from the root with appropriate excludes.

Troubleshooting

"No supported files found"

MU didn't find parseable files in the current directory. Check:

You're in the right directory (source code, not project root)
Files aren't excluded by .murc.toml or .gitignore
Language is supported (run mu doctor)

Database errors

rm -rf .mu && mu bootstrap

Embeddings not working

mu embed --status  # Check coverage
mu embed --force   # Regenerate all

Check what's in the database

duckdb .mu/mubase "SELECT type, COUNT(*) FROM nodes GROUP BY type"
duckdb .mu/mubase "SELECT type, COUNT(*) FROM edges GROUP BY type"

Roadmap

Contributing

Contributions welcome! We're building something genuinely useful here.

cargo fmt && cargo clippy && cargo test  # The holy trinity

See CONTRIBUTING.md for details.

License

Apache License 2.0 - see LICENSE for details.

MU: Because life's too short to grep through 500k lines of code.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.github		.github
mu-cli		mu-cli
mu-core		mu-core
mu-daemon		mu-daemon
mu-embeddings		mu-embeddings
scripts		scripts
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
LICENSE		LICENSE
QUICKSTART.md		QUICKSTART.md
README.md		README.md
codebase.txt		codebase.txt

License

0ximu/mu

Folders and files

Latest commit

History

Repository files navigation

MU

The Problem

The Solution

Feature Reality Check

Validated Results

Installation

Quick Start

CLI Commands

Core Commands

Compress (The Killer Feature)

Graph Analysis

Search & Discovery

MUQL Queries

Semantic Diff & History

Embeddings

Export Formats

Vibes

Configuration

Supported Languages

Node Identifiers

Language Notes

How It Works

Embeddings (MU-SIGMA-V2)

Development

Known Limitations

Terse Query Syntax

Test Coverage Detection

Ancestors for Classes

.NET Solutions

Monorepo Support

Troubleshooting

"No supported files found"

Database errors

Embeddings not working

Check what's in the database

Roadmap

Contributing

License

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Contributors 2

Uh oh!

Languages

Packages