VirtuDirector IA

VirtuDirector IA is a local-first FastAPI application for SME AI consulting and prototyping. It combines:

a guided operator workspace for SME diagnosis,
tenant-scoped document ingestion and retrieval,
deterministic opportunity diagnosis,
executive proposal generation,
implementation bundle generation,
curated intelligence ingestion and ranked retrieval,
local/LAN model routing through LM Studio,
optional premium escalation through customer-owned APIs or CLIs,
and an FDE Labs subsystem that evaluates changes before they are promoted.

This repository is not a generic chatbot template. It is an operational product and engineering lab for deciding where AI should be introduced, packaging pilots, and governing runtime choices inside small and medium-sized businesses.

License: MIT. See LICENSE.

60-second demo

Describe an SME, department, or process.
Run AI opportunity diagnosis.
Generate an executive proposal for management.
Generate an implementation bundle for the selected pilot.
Create a tracked pilot with tasks, status, risks, and success metrics.
Route work through local or optional premium models according to tenant policy.
Validate system improvements in FDE Labs before promotion.

For a complete text-only demo script, see docs/DEMO_WALKTHROUGH.md.

Who this is for

AI consultants delivering practical SME implementations.
Forward-deployed engineers building local-first AI systems.
SME digital transformation teams that need a structured AI adoption workflow.
Local AI implementers working with sensitive documents.
Legal, healthcare, real estate, accounting, public-sector, and office automation prototypes.

Repository purpose

The repository solves six practical problems:

Ingest client documents into a tenant-scoped knowledge layer.
Answer AI implementation questions using deterministic scoring plus curated knowledge.
Turn ranked opportunities into executive proposals and implementation bundles.
Route work across local and optional premium providers under tenant runtime policy.
Run measured lab experiments on RAG, routing, workflows, GRC, ROI, and market intelligence.
Require human approval before promoting a measured improvement.

What is included

Core application

backend/app/main.py: FastAPI entrypoint.
backend/app/api/: HTTP routes.
backend/app/core/: orchestration, opportunity scoring, executive proposals, process scanner, routing, runtime policy.
backend/app/rag/: embeddings, ingestion, retriever, local store.
backend/app/knowledge/: curated knowledge ingestion, compaction, retrieval, ranking.
backend/app/security/: PII redaction, sensitivity classification, audit helpers.
backend/app/tools/: web search, LM Studio, and premium CLI integration.
backend/app/static/: operator UI and Labs admin UI.

FDE Labs

backend/app/labs/: lab definitions, schemas, service, registry, change promotion.
backend/app/api/labs.py: HTTP surface for labs.
scripts/smoke_labs.py: local smoke test.
scripts/labs_quality_gate.py: deterministic validation gate for labs.

Data and curated knowledge

backend/app/data/solutions_catalog.json: base catalog used by the solutions engine.
backend/app/data/curated_intel/: versioned curated knowledge loaded into the local knowledge store.

Tests and utilities

tests/: pytest suite.
scripts/import_curated_intel.py: imports curated markdown into the local knowledge DB.
scripts/recompact_knowledge_briefs.py: recomputes compact knowledge briefs with current logic.
scripts/smoke_tests.py: HTTP smoke tests against a running backend.

Runtime model

The product can run in three modes:

demo_mode=true: deterministic/demo behavior where external services are optional.
Cloud-assisted mode: external web search and hosted model APIs enabled.
Local/LAN mode: LM Studio provides OpenAI-compatible inference from the local machine or other machines on the LAN.

At the UI level, /app supports three product modes:

SME: guided diagnosis and executive proposal generation.
Consultant: diagnosis, implementation bundle generation, and intelligence exploration.
Technical: runtime controls, scanner surfaces, and operator diagnostics.

Authentication model

There are two authentication paths:

Operator and tenant routes

For most application routes, requests are resolved to a tenant principal.

In production: use Authorization: Bearer <jwt>.
In development: X-Tenant-Id and X-User-Id headers are accepted.

Labs admin routes

Labs routes and the Labs admin UI use HTTP Basic auth.

Development defaults:

username: admin
password: change-me-admin

Change these values in .env before exposing the service outside local development.

Requirements

Minimum requirements for local development:

Python 3.11 or newer
venv
pip

Optional but recommended:

PostgreSQL, if you want to match the configured DATABASE_URL
Redis, if you plan to extend the runtime to use it
Tesseract OCR, for scanned PDFs
LM Studio, for local/LAN inference

The current labs persistence uses SQLite through LABS_SQLITE_PATH when set, or data/virtudirector_labs.sqlite3 by default.

Quick start

From the repository root:

make venv
make install
cp backend/.env.example backend/.env
make run

Open:

Exact setup steps

1. Create the virtual environment

python3 -m venv backend/.venv

or:

make venv

2. Install dependencies

backend/.venv/bin/pip install -r backend/requirements.txt

or:

make install

3. Create the environment file

cp backend/.env.example backend/.env

4. Start the backend

make run

For LAN access:

make run-lan

or:

./scripts/run_backend_lan.sh

Make targets

The repository exposes these make targets:

make help: print available targets.
make venv: create backend/.venv.
make install: install backend dependencies into backend/.venv.
make run: start FastAPI on 127.0.0.1:8000.
make run-lan: start FastAPI on 0.0.0.0:8000.
make smoke: run scripts/smoke_labs.py.
make smoke-http: run scripts/smoke_tests.py against a running backend.
make compile: run Python bytecode compilation over backend/app and scripts.
make test: run pytest.
make labs-quality: run deterministic lab validation.
make recompact-intel: recompute all knowledge briefs with current compaction logic.

Environment variables

Environment variables are loaded from backend/.env.

Important variables:

Core runtime

DEMO_MODE
ENVIRONMENT
DATA_REGION

Search provider

SEARCH_PROVIDER
BRAVE_SEARCH_API_KEY
TAVILY_API_KEY
PERPLEXITY_API_KEY
WEB_SEARCH_TIMEOUT_SECONDS
WEB_SEARCH_CACHE_TTL_SECONDS
WEB_SEARCH_DEFAULT_COUNTRY
WEB_SEARCH_DEFAULT_LANGUAGE

Hosted model routing

DEEPINFRA_API_KEY
TOGETHER_API_KEY
FIREWORKS_API_KEY
GROQ_API_KEY
ANTHROPIC_API_KEY
OPENAI_API_KEY
MODEL_ROUTER_CHEAP
MODEL_ROUTER_MEDIUM
MODEL_ROUTER_PREMIUM
MODEL_EMBEDDINGS

Local/LAN LM Studio

LOCAL_LLM_ENABLED
LOCAL_LLM_PROVIDER
PREMIUM_PROVIDER
ESCALATION_ENABLED
ESCALATION_ALLOWED_INTENTS
ESCALATION_ALLOW_SENSITIVE
LOCAL_CONTEXT_LIMIT
PREMIUM_SANDBOX_DIR
CLAUDE_CLI_COMMAND
CODEX_CLI_COMMAND
PREMIUM_CLI_TIMEOUT_SECONDS
LM_STUDIO_BASE_URL
LM_STUDIO_API_KEY
LM_STUDIO_TIMEOUT_SECONDS
LM_STUDIO_CHAT_MODEL
LM_STUDIO_MODEL_CHEAP
LM_STUDIO_MODEL_MEDIUM
LM_STUDIO_MODEL_PREMIUM
LM_STUDIO_EMBEDDING_MODEL
LM_STUDIO_REMOTE_BASE_URLS
LOCAL_EMBEDDING_FALLBACK

Persistence and auth

DATABASE_URL
REDIS_URL
LABS_SQLITE_PATH
JWT_SECRET
ADMIN_BASIC_USERNAME
ADMIN_BASIC_PASSWORD

See backend/.env.example for defaults and comments.

Main HTTP routes

Public or development-friendly routes

GET /
GET /healthz
GET /app
GET /docs

Chat and operator workflows

POST /chat
POST /opportunities/diagnose
POST /opportunities/executive-proposal
POST /opportunities/implementation-bundle
POST /process-scanner/analyze

Document ingestion

POST /documents
GET /documents/status

Knowledge ingestion and retrieval

GET /knowledge/updates/status
POST /knowledge/updates
GET /knowledge/updates
GET /knowledge/briefs
GET /knowledge/blocks
GET /knowledge/use-cases
POST /knowledge/solutions

Tool status

GET /tools/web-search/status
GET /tools/web-search/test
GET /tools/lm-studio/status
GET /tools/lm-studio/test
GET /tools/premium/status
GET /tools/runtime-policy
POST /tools/runtime-policy
POST /tools/sensitivity/analyze

Labs admin routes

GET /labs/catalog
GET /labs/schedule/preview
POST /labs/experiments/run
GET /labs/runs
GET /labs/reports
GET /labs/reports/{report_id}
POST /labs/reports/{report_id}/decision
GET /labs/changes
GET /labs/changes/{change_id}
POST /labs/changes/{change_id}/apply
GET /labs/feature-flags

How to use the web application

Operator workspace: `/app`

The /app interface is the main workspace for day-to-day usage.

Typical usage flow:

Set Tenant and Company.
Upload client documents under Documentos cliente.
Upload curated intelligence under Intel IA diaria.
Select the product mode (SME, Consultant, or Technical).
Use the guided intake or write a custom question.
Use Opportunity workbench to run a structured diagnosis.
Generate:
- an executive proposal,
- an implementation bundle,
- or both.
Use Explorador intel to inspect:
- curated blocks,
- direct searches,
- detected query intent,
- ranking reasons.
Use Process scanner to submit structured process descriptions and review candidate automations.

Labs admin: `/admin/labs`

The Labs admin panel is for measured evaluation and change promotion.

Typical usage flow:

Log in with admin Basic auth.
Review available labs.
Run a full experiment batch or a single lab.
Inspect reports with baseline vs candidate evidence.
Approve or reject reports.
Apply approved staged changes.
Inspect resulting feature flags.

How to use the repository from the command line

Run the local labs smoke test

make smoke

This command:

initializes the local labs database,
runs every registered lab,
stores run records,
stores proposed reports,
approves one report as part of the smoke flow.

Expected output shape:

Catalog: 6 labs
Runs: 6
Reports proposed: N
...
OK

Run the lab quality gate

make labs-quality

This command validates that:

every lab defined in the catalog is registered,
every lab is deterministic across two consecutive runs,
scores and metrics are well-formed,
report-producing labs generate valid report drafts.

Run the test suite

make test

The test suite covers:

knowledge ranking behavior,
explainable search metadata,
executive proposal generation,
premium escalation behavior,
runtime policy behavior,
sensitivity classification,
lab quality gate behavior,
lab persistence,
report approval and change application flow.

Rebuild all knowledge briefs

make recompact-intel

Use this after changing knowledge compaction logic in backend/app/knowledge/updates.py.

Import curated intelligence from versioned markdown

backend/.venv/bin/python scripts/import_curated_intel.py --date 2026-05-21

Common options:

--date <folder>: import only one dated folder under backend/app/data/curated_intel/
--uploaded-by <name>
--scope global|internal
--source-type <value>

How the knowledge layer works

The knowledge subsystem stores two related records:

knowledge_updates: raw ingested documents and metadata.
knowledge_briefs: compacted summaries used for retrieval.

The workflow is:

ingest markdown, text, PDF, or DOCX,
parse and normalize,
compact into summary, tags, key points, and relevance fields,
store a retrieval-ready brief,
rank briefs for future operator questions.

The ranking layer currently supports:

accent-insensitive matching,
token expansion,
field weighting,
phrase boosts,
sector boosts,
query-intent detection,
explainable search output.

How the labs subsystem works

The labs subsystem uses a fixed lifecycle:

a lab definition exists in backend/app/labs/catalog.py,
a concrete evaluator class is registered in backend/app/labs/registry.py,
the lab runs a deterministic experiment,
the service stores a lab_run,
if the threshold is exceeded, the lab generates a CoreReportDraft,
a human approves or rejects the report,
approved reports create staged changes,
staged changes can be applied,
applied changes create or update feature flags.

Current labs:

rag_grounding
model_routing_cost
agent_workflow
roi_solutions
grc_eu_ai_act
market_intelligence

Executive proposals and implementation bundles

Two product outputs sit on top of the diagnosis engine:

Executive proposals

Use POST /opportunities/executive-proposal when you need a shareable decision artifact for management.

The proposal payload includes:

selected opportunity,
problem statement,
recommended solution,
annual benefit estimate,
setup and monthly cost ranges,
deployment mode,
pilot window,
quick wins,
90-day roadmap,
primary risk,
and a concise sales message.

When persistence is enabled, the backend writes:

proposal.json
proposal.html

under data/executive_proposals/<timestamp-tenant-opportunity>/.

Implementation bundles

Use POST /opportunities/implementation-bundle when you want a delivery-ready package for a pilot or prototype.

The bundle generator writes:

swarm_input.md
execution_request.json
review_checklist.md
command.txt

under data/implementation_bundles/<timestamp-tenant-opportunity>/.

How to add a new lab

Add a catalog entry in backend/app/labs/catalog.py.
Create an evaluator in backend/app/labs/evaluators/.
Inherit from the base lab contract.
Register the evaluator with @register_lab("your_lab_id").
Ensure run() is deterministic.
Ensure build_report() returns a valid reviewable draft.
Run:

make labs-quality
make test
make smoke

How to use LM Studio

Local machine

Start LM Studio.
Enable its OpenAI-compatible server.
Set:

LOCAL_LLM_ENABLED=true
LM_STUDIO_BASE_URL=http://127.0.0.1:1234/v1

Configure chat and embedding model names in .env.

Other machines on the LAN

Enable network serving in LM Studio on the remote machine.
Add its URL to:

LM_STUDIO_REMOTE_BASE_URLS=http://192.168.x.y:1234/v1,http://192.168.x.z:1234/v1

Restart the backend.
Inspect:

GET /tools/lm-studio/status
the Runtime panel in /app

Runtime policy and premium escalation

Runtime behavior is local-first.

Each tenant can override premium routing and escalation policy without changing global settings.

Relevant routes:

GET /tools/runtime-policy
POST /tools/runtime-policy
GET /tools/premium/status
POST /tools/sensitivity/analyze

Optional premium escalation

The application is local-first by default.

If a self-hosting customer wants frontier escalation for difficult tasks, they can configure one of these modes in backend/.env:

# A) local only
LOCAL_LLM_ENABLED=true
PREMIUM_PROVIDER=lmstudio

# B) local + Claude CLI
LOCAL_LLM_ENABLED=true
PREMIUM_PROVIDER=claude_cli
ESCALATION_ENABLED=true

# C) local + Codex CLI
LOCAL_LLM_ENABLED=true
PREMIUM_PROVIDER=codex_cli
ESCALATION_ENABLED=true

# D) local + hosted API
LOCAL_LLM_ENABLED=true
PREMIUM_PROVIDER=anthropic_api
ESCALATION_ENABLED=true

Behavior:

cheap and medium tiers can remain local,
premium can be routed separately,
escalation is disabled by default,
confidential and regulated content is blocked from escalation by default,
and /tools/premium/status reports whether the selected premium backend is available.

Scheduled ingestion agent

If the client does not want to upload files manually, use:

backend/.venv/bin/python scripts/ingest_agent.py \
  --base-url http://127.0.0.1:8000 \
  --tenant-id demo-tenant \
  --user-id ingest-agent \
  --client-name "Demo SL" \
  --source-dir /absolute/path/to/source-folder

The script:

scans allowlisted local folders,
sends document-like files to /documents,
sends update-like files to /knowledge/updates,
and is designed to be run from cron or another scheduler.

Minimal end-to-end usage example

1. Start the backend

make run

2. Upload a text document into RAG

curl -X POST http://127.0.0.1:8000/documents \
  -H 'X-Tenant-Id: demo-tenant' \
  -H 'X-User-Id: tester' \
  -H 'X-Client-Name: Demo SL' \
  -F 'file=@/absolute/path/to/file.txt'

3. Ask for an opportunity diagnosis

curl -X POST http://127.0.0.1:8000/opportunities/diagnose \
  -H 'Content-Type: application/json' \
  -H 'X-Tenant-Id: demo-tenant' \
  -H 'X-User-Id: tester' \
  -H 'X-Client-Name: Demo SL' \
  -d '{"question":"Where should we implement AI first in a 500-employee SME?","employee_count":500}'

4. Generate an executive proposal

Use the diagnosis response from the previous step as the diagnosis field in the request body.

curl -X POST http://127.0.0.1:8000/opportunities/executive-proposal \
  -H 'Content-Type: application/json' \
  -H 'X-Tenant-Id: demo-tenant' \
  -H 'X-User-Id: tester' \
  -H 'X-Client-Name: Demo SL' \
  -d @/absolute/path/to/executive-proposal-request.json

5. Run a lab batch as admin

curl -X POST http://127.0.0.1:8000/labs/experiments/run \
  -u admin:change-me-admin \
  -H 'Content-Type: application/json' \
  -d '{"triggered_by":"manual"}'

CI

GitHub Actions currently runs:

make compile
make labs-quality VENV_PYTHON=python
make test VENV_PYTHON=python
make smoke VENV_PYTHON=python

CI definition: .github/workflows/ci.yml

Documentation map

Additional repository documentation:

Scope note

Repository-facing documentation is in English.

Curated intelligence datasets under backend/app/data/curated_intel/ intentionally remain in Spanish because they are runtime product content for Spanish-language retrieval and operator workflows.

An optional execution scaffold is also available under:

extensions/implementation-engine/README.md

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
.github/workflows		.github/workflows
backend		backend
docs		docs
extensions/implementation-engine		extensions/implementation-engine
scripts		scripts
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md

Folders and files

Latest commit

History

Repository files navigation

VirtuDirector IA

60-second demo

Who this is for

Repository purpose

What is included

Core application

FDE Labs

Data and curated knowledge

Tests and utilities

Runtime model

Authentication model

Operator and tenant routes

Labs admin routes

Requirements

Quick start

Exact setup steps

1. Create the virtual environment

2. Install dependencies

3. Create the environment file

4. Start the backend

Make targets

Environment variables

Core runtime

Search provider

Hosted model routing

Local/LAN LM Studio

Persistence and auth

Main HTTP routes

Public or development-friendly routes

Chat and operator workflows

Document ingestion

Knowledge ingestion and retrieval

Tool status

Labs admin routes

How to use the web application

Operator workspace: /app

Labs admin: /admin/labs

How to use the repository from the command line

Run the local labs smoke test

Run the lab quality gate

Run the test suite

Rebuild all knowledge briefs

Import curated intelligence from versioned markdown

How the knowledge layer works

How the labs subsystem works

Executive proposals and implementation bundles

Executive proposals

Implementation bundles

How to add a new lab

How to use LM Studio

Local machine

Other machines on the LAN

Runtime policy and premium escalation

Optional premium escalation

Scheduled ingestion agent

Minimal end-to-end usage example

1. Start the backend

2. Upload a text document into RAG

3. Ask for an opportunity diagnosis

4. Generate an executive proposal

5. Run a lab batch as admin

CI

Documentation map

Scope note

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Operator workspace: `/app`

Labs admin: `/admin/labs`

Packages