Agent CLI

agent-cli is a collection of local-first, AI-powered command-line agents that run entirely on your machine. It provides a suite of powerful tools for voice and text interaction, designed for privacy, offline capability, and seamless integration with system-wide hotkeys and workflows.

Important

Local and Private by Design All agents in this tool are designed to run 100% locally. Your data, whether it's from your clipboard, microphone, or files, is never sent to any cloud API. This ensures your privacy and allows the tools to work completely offline. You can also optionally configure the agents to use OpenAI/Gemini services.

Why I built this

I got tired of typing long prompts to LLMs. Speaking is faster, so I built this tool to transcribe my voice directly to the clipboard with a hotkey.

What it does:

Voice transcription to clipboard with system-wide hotkeys (Cmd+Shift+R on macOS)
Autocorrect any text from your clipboard
Edit clipboard content with voice commands ("make this more formal")
Runs locally - no internet required, your audio stays on your machine
Works with any app that can copy/paste

I use it mostly for the transcribe function when working with LLMs. Being able to speak naturally means I can provide more context without the typing fatigue.

See agent-cli in action: Watch the demo

Features

autocorrect: Correct grammar and spelling in your text (e.g., from clipboard) using a local LLM with Ollama or OpenAI.
transcribe: Transcribe audio from your microphone to text in your clipboard using a local Whisper model or OpenAI's Whisper API.
speak: Convert text to speech using Piper HTTP server, Wyoming TTS, OpenAI, or Kokoro TTS.
voice-edit: A voice-powered clipboard assistant that edits text based on your spoken commands.
assistant: A hands-free voice assistant that starts and stops recording based on a wake word.
chat: A conversational AI agent with tool-calling capabilities.

Quick Start

Just want the CLI tool?

If you already have AI services running (or plan to use OpenAI), simply install:

# Using uv (recommended)
uv tool install agent-cli

# Using pip
pip install agent-cli

Then use it:

agent-cli autocorrect "this has an eror"

Want automatic setup with everything?

We offer two ways to set up agent-cli with all services:

Option A: Using Shell Scripts (Traditional)

# 1. Clone the repository
git clone https://github.com/basnijholt/agent-cli.git
cd agent-cli

# 2. Run setup (installs all services + agent-cli)
./scripts/setup-macos.sh  # or setup-linux.sh

# 3. Start services
./scripts/start-all-services.sh

# 4. (Optional) Set up system-wide hotkeys
./scripts/setup-macos-hotkeys.sh  # or setup-linux-hotkeys.sh

# 5. Use it!
agent-cli autocorrect "this has an eror"

Option B: Using CLI Commands (New!)

# 1. Install agent-cli
uv tool install agent-cli

# 2. Install all required services
agent-cli install-services

# 3. Start all services
agent-cli start-services

# 4. (Optional) Set up system-wide hotkeys
agent-cli install-hotkeys

# 5. Use it!
agent-cli autocorrect "this has an eror"

The setup scripts automatically install:

✅ Package managers (Homebrew/uv) if needed
✅ All AI services (Ollama, Whisper, TTS, etc.)
✅ The agent-cli tool
✅ System dependencies
✅ Hotkey managers (if using hotkey scripts)

[ToC] 📚

Installation

Option 1: CLI Tool Only

If you already have AI services set up or plan to use cloud services (OpenAI/Gemini):

# Using uv (recommended)
uv tool install agent-cli

# Using pip
pip install agent-cli

Option 2: Automated Full Setup

For a complete local setup with all AI services:

Step 1: Clone the Repository

git clone https://github.com/basnijholt/agent-cli.git
cd agent-cli

Step 2: Run the Setup Script

Platform	Setup Command	What It Does	Detailed Guide
🍎 macOS	`./scripts/setup-macos.sh`	Installs Homebrew (if needed), uv, Ollama, all services, and agent-cli	macOS Guide
🐧 Linux	`./scripts/setup-linux.sh`	Installs uv, Ollama, all services, and agent-cli	Linux Guide
❄️ NixOS	See guide →	Special instructions for NixOS	NixOS Guide
🐳 Docker	See guide →	Container-based setup (slower)	Docker Guide

Step 3: Start All Services

./scripts/start-all-services.sh

This launches all AI services in a single terminal session using Zellij.

Step 4: Test Your Installation

agent-cli autocorrect "this has an eror"
# Output: this has an error

Note

The setup scripts handle everything automatically. For platform-specific details or troubleshooting, see the installation guides.

Development Installation

For contributing or development:

git clone https://github.com/basnijholt/agent-cli.git
cd agent-cli
uv sync
source .venv/bin/activate  # On Windows: .venv\Scripts\activate

System Integration

Want system-wide hotkeys? You'll need the repository for the setup scripts:

# If you haven't already cloned it
git clone https://github.com/basnijholt/agent-cli.git
cd agent-cli

macOS Hotkeys

./scripts/setup-macos-hotkeys.sh

This script automatically:

✅ Installs Homebrew if not present
✅ Installs skhd (hotkey daemon) and terminal-notifier
✅ Configures these system-wide hotkeys:
- Cmd+Shift+R - Toggle voice transcription
- Cmd+Shift+A - Autocorrect clipboard text
- Cmd+Shift+V - Voice edit clipboard text

Note

After setup, you may need to grant Accessibility permissions to skhd in System Settings → Privacy & Security → Accessibility

Linux Hotkeys

./scripts/setup-linux-hotkeys.sh

This script automatically:

✅ Installs notification tools if needed
✅ Provides configuration for your desktop environment
✅ Sets up these hotkeys:
- Super+Shift+R - Toggle voice transcription
- Super+Shift+A - Autocorrect clipboard text
- Super+Shift+V - Voice edit clipboard text

The script supports Hyprland, GNOME, KDE, Sway, i3, XFCE, and provides instructions for manual configuration on other environments.

Prerequisites

What You Need to Install Manually

The only thing you need to have installed is Git to clone this repository. Everything else is handled automatically!

What the Setup Scripts Install for You

Our installation scripts automatically handle all dependencies:

Core Requirements (Auto-installed)

🍺 Homebrew (macOS) - Installed if not present
🐍 uv - Python package manager - Installed automatically
🎶 PortAudio - For microphone and speaker I/O - Installed via package manager
📋 Clipboard Tools - Pre-installed on macOS, handled on Linux

AI Services (Auto-installed and configured)

Service	Purpose	Auto-installed?
Ollama	Local LLM for text processing	✅ Yes, with default model
Wyoming Faster Whisper	Speech-to-text	✅ Yes, via `uvx`
Piper TTS	Text-to-speech (HTTP server)	✅ Yes, via `uvx`
Wyoming Piper	Text-to-speech (Wyoming protocol)	⚙️ Alternative to HTTP server

Note

TTS Provider Update: The default Piper TTS now uses HTTP server mode for better performance. Scripts have been renamed: run-piper.sh → run-piper-wyoming.sh, run-piper2.sh → run-piper-server.sh. Use --tts-provider piper for the new HTTP server, or --tts-provider local for Wyoming protocol. | Kokoro-FastAPI | Premium TTS (optional) | ⚙️ Can be added later | | Wyoming openWakeWord | Wake word detection | ✅ Yes, for assistant |

Alternative Cloud Services (Optional)

If you prefer cloud services over local ones:

Service	Purpose	Setup Required
OpenAI	LLM, Speech-to-text, TTS	API key in config
Gemini	LLM alternative	API key in config

Usage

This package provides multiple command-line tools, each designed for a specific purpose.

Installation Commands

These commands help you set up agent-cli and its required services:

install-services: Install all required AI services (Ollama, Whisper, Piper, OpenWakeWord)
install-hotkeys: Set up system-wide hotkeys for quick access to agent-cli features
start-services: Start all services in a Zellij terminal session

All necessary scripts are bundled with the package, so you can run these commands immediately after installing agent-cli.

Configuration

All agent-cli commands can be configured using a TOML file. The configuration file is searched for in the following locations, in order:

./agent-cli-config.toml (in the current directory)
~/.config/agent-cli/config.toml

You can also specify a path to a configuration file using the --config option, e.g., agent-cli transcribe --config /path/to/your/config.toml.

Command-line options always take precedence over settings in the configuration file.

An example configuration file is provided in example.agent-cli-config.toml.

Service Provider

You can choose to use local services (Wyoming/Ollama) or OpenAI services by setting the service_provider option in the [defaults] section of your configuration file.

[defaults]
# service_provider = "openai"  # 'local' or 'openai'
# openai_api_key = "sk-..."

`autocorrect`

Purpose: Quickly fix spelling and grammar in any text you've copied.

Workflow: This is a simple, one-shot command.

It reads text from your system clipboard (or from a direct argument).
It sends the text to a local Ollama LLM with a prompt to perform only technical corrections.
The corrected text is copied back to your clipboard, replacing the original.

How to Use It: This tool is ideal for integrating with a system-wide hotkey.

From Clipboard: agent-cli autocorrect
From Argument: agent-cli autocorrect "this text has an eror"

See the output of agent-cli autocorrect --help

 Usage: agent-cli autocorrect [OPTIONS] [TEXT]

 Correct text from clipboard using a local or remote LLM.


╭─ General Options ──────────────────────────────────────────────────────────────────────╮
│   text      [TEXT]  The text to correct. If not provided, reads from clipboard.        │
│                     [default: None]                                                    │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ Options ──────────────────────────────────────────────────────────────────────────────╮
│ --help          Show this message and exit.                                            │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ Provider Selection ───────────────────────────────────────────────────────────────────╮
│ --llm-provider        TEXT  The LLM provider to use ('local' for Ollama, 'openai',     │
│                             'gemini').                                                 │
│                             [default: local]                                           │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ LLM Configuration: Ollama (local) ────────────────────────────────────────────────────╮
│ --llm-ollama-model        TEXT  The Ollama model to use. Default is qwen3:4b.          │
│                                 [default: qwen3:4b]                                    │
│ --llm-ollama-host         TEXT  The Ollama server host. Default is                     │
│                                 http://localhost:11434.                                │
│                                 [default: http://localhost:11434]                      │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ LLM Configuration: OpenAI ────────────────────────────────────────────────────────────╮
│ --llm-openai-model        TEXT  The OpenAI model to use for LLM tasks.                 │
│                                 [default: gpt-4o-mini]                                 │
│ --openai-api-key          TEXT  Your OpenAI API key. Can also be set with the          │
│                                 OPENAI_API_KEY environment variable.                   │
│                                 [env var: OPENAI_API_KEY]                              │
│                                 [default: None]                                        │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ LLM Configuration: Gemini ────────────────────────────────────────────────────────────╮
│ --llm-gemini-model        TEXT  The Gemini model to use for LLM tasks.                 │
│                                 [default: gemini-2.5-flash]                            │
│ --gemini-api-key          TEXT  Your Gemini API key. Can also be set with the          │
│                                 GEMINI_API_KEY environment variable.                   │
│                                 [env var: GEMINI_API_KEY]                              │
│                                 [default: None]                                        │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ General Options ──────────────────────────────────────────────────────────────────────╮
│ --log-level           TEXT  Set logging level. [default: WARNING]                      │
│ --log-file            TEXT  Path to a file to write logs to. [default: None]           │
│ --quiet       -q            Suppress console output from rich.                         │
│ --config              TEXT  Path to a TOML configuration file. [default: None]         │
│ --print-args                Print the command line arguments, including variables      │
│                             taken from the configuration file.                         │
╰────────────────────────────────────────────────────────────────────────────────────────╯

`transcribe`

Purpose: A simple tool to turn your speech into text.

Workflow: This agent listens to your microphone and converts your speech to text in real-time.

Run the command. It will start listening immediately.
Speak into your microphone.
Press Ctrl+C to stop recording.
The transcribed text is copied to your clipboard.
Optionally, use the --llm flag to have an Ollama model clean up the raw transcript (fixing punctuation, etc.).

How to Use It:

Simple Transcription: agent-cli transcribe --input-device-index 1
With LLM Cleanup: agent-cli transcribe --input-device-index 1 --llm

See the output of agent-cli transcribe --help

 Usage: agent-cli transcribe [OPTIONS]

 Wyoming ASR Client for streaming microphone audio to a transcription server.


╭─ Options ──────────────────────────────────────────────────────────────────────────────╮
│ --extra-instructions        TEXT  Additional instructions for the LLM to process the   │
│                                   transcription.                                       │
│                                   [default: None]                                      │
│ --help                            Show this message and exit.                          │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ Provider Selection ───────────────────────────────────────────────────────────────────╮
│ --asr-provider        TEXT  The ASR provider to use ('local' for Wyoming, 'openai').   │
│                             [default: local]                                           │
│ --llm-provider        TEXT  The LLM provider to use ('local' for Ollama, 'openai',     │
│                             'gemini').                                                 │
│                             [default: local]                                           │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ ASR (Audio) Configuration ────────────────────────────────────────────────────────────╮
│ --input-device-index        INTEGER  Index of the PyAudio input device to use.         │
│                                      [default: None]                                   │
│ --input-device-name         TEXT     Device name keywords for partial matching.        │
│                                      [default: None]                                   │
│ --list-devices                       List available audio input and output devices and │
│                                      exit.                                             │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ ASR (Audio) Configuration: Wyoming (local) ───────────────────────────────────────────╮
│ --asr-wyoming-ip          TEXT     Wyoming ASR server IP address. [default: localhost] │
│ --asr-wyoming-port        INTEGER  Wyoming ASR server port. [default: 10300]           │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ ASR (Audio) Configuration: OpenAI ────────────────────────────────────────────────────╮
│ --asr-openai-model        TEXT  The OpenAI model to use for ASR (transcription).       │
│                                 [default: whisper-1]                                   │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ LLM Configuration: Ollama (local) ────────────────────────────────────────────────────╮
│ --llm-ollama-model        TEXT  The Ollama model to use. Default is qwen3:4b.          │
│                                 [default: qwen3:4b]                                    │
│ --llm-ollama-host         TEXT  The Ollama server host. Default is                     │
│                                 http://localhost:11434.                                │
│                                 [default: http://localhost:11434]                      │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ LLM Configuration: OpenAI ────────────────────────────────────────────────────────────╮
│ --llm-openai-model        TEXT  The OpenAI model to use for LLM tasks.                 │
│                                 [default: gpt-4o-mini]                                 │
│ --openai-api-key          TEXT  Your OpenAI API key. Can also be set with the          │
│                                 OPENAI_API_KEY environment variable.                   │
│                                 [env var: OPENAI_API_KEY]                              │
│                                 [default: None]                                        │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ LLM Configuration: Gemini ────────────────────────────────────────────────────────────╮
│ --llm-gemini-model        TEXT  The Gemini model to use for LLM tasks.                 │
│                                 [default: gemini-2.5-flash]                            │
│ --gemini-api-key          TEXT  Your Gemini API key. Can also be set with the          │
│                                 GEMINI_API_KEY environment variable.                   │
│                                 [env var: GEMINI_API_KEY]                              │
│                                 [default: None]                                        │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ LLM Configuration ────────────────────────────────────────────────────────────────────╮
│ --llm    --no-llm      Use an LLM to process the transcript. [default: no-llm]         │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ Process Management Options ───────────────────────────────────────────────────────────╮
│ --stop            Stop any running background process.                                 │
│ --status          Check if a background process is running.                            │
│ --toggle          Toggle the background process on/off. If the process is running, it  │
│                   will be stopped. If the process is not running, it will be started.  │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ General Options ──────────────────────────────────────────────────────────────────────╮
│ --clipboard              --no-clipboard          Copy result to clipboard.             │
│                                                  [default: clipboard]                  │
│ --log-level                                TEXT  Set logging level. [default: WARNING] │
│ --log-file                                 TEXT  Path to a file to write logs to.      │
│                                                  [default: None]                       │
│ --quiet              -q                          Suppress console output from rich.    │
│ --config                                   TEXT  Path to a TOML configuration file.    │
│                                                  [default: None]                       │
│ --print-args                                     Print the command line arguments,     │
│                                                  including variables taken from the    │
│                                                  configuration file.                   │
│ --transcription-log                        PATH  Path to log transcription results     │
│                                                  with timestamps, hostname, model, and │
│                                                  raw output.                           │
│                                                  [default: None]                       │
╰────────────────────────────────────────────────────────────────────────────────────────╯

`speak`

Purpose: Reads any text out loud.

Workflow: A straightforward text-to-speech utility.

It takes text from a command-line argument or your clipboard.
It sends the text to a TTS server (Piper HTTP server, Wyoming TTS, OpenAI, or Kokoro).
The generated audio is played through your default speakers.

TTS Provider Options:

Piper HTTP Server (default local): Fast, high-quality TTS via HTTP
- Start server: ./scripts/run-piper-server.sh
- Use: agent-cli speak --tts-provider piper "Hello, world!"
Wyoming Piper: Alternative Wyoming protocol interface
- Start server: ./scripts/run-piper-wyoming.sh
- Use: agent-cli speak --tts-provider local "Hello, world!"
OpenAI: Cloud-based TTS (requires API key)
- Use: agent-cli speak --tts-provider openai "Hello, world!"
Kokoro: High-quality local TTS (optional setup)
- Use: agent-cli speak --tts-provider kokoro "Hello, world!"

How to Use It:

Speak from Argument: agent-cli speak "Hello, world!"
Speak from Clipboard: agent-cli speak
Save to File: agent-cli speak "Hello" --save-file hello.wav
With Piper HTTP: agent-cli speak --tts-provider piper "Hello"

See the output of agent-cli speak --help

 Usage: agent-cli speak [OPTIONS] [TEXT]

 Convert text to speech using Wyoming or OpenAI TTS server.


╭─ General Options ──────────────────────────────────────────────────────────────────────╮
│   text      [TEXT]  Text to speak. Reads from clipboard if not provided.               │
│                     [default: None]                                                    │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ Options ──────────────────────────────────────────────────────────────────────────────╮
│ --help          Show this message and exit.                                            │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ Provider Selection ───────────────────────────────────────────────────────────────────╮
│ --tts-provider        TEXT  The TTS provider to use ('local' for Wyoming, 'openai',    │
│                             'kokoro').                                                 │
│                             [default: local]                                           │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ TTS (Text-to-Speech) Configuration ───────────────────────────────────────────────────╮
│ --output-device-index        INTEGER  Index of the PyAudio output device to use for    │
│                                       TTS.                                             │
│                                       [default: None]                                  │
│ --output-device-name         TEXT     Output device name keywords for partial          │
│                                       matching.                                        │
│                                       [default: None]                                  │
│ --tts-speed                  FLOAT    Speech speed multiplier (1.0 = normal, 2.0 =     │
│                                       twice as fast, 0.5 = half speed).                │
│                                       [default: 1.0]                                   │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ TTS (Text-to-Speech) Configuration: Wyoming (local) ──────────────────────────────────╮
│ --tts-wyoming-ip              TEXT     Wyoming TTS server IP address.                  │
│                                        [default: localhost]                            │
│ --tts-wyoming-port            INTEGER  Wyoming TTS server port. [default: 10200]       │
│ --tts-wyoming-voice           TEXT     Voice name to use for Wyoming TTS (e.g.,        │
│                                        'en_US-lessac-medium').                         │
│                                        [default: None]                                 │
│ --tts-wyoming-language        TEXT     Language for Wyoming TTS (e.g., 'en_US').       │
│                                        [default: None]                                 │
│ --tts-wyoming-speaker         TEXT     Speaker name for Wyoming TTS voice.             │
│                                        [default: None]                                 │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ TTS (Text-to-Speech) Configuration: OpenAI ───────────────────────────────────────────╮
│ --tts-openai-model        TEXT  The OpenAI model to use for TTS. [default: tts-1]      │
│ --tts-openai-voice        TEXT  The voice to use for OpenAI TTS. [default: alloy]      │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ TTS (Text-to-Speech) Configuration: Kokoro ───────────────────────────────────────────╮
│ --tts-kokoro-model        TEXT  The Kokoro model to use for TTS. [default: kokoro]     │
│ --tts-kokoro-voice        TEXT  The voice to use for Kokoro TTS. [default: af_sky]     │
│ --tts-kokoro-host         TEXT  The base URL for the Kokoro API.                       │
│                                 [default: http://localhost:8880/v1]                    │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ TTS (Text-to-Speech) Configuration: Piper ────────────────────────────────────────────╮
│ --tts-piper-host                 TEXT     The base URL for the Piper HTTP server.      │
│                                           [default: http://localhost:10200]            │
│ --tts-piper-voice                TEXT     The voice to use for Piper TTS (optional).   │
│                                           [default: None]                              │
│ --tts-piper-speaker              TEXT     The speaker to use for multi-speaker voices  │
│                                           (optional).                                  │
│                                           [default: None]                              │
│ --tts-piper-speaker-id           INTEGER  The speaker ID to use for multi-speaker      │
│                                           voices (optional, overrides speaker).        │
│                                           [default: None]                              │
│ --tts-piper-length-scale         FLOAT    Speaking speed (1.0 = normal speed).         │
│                                           [default: 1.0]                               │
│ --tts-piper-noise-scale          FLOAT    Speaking variability (optional).             │
│                                           [default: None]                              │
│ --tts-piper-noise-w-scale        FLOAT    Phoneme width variability (optional).        │
│                                           [default: None]                              │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ ASR (Audio) Configuration ────────────────────────────────────────────────────────────╮
│ --list-devices          List available audio input and output devices and exit.        │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ General Options ──────────────────────────────────────────────────────────────────────╮
│ --save-file           PATH  Save TTS response audio to WAV file. [default: None]       │
│ --log-level           TEXT  Set logging level. [default: WARNING]                      │
│ --log-file            TEXT  Path to a file to write logs to. [default: None]           │
│ --quiet       -q            Suppress console output from rich.                         │
│ --config              TEXT  Path to a TOML configuration file. [default: None]         │
│ --print-args                Print the command line arguments, including variables      │
│                             taken from the configuration file.                         │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ Process Management Options ───────────────────────────────────────────────────────────╮
│ --stop            Stop any running background process.                                 │
│ --status          Check if a background process is running.                            │
│ --toggle          Toggle the background process on/off. If the process is running, it  │
│                   will be stopped. If the process is not running, it will be started.  │
╰────────────────────────────────────────────────────────────────────────────────────────╯

`voice-edit`

Purpose: A powerful clipboard assistant that you command with your voice.

Workflow: This agent is designed for a hotkey-driven workflow to act on text you've already copied.

Copy a block of text to your clipboard (e.g., an email draft).
Press a hotkey to run agent-cli voice-edit & in the background. The agent is now listening.
Speak a command, such as "Make this more formal" or "Summarize the key points."
Press the same hotkey again, which should trigger agent-cli voice-edit --stop.
The agent transcribes your command, sends it along with the original clipboard text to the LLM, and the LLM performs the action.
The result is copied back to your clipboard. If --tts is enabled, it will also speak the result.

How to Use It: The power of this tool is unlocked with a hotkey manager like Keyboard Maestro (macOS) or AutoHotkey (Windows). See the docstring in agent_cli/agents/voice_edit.py for a detailed Keyboard Maestro setup guide.

See the output of agent-cli voice-edit --help

 Usage: agent-cli voice-edit [OPTIONS]

 Interact with clipboard text via a voice command using local or remote services.

 Usage: - Run in foreground: agent-cli voice-edit --input-device-index 1 - Run in
 background: agent-cli voice-edit --input-device-index 1 & - Check status: agent-cli
 voice-edit --status - Stop background process: agent-cli voice-edit --stop - List output
 devices: agent-cli voice-edit --list-output-devices - Save TTS to file: agent-cli
 voice-edit --tts --save-file response.wav

╭─ Options ──────────────────────────────────────────────────────────────────────────────╮
│ --help          Show this message and exit.                                            │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ Provider Selection ───────────────────────────────────────────────────────────────────╮
│ --asr-provider        TEXT  The ASR provider to use ('local' for Wyoming, 'openai').   │
│                             [default: local]                                           │
│ --llm-provider        TEXT  The LLM provider to use ('local' for Ollama, 'openai',     │
│                             'gemini').                                                 │
│                             [default: local]                                           │
│ --tts-provider        TEXT  The TTS provider to use ('local' for Wyoming, 'openai',    │
│                             'kokoro').                                                 │
│                             [default: local]                                           │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ ASR (Audio) Configuration ────────────────────────────────────────────────────────────╮
│ --input-device-index        INTEGER  Index of the PyAudio input device to use.         │
│                                      [default: None]                                   │
│ --input-device-name         TEXT     Device name keywords for partial matching.        │
│                                      [default: None]                                   │
│ --list-devices                       List available audio input and output devices and │
│                                      exit.                                             │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ ASR (Audio) Configuration: Wyoming (local) ───────────────────────────────────────────╮
│ --asr-wyoming-ip          TEXT     Wyoming ASR server IP address. [default: localhost] │
│ --asr-wyoming-port        INTEGER  Wyoming ASR server port. [default: 10300]           │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ ASR (Audio) Configuration: OpenAI ────────────────────────────────────────────────────╮
│ --asr-openai-model        TEXT  The OpenAI model to use for ASR (transcription).       │
│                                 [default: whisper-1]                                   │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ LLM Configuration: Ollama (local) ────────────────────────────────────────────────────╮
│ --llm-ollama-model        TEXT  The Ollama model to use. Default is qwen3:4b.          │
│                                 [default: qwen3:4b]                                    │
│ --llm-ollama-host         TEXT  The Ollama server host. Default is                     │
│                                 http://localhost:11434.                                │
│                                 [default: http://localhost:11434]                      │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ LLM Configuration: OpenAI ────────────────────────────────────────────────────────────╮
│ --llm-openai-model        TEXT  The OpenAI model to use for LLM tasks.                 │
│                                 [default: gpt-4o-mini]                                 │
│ --openai-api-key          TEXT  Your OpenAI API key. Can also be set with the          │
│                                 OPENAI_API_KEY environment variable.                   │
│                                 [env var: OPENAI_API_KEY]                              │
│                                 [default: None]                                        │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ LLM Configuration: Gemini ────────────────────────────────────────────────────────────╮
│ --llm-gemini-model        TEXT  The Gemini model to use for LLM tasks.                 │
│                                 [default: gemini-2.5-flash]                            │
│ --gemini-api-key          TEXT  Your Gemini API key. Can also be set with the          │
│                                 GEMINI_API_KEY environment variable.                   │
│                                 [env var: GEMINI_API_KEY]                              │
│                                 [default: None]                                        │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ TTS (Text-to-Speech) Configuration ───────────────────────────────────────────────────╮
│ --tts                    --no-tts             Enable text-to-speech for responses.     │
│                                               [default: no-tts]                        │
│ --output-device-index                INTEGER  Index of the PyAudio output device to    │
│                                               use for TTS.                             │
│                                               [default: None]                          │
│ --output-device-name                 TEXT     Output device name keywords for partial  │
│                                               matching.                                │
│                                               [default: None]                          │
│ --tts-speed                          FLOAT    Speech speed multiplier (1.0 = normal,   │
│                                               2.0 = twice as fast, 0.5 = half speed).  │
│                                               [default: 1.0]                           │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ TTS (Text-to-Speech) Configuration: Wyoming (local) ──────────────────────────────────╮
│ --tts-wyoming-ip              TEXT     Wyoming TTS server IP address.                  │
│                                        [default: localhost]                            │
│ --tts-wyoming-port            INTEGER  Wyoming TTS server port. [default: 10200]       │
│ --tts-wyoming-voice           TEXT     Voice name to use for Wyoming TTS (e.g.,        │
│                                        'en_US-lessac-medium').                         │
│                                        [default: None]                                 │
│ --tts-wyoming-language        TEXT     Language for Wyoming TTS (e.g., 'en_US').       │
│                                        [default: None]                                 │
│ --tts-wyoming-speaker         TEXT     Speaker name for Wyoming TTS voice.             │
│                                        [default: None]                                 │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ TTS (Text-to-Speech) Configuration: OpenAI ───────────────────────────────────────────╮
│ --tts-openai-model        TEXT  The OpenAI model to use for TTS. [default: tts-1]      │
│ --tts-openai-voice        TEXT  The voice to use for OpenAI TTS. [default: alloy]      │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ TTS (Text-to-Speech) Configuration: Kokoro ───────────────────────────────────────────╮
│ --tts-kokoro-model        TEXT  The Kokoro model to use for TTS. [default: kokoro]     │
│ --tts-kokoro-voice        TEXT  The voice to use for Kokoro TTS. [default: af_sky]     │
│ --tts-kokoro-host         TEXT  The base URL for the Kokoro API.                       │
│                                 [default: http://localhost:8880/v1]                    │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ TTS (Text-to-Speech) Configuration: Piper ────────────────────────────────────────────╮
│ --tts-piper-host                 TEXT     The base URL for the Piper HTTP server.      │
│                                           [default: http://localhost:10200]            │
│ --tts-piper-voice                TEXT     The voice to use for Piper TTS (optional).   │
│                                           [default: None]                              │
│ --tts-piper-speaker              TEXT     The speaker to use for multi-speaker voices  │
│                                           (optional).                                  │
│                                           [default: None]                              │
│ --tts-piper-speaker-id           INTEGER  The speaker ID to use for multi-speaker      │
│                                           voices (optional, overrides speaker).        │
│                                           [default: None]                              │
│ --tts-piper-length-scale         FLOAT    Speaking speed (1.0 = normal speed).         │
│                                           [default: 1.0]                               │
│ --tts-piper-noise-scale          FLOAT    Speaking variability (optional).             │
│                                           [default: None]                              │
│ --tts-piper-noise-w-scale        FLOAT    Phoneme width variability (optional).        │
│                                           [default: None]                              │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ Process Management Options ───────────────────────────────────────────────────────────╮
│ --stop            Stop any running background process.                                 │
│ --status          Check if a background process is running.                            │
│ --toggle          Toggle the background process on/off. If the process is running, it  │
│                   will be stopped. If the process is not running, it will be started.  │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ General Options ──────────────────────────────────────────────────────────────────────╮
│ --save-file                         PATH  Save TTS response audio to WAV file.         │
│                                           [default: None]                              │
│ --clipboard       --no-clipboard          Copy result to clipboard.                    │
│                                           [default: clipboard]                         │
│ --log-level                         TEXT  Set logging level. [default: WARNING]        │
│ --log-file                          TEXT  Path to a file to write logs to.             │
│                                           [default: None]                              │
│ --quiet       -q                          Suppress console output from rich.           │
│ --config                            TEXT  Path to a TOML configuration file.           │
│                                           [default: None]                              │
│ --print-args                              Print the command line arguments, including  │
│                                           variables taken from the configuration file. │
╰────────────────────────────────────────────────────────────────────────────────────────╯

`assistant`

Purpose: A hands-free voice assistant that starts and stops recording based on a wake word.

Workflow: This agent continuously listens for a wake word (e.g., "Hey Nabu").

Run the assistant command. It will start listening for the wake word.
Say the wake word to start recording.
Speak your command or question.
Say the wake word again to stop recording.
The agent transcribes your speech, sends it to the LLM, and gets a response.
The agent speaks the response back to you and then immediately starts listening for the wake word again.

How to Use It:

Start the agent: agent-cli assistant --wake-word "ok_nabu" --input-device-index 1
With TTS: agent-cli assistant --wake-word "ok_nabu" --tts --voice "en_US-lessac-medium"

See the output of agent-cli assistant --help

 Usage: agent-cli assistant [OPTIONS]

 Wake word-based voice assistant using local or remote services.


╭─ Options ──────────────────────────────────────────────────────────────────────────────╮
│ --help          Show this message and exit.                                            │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ Provider Selection ───────────────────────────────────────────────────────────────────╮
│ --asr-provider        TEXT  The ASR provider to use ('local' for Wyoming, 'openai').   │
│                             [default: local]                                           │
│ --llm-provider        TEXT  The LLM provider to use ('local' for Ollama, 'openai',     │
│                             'gemini').                                                 │
│                             [default: local]                                           │
│ --tts-provider        TEXT  The TTS provider to use ('local' for Wyoming, 'openai',    │
│                             'kokoro').                                                 │
│                             [default: local]                                           │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ Wake Word Options ────────────────────────────────────────────────────────────────────╮
│ --wake-server-ip          TEXT     Wyoming wake word server IP address.                │
│                                    [default: localhost]                                │
│ --wake-server-port        INTEGER  Wyoming wake word server port. [default: 10400]     │
│ --wake-word               TEXT     Name of wake word to detect (e.g., 'ok_nabu',       │
│                                    'hey_jarvis').                                      │
│                                    [default: ok_nabu]                                  │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ ASR (Audio) Configuration ────────────────────────────────────────────────────────────╮
│ --input-device-index        INTEGER  Index of the PyAudio input device to use.         │
│                                      [default: None]                                   │
│ --input-device-name         TEXT     Device name keywords for partial matching.        │
│                                      [default: None]                                   │
│ --list-devices                       List available audio input and output devices and │
│                                      exit.                                             │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ ASR (Audio) Configuration: Wyoming (local) ───────────────────────────────────────────╮
│ --asr-wyoming-ip          TEXT     Wyoming ASR server IP address. [default: localhost] │
│ --asr-wyoming-port        INTEGER  Wyoming ASR server port. [default: 10300]           │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ ASR (Audio) Configuration: OpenAI ────────────────────────────────────────────────────╮
│ --asr-openai-model        TEXT  The OpenAI model to use for ASR (transcription).       │
│                                 [default: whisper-1]                                   │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ LLM Configuration: Ollama (local) ────────────────────────────────────────────────────╮
│ --llm-ollama-model        TEXT  The Ollama model to use. Default is qwen3:4b.          │
│                                 [default: qwen3:4b]                                    │
│ --llm-ollama-host         TEXT  The Ollama server host. Default is                     │
│                                 http://localhost:11434.                                │
│                                 [default: http://localhost:11434]                      │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ LLM Configuration: OpenAI ────────────────────────────────────────────────────────────╮
│ --llm-openai-model        TEXT  The OpenAI model to use for LLM tasks.                 │
│                                 [default: gpt-4o-mini]                                 │
│ --openai-api-key          TEXT  Your OpenAI API key. Can also be set with the          │
│                                 OPENAI_API_KEY environment variable.                   │
│                                 [env var: OPENAI_API_KEY]                              │
│                                 [default: None]                                        │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ LLM Configuration: Gemini ────────────────────────────────────────────────────────────╮
│ --llm-gemini-model        TEXT  The Gemini model to use for LLM tasks.                 │
│                                 [default: gemini-2.5-flash]                            │
│ --gemini-api-key          TEXT  Your Gemini API key. Can also be set with the          │
│                                 GEMINI_API_KEY environment variable.                   │
│                                 [env var: GEMINI_API_KEY]                              │
│                                 [default: None]                                        │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ TTS (Text-to-Speech) Configuration ───────────────────────────────────────────────────╮
│ --tts                    --no-tts             Enable text-to-speech for responses.     │
│                                               [default: no-tts]                        │
│ --output-device-index                INTEGER  Index of the PyAudio output device to    │
│                                               use for TTS.                             │
│                                               [default: None]                          │
│ --output-device-name                 TEXT     Output device name keywords for partial  │
│                                               matching.                                │
│                                               [default: None]                          │
│ --tts-speed                          FLOAT    Speech speed multiplier (1.0 = normal,   │
│                                               2.0 = twice as fast, 0.5 = half speed).  │
│                                               [default: 1.0]                           │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ TTS (Text-to-Speech) Configuration: Wyoming (local) ──────────────────────────────────╮
│ --tts-wyoming-ip              TEXT     Wyoming TTS server IP address.                  │
│                                        [default: localhost]                            │
│ --tts-wyoming-port            INTEGER  Wyoming TTS server port. [default: 10200]       │
│ --tts-wyoming-voice           TEXT     Voice name to use for Wyoming TTS (e.g.,        │
│                                        'en_US-lessac-medium').                         │
│                                        [default: None]                                 │
│ --tts-wyoming-language        TEXT     Language for Wyoming TTS (e.g., 'en_US').       │
│                                        [default: None]                                 │
│ --tts-wyoming-speaker         TEXT     Speaker name for Wyoming TTS voice.             │
│                                        [default: None]                                 │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ TTS (Text-to-Speech) Configuration: OpenAI ───────────────────────────────────────────╮
│ --tts-openai-model        TEXT  The OpenAI model to use for TTS. [default: tts-1]      │
│ --tts-openai-voice        TEXT  The voice to use for OpenAI TTS. [default: alloy]      │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ TTS (Text-to-Speech) Configuration: Kokoro ───────────────────────────────────────────╮
│ --tts-kokoro-model        TEXT  The Kokoro model to use for TTS. [default: kokoro]     │
│ --tts-kokoro-voice        TEXT  The voice to use for Kokoro TTS. [default: af_sky]     │
│ --tts-kokoro-host         TEXT  The base URL for the Kokoro API.                       │
│                                 [default: http://localhost:8880/v1]                    │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ TTS (Text-to-Speech) Configuration: Piper ────────────────────────────────────────────╮
│ --tts-piper-host                 TEXT     The base URL for the Piper HTTP server.      │
│                                           [default: http://localhost:10200]            │
│ --tts-piper-voice                TEXT     The voice to use for Piper TTS (optional).   │
│                                           [default: None]                              │
│ --tts-piper-speaker              TEXT     The speaker to use for multi-speaker voices  │
│                                           (optional).                                  │
│                                           [default: None]                              │
│ --tts-piper-speaker-id           INTEGER  The speaker ID to use for multi-speaker      │
│                                           voices (optional, overrides speaker).        │
│                                           [default: None]                              │
│ --tts-piper-length-scale         FLOAT    Speaking speed (1.0 = normal speed).         │
│                                           [default: 1.0]                               │
│ --tts-piper-noise-scale          FLOAT    Speaking variability (optional).             │
│                                           [default: None]                              │
│ --tts-piper-noise-w-scale        FLOAT    Phoneme width variability (optional).        │
│                                           [default: None]                              │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ Process Management Options ───────────────────────────────────────────────────────────╮
│ --stop            Stop any running background process.                                 │
│ --status          Check if a background process is running.                            │
│ --toggle          Toggle the background process on/off. If the process is running, it  │
│                   will be stopped. If the process is not running, it will be started.  │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ General Options ──────────────────────────────────────────────────────────────────────╮
│ --save-file                         PATH  Save TTS response audio to WAV file.         │
│                                           [default: None]                              │
│ --clipboard       --no-clipboard          Copy result to clipboard.                    │
│                                           [default: clipboard]                         │
│ --log-level                         TEXT  Set logging level. [default: WARNING]        │
│ --log-file                          TEXT  Path to a file to write logs to.             │
│                                           [default: None]                              │
│ --quiet       -q                          Suppress console output from rich.           │
│ --config                            TEXT  Path to a TOML configuration file.           │
│                                           [default: None]                              │
│ --print-args                              Print the command line arguments, including  │
│                                           variables taken from the configuration file. │
╰────────────────────────────────────────────────────────────────────────────────────────╯

`chat`

Purpose: A full-featured, conversational AI assistant that can interact with your system.

Workflow: This is a persistent, conversational agent that you can have a conversation with.

Run the chat command. It will start listening for your voice.
Speak your command or question (e.g., "What's in my current directory?").
The agent transcribes your speech, sends it to the LLM, and gets a response. The LLM can use tools like read_file or execute_code to answer your question.
The agent speaks the response back to you and then immediately starts listening for your next command.
The conversation continues in this loop. Conversation history is saved between sessions.

Interaction Model:

To Interrupt: Press Ctrl+C once to stop the agent from either listening or speaking, and it will immediately return to a listening state for a new command. This is useful if it misunderstands you or you want to speak again quickly.
To Exit: Press Ctrl+C twice in a row to terminate the application.

How to Use It:

Start the agent: agent-cli chat --input-device-index 1 --tts
Have a conversation:
- You: "Read the pyproject.toml file and tell me the project version."
- AI: (Reads file) "The project version is 0.1.0."
- You: "Thanks!"

See the output of agent-cli chat --help

 Usage: agent-cli chat [OPTIONS]

 An chat agent that you can talk to.


╭─ Options ──────────────────────────────────────────────────────────────────────────────╮
│ --help          Show this message and exit.                                            │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ Provider Selection ───────────────────────────────────────────────────────────────────╮
│ --asr-provider        TEXT  The ASR provider to use ('local' for Wyoming, 'openai').   │
│                             [default: local]                                           │
│ --llm-provider        TEXT  The LLM provider to use ('local' for Ollama, 'openai',     │
│                             'gemini').                                                 │
│                             [default: local]                                           │
│ --tts-provider        TEXT  The TTS provider to use ('local' for Wyoming, 'openai',    │
│                             'kokoro').                                                 │
│                             [default: local]                                           │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ ASR (Audio) Configuration ────────────────────────────────────────────────────────────╮
│ --input-device-index        INTEGER  Index of the PyAudio input device to use.         │
│                                      [default: None]                                   │
│ --input-device-name         TEXT     Device name keywords for partial matching.        │
│                                      [default: None]                                   │
│ --list-devices                       List available audio input and output devices and │
│                                      exit.                                             │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ ASR (Audio) Configuration: Wyoming (local) ───────────────────────────────────────────╮
│ --asr-wyoming-ip          TEXT     Wyoming ASR server IP address. [default: localhost] │
│ --asr-wyoming-port        INTEGER  Wyoming ASR server port. [default: 10300]           │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ ASR (Audio) Configuration: OpenAI ────────────────────────────────────────────────────╮
│ --asr-openai-model        TEXT  The OpenAI model to use for ASR (transcription).       │
│                                 [default: whisper-1]                                   │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ LLM Configuration: Ollama (local) ────────────────────────────────────────────────────╮
│ --llm-ollama-model        TEXT  The Ollama model to use. Default is qwen3:4b.          │
│                                 [default: qwen3:4b]                                    │
│ --llm-ollama-host         TEXT  The Ollama server host. Default is                     │
│                                 http://localhost:11434.                                │
│                                 [default: http://localhost:11434]                      │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ LLM Configuration: OpenAI ────────────────────────────────────────────────────────────╮
│ --llm-openai-model        TEXT  The OpenAI model to use for LLM tasks.                 │
│                                 [default: gpt-4o-mini]                                 │
│ --openai-api-key          TEXT  Your OpenAI API key. Can also be set with the          │
│                                 OPENAI_API_KEY environment variable.                   │
│                                 [env var: OPENAI_API_KEY]                              │
│                                 [default: None]                                        │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ LLM Configuration: Gemini ────────────────────────────────────────────────────────────╮
│ --llm-gemini-model        TEXT  The Gemini model to use for LLM tasks.                 │
│                                 [default: gemini-2.5-flash]                            │
│ --gemini-api-key          TEXT  Your Gemini API key. Can also be set with the          │
│                                 GEMINI_API_KEY environment variable.                   │
│                                 [env var: GEMINI_API_KEY]                              │
│                                 [default: None]                                        │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ TTS (Text-to-Speech) Configuration ───────────────────────────────────────────────────╮
│ --tts                    --no-tts             Enable text-to-speech for responses.     │
│                                               [default: no-tts]                        │
│ --output-device-index                INTEGER  Index of the PyAudio output device to    │
│                                               use for TTS.                             │
│                                               [default: None]                          │
│ --output-device-name                 TEXT     Output device name keywords for partial  │
│                                               matching.                                │
│                                               [default: None]                          │
│ --tts-speed                          FLOAT    Speech speed multiplier (1.0 = normal,   │
│                                               2.0 = twice as fast, 0.5 = half speed).  │
│                                               [default: 1.0]                           │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ TTS (Text-to-Speech) Configuration: Wyoming (local) ──────────────────────────────────╮
│ --tts-wyoming-ip              TEXT     Wyoming TTS server IP address.                  │
│                                        [default: localhost]                            │
│ --tts-wyoming-port            INTEGER  Wyoming TTS server port. [default: 10200]       │
│ --tts-wyoming-voice           TEXT     Voice name to use for Wyoming TTS (e.g.,        │
│                                        'en_US-lessac-medium').                         │
│                                        [default: None]                                 │
│ --tts-wyoming-language        TEXT     Language for Wyoming TTS (e.g., 'en_US').       │
│                                        [default: None]                                 │
│ --tts-wyoming-speaker         TEXT     Speaker name for Wyoming TTS voice.             │
│                                        [default: None]                                 │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ TTS (Text-to-Speech) Configuration: OpenAI ───────────────────────────────────────────╮
│ --tts-openai-model        TEXT  The OpenAI model to use for TTS. [default: tts-1]      │
│ --tts-openai-voice        TEXT  The voice to use for OpenAI TTS. [default: alloy]      │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ TTS (Text-to-Speech) Configuration: Kokoro ───────────────────────────────────────────╮
│ --tts-kokoro-model        TEXT  The Kokoro model to use for TTS. [default: kokoro]     │
│ --tts-kokoro-voice        TEXT  The voice to use for Kokoro TTS. [default: af_sky]     │
│ --tts-kokoro-host         TEXT  The base URL for the Kokoro API.                       │
│                                 [default: http://localhost:8880/v1]                    │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ TTS (Text-to-Speech) Configuration: Piper ────────────────────────────────────────────╮
│ --tts-piper-host                 TEXT     The base URL for the Piper HTTP server.      │
│                                           [default: http://localhost:10200]            │
│ --tts-piper-voice                TEXT     The voice to use for Piper TTS (optional).   │
│                                           [default: None]                              │
│ --tts-piper-speaker              TEXT     The speaker to use for multi-speaker voices  │
│                                           (optional).                                  │
│                                           [default: None]                              │
│ --tts-piper-speaker-id           INTEGER  The speaker ID to use for multi-speaker      │
│                                           voices (optional, overrides speaker).        │
│                                           [default: None]                              │
│ --tts-piper-length-scale         FLOAT    Speaking speed (1.0 = normal speed).         │
│                                           [default: 1.0]                               │
│ --tts-piper-noise-scale          FLOAT    Speaking variability (optional).             │
│                                           [default: None]                              │
│ --tts-piper-noise-w-scale        FLOAT    Phoneme width variability (optional).        │
│                                           [default: None]                              │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ Process Management Options ───────────────────────────────────────────────────────────╮
│ --stop            Stop any running background process.                                 │
│ --status          Check if a background process is running.                            │
│ --toggle          Toggle the background process on/off. If the process is running, it  │
│                   will be stopped. If the process is not running, it will be started.  │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ History Options ──────────────────────────────────────────────────────────────────────╮
│ --history-dir            PATH     Directory to store conversation history.             │
│                                   [default: ~/.config/agent-cli/history]               │
│ --last-n-messages        INTEGER  Number of messages to include in the conversation    │
│                                   history. Set to 0 to disable history.                │
│                                   [default: 50]                                        │
╰────────────────────────────────────────────────────────────────────────────────────────╯
╭─ General Options ──────────────────────────────────────────────────────────────────────╮
│ --save-file           PATH  Save TTS response audio to WAV file. [default: None]       │
│ --log-level           TEXT  Set logging level. [default: WARNING]                      │
│ --log-file            TEXT  Path to a file to write logs to. [default: None]           │
│ --quiet       -q            Suppress console output from rich.                         │
│ --config              TEXT  Path to a TOML configuration file. [default: None]         │
│ --print-args                Print the command line arguments, including variables      │
│                             taken from the configuration file.                         │
╰────────────────────────────────────────────────────────────────────────────────────────╯

Development

Running Tests

The project uses pytest for testing. To run tests using uv:

uv run pytest

Pre-commit Hooks

This project uses pre-commit hooks (ruff for linting and formatting, mypy for type checking) to maintain code quality. To set them up:

Install pre-commit:
```
pip install pre-commit
```
Install the hooks:
```
pre-commit install
```
Now, the hooks will run automatically before each commit.

Contributing

Contributions are welcome! If you find a bug or have a feature request, please open an issue. If you'd like to contribute code, please fork the repository and submit a pull request.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 340 Commits
.github		.github
agent_cli		agent_cli
docker		docker
docs/installation		docs/installation
scripts		scripts
tests		tests
.cursorrules		.cursorrules
.env.example		.env.example
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
README.md		README.md
example.agent-cli-config.toml		example.agent-cli-config.toml
pyproject.toml		pyproject.toml
shell.nix		shell.nix

Uh oh!

License

Uh oh!

sbalk/agent-cli

Folders and files

Latest commit

History

Repository files navigation

Agent CLI

Why I built this

Features

Quick Start

Just want the CLI tool?

Want automatic setup with everything?

Option A: Using Shell Scripts (Traditional)

Option B: Using CLI Commands (New!)

Installation

Option 1: CLI Tool Only

Option 2: Automated Full Setup

Step 1: Clone the Repository

Step 2: Run the Setup Script

Step 3: Start All Services

Step 4: Test Your Installation

System Integration

macOS Hotkeys

Linux Hotkeys

Prerequisites

What You Need to Install Manually

What the Setup Scripts Install for You

Core Requirements (Auto-installed)

AI Services (Auto-installed and configured)

Alternative Cloud Services (Optional)

Usage

Installation Commands

Configuration

Service Provider

autocorrect

transcribe

speak

voice-edit

assistant

chat

Development

Running Tests

Pre-commit Hooks

Contributing

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

`autocorrect`

`transcribe`

`speak`

`voice-edit`

`assistant`

`chat`

Packages