WebSocket terminal server with voice input support and Claude AI intelligent interaction.
- WebSocket Terminal - Real-time terminal web interface based on Axum framework
- Voice Input - Speech-to-text support for executing commands via voice
- Claude AI Integration - AI-assisted terminal interaction using
echokit_terminal - Multiple ASR Support
- OpenAI Whisper API
- Alibaba Cloud Paraformer real-time speech recognition (todo)
Create a .env file and configure the Whisper API (Groq recommended):
# Use Groq's Whisper API (recommended, fast)
WHISPER_API_KEY=your_groq_api_key_here
WHISPER_API_URL=https://api.groq.com/openai/v1/audio/transcriptions
WHISPER_MODEL=whisper-large-v3# Run directly with cargo
cargo run -- -- claude
# Or build and run
cargo build --release
./target/release/vibetty -- --claudeFor more options, use --help:
cargo run -- --helpVisit: http://localhost:3000 after starting the service.