Talkie-Codie

Voice to prompt, best for vibe coding

🎥 Demo Video

Watch the demo to see Talkie-Codie in action:

A cross-platform desktop application that converts your voice into optimized prompts using AI-powered speech recognition and language model enhancement.

Quick Start

Prerequisites

Python 3.10 or higher
Microphone access
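
The version requirement above can be checked before installing anything. This is a minimal sketch, not part of the project; the names `MIN_PYTHON` and `meets_minimum` are made up for illustration.

```python
import sys

# Minimum interpreter version, taken from the prerequisites above.
MIN_PYTHON = (3, 10)


def meets_minimum(version=None):
    """Return True if `version` (default: the running interpreter) is >= MIN_PYTHON."""
    if version is None:
        version = sys.version_info
    # Compare only (major, minor); patch level does not matter here.
    return tuple(version[:2]) >= MIN_PYTHON


if not meets_minimum():
    print("Talkie-Codie needs Python %d.%d or higher" % MIN_PYTHON)
```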

⚠️ Audio Dependencies Setup (IMPORTANT)

Before running the application, you may need to install system-level audio libraries.

If you encounter "PortAudio library not found" errors, please follow the audio setup guide:

📖 Audio Dependencies Setup Guide
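
As a rough pre-check, the standard library can report whether a system-wide PortAudio library is discoverable. This is only a sketch with a hypothetical helper name (`portaudio_available`). `sounddevice` may locate PortAudio differently (some of its wheels bundle the library), so a negative result here is a hint, not proof that the app will fail.

```python
from ctypes.util import find_library


def portaudio_available():
    """Return True if a system PortAudio shared library can be located."""
    # find_library returns a library name/path, or None when nothing is found.
    return find_library("portaudio") is not None


if portaudio_available():
    print("System PortAudio library found")
else:
    print("No system PortAudio found; see the audio setup guide if the app reports an error")
```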

Run Application

Optional: Create virtual environment

# Using conda
conda create -n talkie-codie python=3.10
conda activate talkie-codie

CUDA Support (Optional): If you have an NVIDIA GPU and want CUDA acceleration, manually install the PyTorch build that matches your GPU and CUDA version. For example, for a GPU with the sm_120 architecture, install the CUDA 12.8 wheels:
```
pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu128
```
Launch GUI (auto-installs dependencies)
```
python run_gui.py
```

Note: First startup may be slow due to model downloads. Configure your API key in the GUI settings.

Note: Without API configuration, the app will only use Whisper for audio-to-text conversion.
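
The fallback described in the note above can be pictured as follows. This is a sketch under assumptions, not the app's actual code: `build_output` and the `enhance` callback are hypothetical names standing in for whatever the application calls internally.

```python
from typing import Callable, Optional


def build_output(
    transcript: str,
    api_key: Optional[str],
    enhance: Callable[[str, str], str],
) -> str:
    """Return the LLM-enhanced prompt if an API key is set, else the raw transcript."""
    if not api_key:
        # No API configuration: Whisper transcription only.
        return transcript
    return enhance(transcript, api_key)


def fake_enhance(text: str, key: str) -> str:
    """Stand-in for a real LLM call, used only to demonstrate the control flow."""
    return "Optimized prompt: " + text


print(build_output("add a login page", None, fake_enhance))
print(build_output("add a login page", "my-key", fake_enhance))
```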

Configuration

All settings can be configured through the GUI:

LLM Provider & API Key: OpenAI or DeepSeek
Whisper Settings: Device, model size, compute type
Audio Device: Select your microphone
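
For orientation, the settings listed above could be modeled like this. The class, field names, and default values are assumptions for illustration only; the app's real configuration format is not documented here.

```python
from dataclasses import dataclass
from typing import List, Optional

# Providers named in this README.
SUPPORTED_PROVIDERS = {"openai", "deepseek"}


@dataclass
class Settings:
    """Hypothetical container mirroring the options exposed in the GUI."""

    llm_provider: str = "openai"
    api_key: str = ""                    # empty means transcription only
    whisper_device: str = "cpu"          # e.g. "cpu" or "cuda"
    whisper_model_size: str = "base"     # e.g. "tiny", "base", "small"
    whisper_compute_type: str = "int8"   # e.g. "int8" or "float16"
    audio_device: Optional[str] = None   # None means the system default microphone

    def validate(self) -> List[str]:
        """Return a list of human-readable problems; an empty list means OK."""
        problems = []
        if self.llm_provider.lower() not in SUPPORTED_PROVIDERS:
            problems.append("Unknown LLM provider: " + self.llm_provider)
        return problems


print(Settings().validate())
print(Settings(llm_provider="other").validate())
```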

Command Line Mode

python src/main.py

Usage

Select Input Device: Choose your microphone from the dropdown
Start Recording: Click the record button to begin voice capture
View Results: See your transcribed text and AI-enhanced prompt
Copy Output: Use the copy button to copy the optimized prompt
Settings: Access configuration options via the settings button
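
The device dropdown in the first step presumably lists only devices that can record. Here is a sketch of that filtering, assuming device entries shaped like the dictionaries returned by `sounddevice.query_devices()`. The helper name `input_devices` is hypothetical, and the sample data is invented so the example runs without audio hardware.

```python
def input_devices(devices):
    """Return (index, name) pairs for devices that have at least one input channel."""
    return [
        (index, dev["name"])
        for index, dev in enumerate(devices)
        if dev.get("max_input_channels", 0) > 0
    ]


# Invented sample data in the same shape as sounddevice's device entries.
sample = [
    {"name": "Built-in Microphone", "max_input_channels": 2, "max_output_channels": 0},
    {"name": "Built-in Speakers", "max_input_channels": 0, "max_output_channels": 2},
    {"name": "USB Headset", "max_input_channels": 1, "max_output_channels": 2},
]
print(input_devices(sample))

# With sounddevice installed, real data would come from:
#   import sounddevice as sd
#   input_devices(sd.query_devices())
```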

Cache Management

# View cache information
python scripts/clear_cache.py info

# Clear cache (with confirmation)
python scripts/clear_cache.py clear

# Force clear cache
python scripts/clear_cache.py clear-force
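
To give a sense of what a cache "info" report might compute, here is a sketch that totals the size of a directory. It is not the code in `scripts/clear_cache.py`; the function name is hypothetical, and the cache location is left as a parameter because this README does not state it.

```python
from pathlib import Path


def cache_size_bytes(cache_dir):
    """Return the total size in bytes of all regular files under cache_dir."""
    root = Path(cache_dir)
    if not root.is_dir():
        # A missing cache directory is treated as an empty cache.
        return 0
    return sum(p.stat().st_size for p in root.rglob("*") if p.is_file())


# Example call on the current directory; substitute the real cache path.
print(cache_size_bytes("."), "bytes")
```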

Dependencies

PyQt6: Modern GUI framework
sounddevice: Audio recording and playback
faster-whisper: Speech-to-text transcription
scipy/numpy: Scientific computing
requests: HTTP client for API calls

Troubleshooting

Common Issues

No audio input detected
- Check microphone permissions
- Verify device selection in settings
- Ensure microphone is not muted

LLM API errors
- Verify API key is correct
- Check internet connection
- Ensure sufficient API credits

Whisper model download issues
- Check internet connection
- Verify sufficient disk space
- Try different model size in settings

Note: This application requires an active internet connection for LLM API calls and initial Whisper model downloads.
