AI-powered speech-to-text that transcribes wherever your cursor is on your computer
Transform your voice into text instantly, anywhere on your screen ✨
- Real-time transcription - Press F1, speak, release, done!
- Lightning fast - Powered by Groq's Whisper v3 Turbo
- System-wide - Works in any application
- Customizable - Configure hotkeys and settings
- Windows optimized - Native Windows experience
- Lightweight - Minimal resource usage
Want to try it right now?
Download the latest release - Just download Whisprly.exe and run it!
- Download the latest Whisprly.exe from releases
- Get your API key from https://console.groq.com/keys
- Run Whisprly.exe and enter your API key in settings
- Start transcribing! Press F1 to record
# Clone the repository
git clone https://github.com/plfavreau/whisprly.git
cd whisprly
# Set up environment
uv venv
.venv\Scripts\activate # Windows
# source .venv/bin/activate # Linux/Mac
# Install dependencies
uv sync
# Configure
cp .env.example .env
# Add your GROQ_API_KEY to .env
# Run
python main.py

Want to create your own Whisprly.exe? It's super easy!
# Build the executable
uv run python build.py

Your new Whisprly.exe will be in the dist/ folder!
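The contents of build.py are not shown in this README. As a rough sketch only, assuming the script wraps PyInstaller (an assumption, suggested by the dist/ output folder), a build step of this kind could look like the following. The function names `pyinstaller_args` and `build` are hypothetical.

```python
import subprocess
import sys


def pyinstaller_args(entry_script="main.py", app_name="Whisprly"):
    """Return PyInstaller CLI arguments for a single-file, console-less build."""
    return [
        "--onefile",    # bundle everything into one executable
        "--windowed",   # do not open a console window for a GUI app
        "--noconfirm",  # overwrite a previous build without prompting
        "--name", app_name,
        entry_script,
    ]


def build():
    """Run PyInstaller as a module (assumes it is installed in the active environment)."""
    command = [sys.executable, "-m", "PyInstaller", *pyinstaller_args()]
    subprocess.run(command, check=True)
```

With these arguments PyInstaller writes the executable to dist/ by default, which matches the location mentioned above. The real build.py may use different options.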
| Action | Shortcut | Description |
|---|---|---|
| Record | Hold F1 | Start recording your voice |
| Stop | Release F1 | Stop recording and transcribe |
| Exit | Ctrl + Alt + X | Close the application |
Pro tip: The transcription appears wherever your cursor is - works in any app!
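The hold-to-record flow in the table can be modeled as a small state machine. This is a minimal sketch, not Whisprly's actual code: the `PushToTalk` class and its callback names are hypothetical, and the callbacks stand in for whatever starts the microphone and sends audio for transcription. One detail worth handling is that the OS sends repeated key-press events while a key is held, so only the first press should start a recording.

```python
class PushToTalk:
    """Hold-to-record state machine: press starts, release stops and transcribes."""

    def __init__(self, record_key, start_recording, stop_and_transcribe):
        self.record_key = record_key
        self._start = start_recording
        self._stop = stop_and_transcribe
        self.recording = False

    def on_press(self, key):
        # Key auto-repeat fires many press events while the key is held;
        # only the first one may start a recording.
        if key == self.record_key and not self.recording:
            self.recording = True
            self._start()

    def on_release(self, key):
        # Ignore other keys and stray releases when nothing is being recorded.
        if key == self.record_key and self.recording:
            self.recording = False
            return self._stop()
        return None


def demo():
    """Simulate holding F1 (with auto-repeat), then releasing it."""
    log = []

    def stop():
        log.append("stop")
        return "hello world"  # placeholder for the transcribed text

    ptt = PushToTalk("f1", lambda: log.append("start"), stop)
    for _ in range(3):
        ptt.on_press("f1")
    text = ptt.on_release("f1")
    return log, text
```

In a real application, `on_press` and `on_release` would be wired to a global keyboard hook, and the returned text would be typed at the cursor position.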
Customize your experience by editing the .env file:
# API Configuration
GROQ_API_KEY=your_api_key_here
# Hotkey Settings
RECORD_KEY=f1
EXIT_KEY=ctrl+alt+x
# Audio Settings
SAMPLE_RATE=16000
CHANNELS=1

This project is licensed under the MIT License - see the LICENSE file for details.
- Groq provides the Whisper v3 Turbo API
- OpenAI provides the original Whisper model
- PyQt6 provides the Python UI framework
Made with ❤️ by plfavreau
Star this repo if you find it useful!