

Whisprly

AI-powered speech-to-text that transcribes wherever your cursor is on your computer


Transform your voice into text instantly, anywhere on your screen


✨ Features

  • Real-time transcription - Hold F1, speak, release, done!
  • Lightning fast - Powered by Groq's Whisper Large v3 Turbo
  • System-wide - Works in any application
  • Customizable - Configure hotkeys and settings
  • Windows optimized - Native Windows experience
  • Lightweight - Minimal resource usage

📥 Quick Download

Want to try it right now?

Download the latest release - Just download Whisprly.exe and run it!

🚀 Quick Start

Option 1: Use the Pre-built Executable

  1. Download the latest Whisprly.exe from releases
  2. Get your API key from https://console.groq.com/keys
  3. Run Whisprly.exe and enter your API key in settings
  4. Start transcribing! Hold F1 to record, then release it to transcribe

Option 2: Run from Source

# Clone the repository
git clone https://github.com/plfavreau/whisprly.git
cd whisprly

# Set up environment
uv venv
.venv\Scripts\activate  # Windows
# source .venv/bin/activate  # Linux/Mac

# Install dependencies
uv sync

# Configure
cp .env.example .env  # Windows cmd: copy .env.example .env
# Add your GROQ_API_KEY to .env

# Run
python main.py

🔨 Build Your Own Executable

Want to create your own Whisprly.exe? It's super easy!

# Build the executable
uv run python build.py

Your new Whisprly.exe will be in the dist/ folder!

🎯 How to Use

Action   Shortcut        Description
Record   Hold F1         Start recording your voice
Stop     Release F1      Stop recording and transcribe
Exit     Ctrl + Alt + X  Close the application

Pro tip: The transcription appears wherever your cursor is - works in any app!
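
For the curious, here is a minimal sketch of the hold-to-record flow described in the table above. It is not Whisprly's actual code: the PushToTalk class, the pcm_to_wav helper, and the transcribe and type_text callbacks are hypothetical names made up for illustration, and 16-bit samples are assumed. The real app would wire these to a keyboard hook, a microphone stream, the Groq API, and simulated typing.

```python
import io
import wave
from typing import Callable, List


def pcm_to_wav(pcm: bytes, sample_rate: int = 16000, channels: int = 1) -> bytes:
    """Wrap raw PCM samples in an in-memory WAV container."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(channels)
        wav.setsampwidth(2)  # assumption: 16-bit samples
        wav.setframerate(sample_rate)
        wav.writeframes(pcm)
    return buf.getvalue()


class PushToTalk:
    """Hypothetical controller: record while the key is held, transcribe on release."""

    def __init__(
        self,
        transcribe: Callable[[bytes], str],
        type_text: Callable[[str], None],
    ) -> None:
        self._transcribe = transcribe  # stand-in for the speech-to-text call
        self._type_text = type_text    # stand-in for typing at the cursor
        self._chunks: List[bytes] = []
        self.recording = False

    def on_press(self) -> None:
        # Holding a key usually triggers auto-repeat; ignore repeated presses.
        if self.recording:
            return
        self._chunks.clear()
        self.recording = True

    def on_audio(self, chunk: bytes) -> None:
        # Keep audio only while the record key is held down.
        if self.recording:
            self._chunks.append(chunk)

    def on_release(self) -> None:
        if not self.recording:
            return
        self.recording = False
        pcm = b"".join(self._chunks)
        if not pcm:
            return  # nothing captured, nothing to send
        text = self._transcribe(pcm_to_wav(pcm))
        if text:
            self._type_text(text)


# Usage with stand-in callbacks instead of a real microphone and API
typed: List[str] = []
ptt = PushToTalk(transcribe=lambda wav_bytes: "hello world", type_text=typed.append)
ptt.on_press()
ptt.on_audio(b"\x00\x00" * 160)  # 10 ms of silence at 16 kHz mono
ptt.on_release()
print(typed)
```

Passing the transcription and typing steps in as callbacks keeps the press/release logic testable without audio hardware or an API key.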

Configuration

Customize your experience by editing the .env file:

# API Configuration
GROQ_API_KEY=your_api_key_here

# Hotkey Settings
RECORD_KEY=f1
EXIT_KEY=ctrl+alt+x

# Audio Settings
SAMPLE_RATE=16000
CHANNELS=1
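
As a rough illustration of how these values could be read and validated, here is a small stdlib-only sketch. It is not taken from the Whisprly source (which may well use a dotenv library instead): parse_env, Settings, and load_settings are hypothetical names, and treating the sample values above as defaults is an assumption.

```python
from dataclasses import dataclass
from typing import Dict, List


def parse_env(text: str) -> Dict[str, str]:
    """Parse KEY=VALUE lines, skipping blank lines and # comments."""
    values: Dict[str, str] = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip().strip("\"'")
    return values


@dataclass
class Settings:
    groq_api_key: str
    record_key: str = "f1"          # assumed default, from the sample above
    exit_key: str = "ctrl+alt+x"    # assumed default, from the sample above
    sample_rate: int = 16000
    channels: int = 1

    @property
    def exit_combo(self) -> List[str]:
        """Split a combo such as 'ctrl+alt+x' into its individual keys."""
        return [part.strip().lower() for part in self.exit_key.split("+")]


def load_settings(env: Dict[str, str]) -> Settings:
    api_key = env.get("GROQ_API_KEY", "")
    if not api_key or api_key == "your_api_key_here":
        raise ValueError("GROQ_API_KEY is missing or still the placeholder")
    return Settings(
        groq_api_key=api_key,
        record_key=env.get("RECORD_KEY", "f1").lower(),
        exit_key=env.get("EXIT_KEY", "ctrl+alt+x").lower(),
        sample_rate=int(env.get("SAMPLE_RATE", "16000")),
        channels=int(env.get("CHANNELS", "1")),
    )


# Usage: only some keys are set, the rest fall back to the assumed defaults
sample = """
# API Configuration
GROQ_API_KEY=gsk_example
RECORD_KEY=f2
SAMPLE_RATE=16000
"""
settings = load_settings(parse_env(sample))
print(settings.record_key, settings.exit_combo, settings.sample_rate)
```

Failing early on a missing or placeholder key gives a clearer error than a rejected API request later on.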

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

  • Groq provides the Whisper Large v3 Turbo API
  • OpenAI provides the original Whisper model
  • PyQt6 provides the Python UI framework

Made with ❤️ by plfavreau

Star this repo if you find it useful!
