Open Super Whisper

Simple desktop application for speech transcription with global hotkey control. Record, transcribe, and paste - all without switching applications.

Quick Start - Just 3 Steps!

Start Recording - Press the global hotkey (default: Ctrl+Shift+R) from any application
Stop Recording - Press the same hotkey again when you're done speaking
Paste Text - The transcription is automatically copied to your clipboard, just paste it wherever you need

That's it! No need to switch applications during your workflow.

Features

🎙️ Record audio directly from your microphone
🌎 Support for 100+ languages with automatic language detection
📝 Custom vocabulary support to improve transcription accuracy
🔧 System instructions for controlling transcription behavior
📋 Copy transcription to clipboard
🔄 Real-time recording status and timer

Available Models

Open Super Whisper supports the following AI transcription models:

Whisper-1 - OpenAI's original open-source Whisper model
GPT-4o Transcribe - High-performance transcription model offering superior accuracy
GPT-4o Mini Transcribe - Lightweight and fast transcription model with a good balance of speed and accuracy

Demo

Download

You can download the latest executable file (.exe) for Windows from our GitHub Releases page.

Requirements

OpenAI API key
Windows or macOS operating system

Installation

Using UV Package Manager

UV is a fast and efficient Python package installer and environment manager. It's faster than traditional pip and venv, and provides better dependency resolution.

Check if UV is installed:

uv --version

If not installed, you can install it with:

# Windows (PowerShell)
powershell -ExecutionPolicy ByPass -c "irm https://astral.sh/uv/install.ps1 | iex"

# macOS/Linux
curl -LsSf https://astral.sh/uv/install.sh | sh

Clone or download this repository
Set up the project using UV's sync command, which will create a virtual environment and install all dependencies:

uv sync

Activate the virtual environment:

# Windows (PowerShell)
.\.venv\Scripts\activate.ps1

# macOS/Linux
source .venv/bin/activate

Note: If you get a "execution of scripts is disabled on this system" error when using activate.ps1 in PowerShell, try one of these solutions:
Use Command Prompt (cmd.exe) and run .venv\Scripts\activate.bat instead
Run the following command in PowerShell to change the execution policy for the current session only:
Set-ExecutionPolicy -ExecutionPolicy RemoteSigned -Scope Process
Then run .\.venv\Scripts\activate.ps1
Run PowerShell as Administrator and change the execution policy for your user account (do this only if you understand the security implications):
Set-ExecutionPolicy -ExecutionPolicy RemoteSigned -Scope CurrentUser

Run the application:

python main.py

Building the Application

To create a standalone executable, you can use PyInstaller:

# Windows (PowerShell)
python -m PyInstaller --onefile --windowed --icon assets/icon.ico --name "OpenSuperWhisper" --add-data "assets;assets" main.py

# For macOS
python -m PyInstaller --onefile --windowed --icon assets/icon.icns --name "OpenSuperWhisper" --add-data "assets:assets" main.py

# For Linux
python -m PyInstaller --onefile --windowed --icon assets/linux_pngs/icon_256.png --name "OpenSuperWhisper" --add-data "assets:assets" main.py

The Windows command does the following:

--onefile: Creates a single executable file
--windowed: Prevents a console window from appearing
--icon assets/icon.ico: Sets the application icon
--name "OpenSuperWhisper": Specifies the output filename
--add-data "assets;assets": Includes the entire assets directory in the executable

Once the build is complete, you'll find OpenSuperWhisper.exe in the dist folder on Windows, OpenSuperWhisper.app in the dist folder on macOS, or OpenSuperWhisper in the dist folder on Linux.

Usage

Setting up your API Key

On first launch, you'll be prompted to enter your OpenAI API key
If you don't have an API key, you can get one from OpenAI's website
Your API key will be saved for future use
To change it later, click "API Key Settings" in the toolbar

Recording Audio

Click the "Start Recording" button to begin recording from your microphone
Click "Stop Recording" when you're done
The application will automatically transcribe your recording
You can also use the global hotkey (default: Ctrl+Shift+R) to start/stop recording even when the application is in the background

Using Global Hotkeys

The default hotkey is set to "Ctrl+Shift+R"
Pressing this hotkey will start/stop recording even when the application is in the background
To change the hotkey, click "Hotkey Settings" in the toolbar

Using the System Tray (Windows) or Menu Bar (macOS)

The application stays resident in your system tray (Windows) or menu bar (macOS)
Closing the window will keep the application running in the background
Click the system tray/menu bar icon to toggle the application's visibility
Right-click the system tray icon (Windows) or click the menu bar icon (macOS) to access a context menu with options to:
- Show the application
- Start/stop recording
- Completely exit the application

Language Selection

Select a language from the dropdown menu before recording or importing audio
Choose "Auto-detect" to let Whisper identify the language automatically

Model Selection

Select the Whisper model to use from the dropdown menu
Different models offer different balances of accuracy and processing speed
Your selected model will be remembered for future sessions

Custom Vocabulary

Click "Custom Vocabulary" in the toolbar
Add specific terms, names, or phrases that might appear in your audio
These terms will help improve transcription accuracy

System Instructions

Click "System Instructions" in the toolbar
Add specific instructions to control transcription behavior, such as:
- "Ignore filler words like um, uh, er"
- "Add proper punctuation"
- "Format text into paragraphs"
These instructions help refine transcription results without manual editing

Managing Transcriptions

View the transcription in the main text area
Edit the text if needed (the text area is editable)
Use the toolbar buttons to:
- Copy the transcription to clipboard

Other Settings

"Auto Copy" option: Toggle automatic copying of transcription to clipboard when completed

Command Line Options

The application supports the following command line arguments:

python main.py -m
# or
python main.py --minimized

Using the -m or --minimized option will start the application minimized to the system tray only, without showing the window.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgements

This application uses OpenAI's Whisper API for speech recognition
Built with PyQt6 for the user interface
Inspired by the Super Whisper desktop application

Name		Name	Last commit message	Last commit date
Latest commit History 63 Commits
assets		assets
demo		demo
src		src
.gitignore		.gitignore
.python-version		.python-version
LICENSE		LICENSE
README.ja.md		README.ja.md
README.md		README.md
main.py		main.py
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Open Super Whisper

Quick Start - Just 3 Steps!

Features

Available Models

Demo

Download

Requirements

Installation

Using UV Package Manager

Building the Application

Usage

Setting up your API Key

Recording Audio

Using Global Hotkeys

Using the System Tray (Windows) or Menu Bar (macOS)

Language Selection

Model Selection

Custom Vocabulary

System Instructions

Managing Transcriptions

Other Settings

Command Line Options

License

Acknowledgements

About

Uh oh!

Releases 1

Packages

Uh oh!

Languages

License

TakanariShimbo/open-super-whisper

Folders and files

Latest commit

History

Repository files navigation

Open Super Whisper

Quick Start - Just 3 Steps!

Features

Available Models

Demo

Download

Requirements

Installation

Using UV Package Manager

Building the Application

Usage

Setting up your API Key

Recording Audio

Using Global Hotkeys

Using the System Tray (Windows) or Menu Bar (macOS)

Language Selection

Model Selection

Custom Vocabulary

System Instructions

Managing Transcriptions

Other Settings

Command Line Options

License

Acknowledgements

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Languages

Packages