Simple CLI tool to transcribe audio files into text files.
It tries:
faster-whisper(CPU int8)- falls back to
openai-whisper
- Input audio path as argument
- Output transcript path configurable (
-o/--output) - Optional language override (
--language) - Engine selection (
--engine auto|faster-whisper|openai-whisper) - Segment timestamps included in output
python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txtpython transcribe_voice.py /path/to/audio.m4aCustom output:
python transcribe_voice.py /path/to/audio.m4a -o /path/to/transcript.txtForce engine/language:
python transcribe_voice.py /path/to/audio.m4a --engine faster-whisper --language de- Default output file is
transcript_<input_stem>.txtin the same folder as input. faster-whisperuses CPUint8by default.