Important:
Running this directly requires a lot of configuration. The recommended way is to use the UI application, https://github.com/Sharrnah/whispering-ui, which downloads it automatically.
Standalone Release File (4.3 GB):
Download Server:
Changelog (v1.3.17.3)
- [FEATURE] Update PyTorch and flash-attn to versions that support NVIDIA 50x (Blackwell) GPUs
- [FEATURE] Add support for flash and normal Canary models
- [FEATURE] Add plugin settings reset function.
- [FEATURE] Add Orpheus TTS
- [FEATURE] Add audio streaming to Orpheus TTS without vLLM
- [FEATURE] Add delayed start audio streaming
- [FEATURE] Implement Voxtral (Speech-to-Text + Text-Translation + LLM)
- [TASK] Remove debug output
- [TASK] Clean up transformer whisper code
- [TASK] Add flash attention 2 to transformer whisper
- [TASK] Update Zonos library
- [TASK] Call plugin_tts_after_audio event for streamed playback
- [TASK] Improve Kokoro TTS split_pattern
- [TASK] Code cleanup
- [TASK] Add support for Parakeet model
- [TASK] Implement BitsAndBytes for Transformer whisper again
- [TASK] Add large distilled v3.5 english model to faster whisper
- [TASK] Add setting to synchronize TXT realtime with STT realtime when active
- [TASK] Update transformers dependency
- [BUGFIX] Fix internal event calling
- [BUGFIX] Fix Canary code for new NeMo library version
- [BUGFIX] Fix downloading MMS models
- [BUGFIX] Fix PyInstaller build spec
Full Changelog: v1.3.16.2...v1.3.17.3