InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now. Learn more →
Top 23 Python Tt Projects
-
-
InfluxDB
InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
-
-
Project mention: 2025 Voice AI Guide: How to Make Your Own Real-Time Voice Agent (Part-1) | dev.to | 2025-09-20
XTTS-v2 — Zero-shot voice cloning, 17 languages, streaming support
-
-
MockingBird
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
-
Star the Open Voice repository ⭐
-
Project mention: 2025 Voice AI Guide: How to Make Your Own Real-Time Voice Agent (Part-1) | dev.to | 2025-09-20
FishSpeech — Natural dialogue flow
-
Stream
Stream - Scalable APIs for Chat, Feeds, Moderation, & Video. Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
-
NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
git clone https://github.com/NVIDIA/NeMo.git nemo
-
Project mention: Generating audiobooks from E-books with Kokoro-82M | news.ycombinator.com | 2025-01-15
-
Project mention: IndexTTS2 Comprehensive Review: In-Depth Analysis of 2025's Most Powerful Emotional Speech Synthesis Model | dev.to | 2025-09-11
# 1. Clone repository git clone https://github.com/index-tts/index-tts.git cd index-tts # 2. Install dependencies uv sync --all-extras # 3. Download model hf download IndexTeam/IndexTTS-2 --local-dir=checkpoints # 4. Launch web interface uv run webui.py
-
PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
-
edge-tts
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
Project mention: Show HN: Voice Cloning and Multilingual TTS in One Click (Windows) | news.ycombinator.com | 2025-01-26There is a MIT license in the repo. In that sense it's open source.
It's using "Edge TTS", which I believe means use API keys stolen [1] from Microsoft Edge and hope Microsoft doesn't sue you, non jolly-roger flying internet users beware.
Can't speak to other models and their licenses, I stopped looking after I saw this since I don't feel the need to use this.
[1] https://github.com/rany2/edge-tts/blob/ac41fb85ab2b2b48fef8a...
-
-
VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
-
vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
-
StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
-
Project mention: Microsoft VibeVoice: A Frontier Open-Source Text-to-Speech Model | news.ycombinator.com | 2025-09-03
Probably not even the best ones, but among some recent models I find Dia and Orpheus more natural
- http://dia-tts.com/
- https://github.com/canopyai/Orpheus-TTS
-
voice-pro
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
Project mention: Voice-Pro: Ultimate AI Voice Conversion and Multilingual Translation Tool 🔊 | dev.to | 2025-02-10GitHub: https://github.com/abus-aikorea/voice-pro
-
DiffSinger
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
-
-
TensorFlowTTS
:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
-
Kokoro-FastAPI
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching
Project mention: Microsoft VibeVoice: A Frontier Open-Source Text-to-Speech Model | news.ycombinator.com | 2025-09-03I'm using Kokoro via https://github.com/remsky/Kokoro-FastAPI. It has a `generate_audio_from_phonemes()` endpoint that I'm sure maps to the Kokoro library if you want to use it directly.
My usage is for Chinese, but the phonemes it generated looked very much like IPA.
-
Project mention: Abogen – Generate audiobooks from EPUBs, PDFs and text | news.ycombinator.com | 2025-08-09
It's probably due to the unusual sound format, 24kHz PCM, and the fact that it was somehow forced into a WebM container, which only supports the Vorbis and Opus formats.
It looks like they created it using the "higher quality" ffmpeg command line, except for the "webm" final extension, producing the opposite of what's described as "an MP4 file that's compatible with more devices".
https://github.com/denizsafak/abogen/tree/main/demo#for-high...
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Python Tts discussion
Python Tts related posts
-
Neural audio codecs: how to get audio into LLMs
-
2025 Voice AI Guide: How to Make Your Own Real-Time Voice Agent (Part-1)
-
CorentinJ: Real-Time Voice Cloning
-
Microsoft VibeVoice: A Frontier Open-Source Text-to-Speech Model
-
Abogen – Generate audiobooks from EPUBs, PDFs and text
-
Kitten TTS: 25MB CPU-Only, Open-Source Voice Model
-
Build Your Own Clone: Best Open-Source AI Tools
-
A note from our sponsor - InfluxDB
www.influxdata.com | 16 Nov 2025
Index
What are some of the best open-source Tt projects in Python? This list will help you:
| # | Project | Stars |
|---|---|---|
| 1 | Real-Time-Voice-Cloning | 58,832 |
| 2 | GPT-SoVITS | 52,168 |
| 3 | TTS | 43,441 |
| 4 | ChatTTS | 38,144 |
| 5 | MockingBird | 36,745 |
| 6 | OpenVoice | 35,415 |
| 7 | fish-speech | 24,035 |
| 8 | NeMo | 16,065 |
| 9 | ebook2audiobook | 15,342 |
| 10 | index-tts | 15,250 |
| 11 | PaddleSpeech | 12,343 |
| 12 | edge-tts | 9,345 |
| 13 | EmotiVoice | 8,367 |
| 14 | VALL-E-X | 7,965 |
| 15 | vits | 7,654 |
| 16 | StyleTTS2 | 6,033 |
| 17 | Orpheus-TTS | 5,716 |
| 18 | voice-pro | 5,015 |
| 19 | DiffSinger | 4,651 |
| 20 | metavoice-src | 4,191 |
| 21 | TensorFlowTTS | 3,982 |
| 22 | Kokoro-FastAPI | 3,930 |
| 23 | abogen | 3,840 |