Top 23 Python Tt Projects

Real-Time-Voice-Cloning

1 100 58,832 2.9 Python

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Project mention: CorentinJ: Real-Time Voice Cloning | news.ycombinator.com | 2025-09-14
InfluxDB

www.influxdata.com featured

InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
GPT-SoVITS

2 2 52,168 9.5 Python

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
TTS

3 243 43,441 8.1 Python

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Project mention: 2025 Voice AI Guide: How to Make Your Own Real-Time Voice Agent (Part-1) | dev.to | 2025-09-20

XTTS-v2 — Zero-shot voice cloning, 17 languages, streaming support
ChatTTS

4 4 38,144 7.7 Python

A generative speech model for daily dialogue.
MockingBird

5 9 36,745 5.6 Python

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
OpenVoice

6 18 35,415 5.4 Python

Instant voice cloning by MIT and MyShell. Audio foundation model.

Project mention: 5 must know open-source repositories to build cool AI apps | dev.to | 2025-10-29

Star the Open Voice repository ⭐
fish-speech

7 7 24,035 8.5 Python

SOTA Open Source TTS

Project mention: 2025 Voice AI Guide: How to Make Your Own Real-Time Voice Agent (Part-1) | dev.to | 2025-09-20

FishSpeech — Natural dialogue flow
Stream

getstream.io featured

Stream - Scalable APIs for Chat, Feeds, Moderation, & Video. Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
NeMo

8 31 16,065 9.9 Python

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Project mention: FFmpeg 8.0 adds Whisper support | news.ycombinator.com | 2025-08-13

git clone https://github.com/NVIDIA/NeMo.git nemo
ebook2audiobook

9 2 15,342 10.0 Python

Generate audiobooks from e-books, voice cloning & 1107+ languages!

Project mention: Generating audiobooks from E-books with Kokoro-82M | news.ycombinator.com | 2025-01-15
index-tts

10 1 15,250 9.3 Python

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Project mention: IndexTTS2 Comprehensive Review: In-Depth Analysis of 2025's Most Powerful Emotional Speech Synthesis Model | dev.to | 2025-09-11

# 1. Clone repository git clone https://github.com/index-tts/index-tts.git cd index-tts # 2. Install dependencies uv sync --all-extras # 3. Download model hf download IndexTeam/IndexTTS-2 --local-dir=checkpoints # 4. Launch web interface uv run webui.py
PaddleSpeech

11 6 12,343 8.4 Python

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
edge-tts

12 9 9,345 7.9 Python

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

Project mention: Show HN: Voice Cloning and Multilingual TTS in One Click (Windows) | news.ycombinator.com | 2025-01-26

There is a MIT license in the repo. In that sense it's open source.
It's using "Edge TTS", which I believe means use API keys stolen [1] from Microsoft Edge and hope Microsoft doesn't sue you, non jolly-roger flying internet users beware.
Can't speak to other models and their licenses, I stopped looking after I saw this since I don't feel the need to use this.
[1] https://github.com/rany2/edge-tts/blob/ac41fb85ab2b2b48fef8a...
EmotiVoice

13 5 8,367 7.9 Python

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
VALL-E-X

14 2 7,965 8.8 Python

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
vits

15 6 7,654 0.0 Python

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
StyleTTS2

16 7 6,033 7.7 Python

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Orpheus-TTS

17 6 5,716 8.8 Python

Towards Human-Sounding Speech

Project mention: Microsoft VibeVoice: A Frontier Open-Source Text-to-Speech Model | news.ycombinator.com | 2025-09-03

Probably not even the best ones, but among some recent models I find Dia and Orpheus more natural
- http://dia-tts.com/
- https://github.com/canopyai/Orpheus-TTS
voice-pro

18 11 5,015 8.0 Python

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.

Project mention: Voice-Pro: Ultimate AI Voice Conversion and Multilingual Translation Tool 🔊 | dev.to | 2025-02-10

GitHub: https://github.com/abus-aikorea/voice-pro
DiffSinger

19 1 4,651 2.1 Python

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
metavoice-src

20 5 4,191 7.8 Python

Foundational model for human-like, expressive TTS
TensorFlowTTS

21 6 3,982 0.0 Python

:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
Kokoro-FastAPI

22 2 3,930 9.2 Python

Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching

Project mention: Microsoft VibeVoice: A Frontier Open-Source Text-to-Speech Model | news.ycombinator.com | 2025-09-03

I'm using Kokoro via https://github.com/remsky/Kokoro-FastAPI. It has a `generate_audio_from_phonemes()` endpoint that I'm sure maps to the Kokoro library if you want to use it directly.
My usage is for Chinese, but the phonemes it generated looked very much like IPA.
abogen

23 2 3,840 9.5 Python

Generate audiobooks from EPUBs, PDFs and text with synchronized captions.

Project mention: Abogen – Generate audiobooks from EPUBs, PDFs and text | news.ycombinator.com | 2025-08-09

It's probably due to the unusual sound format, 24kHz PCM, and the fact that it was somehow forced into a WebM container, which only supports the Vorbis and Opus formats.
It looks like they created it using the "higher quality" ffmpeg command line, except for the "webm" final extension, producing the opposite of what's described as "an MP4 file that's compatible with more devices".
https://github.com/denizsafak/abogen/tree/main/demo#for-high...
SaaSHub

www.saashub.com featured

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python Tts discussion

Python Tts related posts

Neural audio codecs: how to get audio into LLMs

3 projects | news.ycombinator.com | 21 Oct 2025
2025 Voice AI Guide: How to Make Your Own Real-Time Voice Agent (Part-1)

7 projects | dev.to | 20 Sep 2025
CorentinJ: Real-Time Voice Cloning

3 projects | news.ycombinator.com | 14 Sep 2025
Microsoft VibeVoice: A Frontier Open-Source Text-to-Speech Model

9 projects | news.ycombinator.com | 3 Sep 2025
Abogen – Generate audiobooks from EPUBs, PDFs and text

10 projects | news.ycombinator.com | 9 Aug 2025
Kitten TTS: 25MB CPU-Only, Open-Source Voice Model

19 projects | news.ycombinator.com | 5 Aug 2025
Build Your Own Clone: Best Open-Source AI Tools

3 projects | dev.to | 27 Jun 2025
A note from our sponsor - InfluxDB
www.influxdata.com | 16 Nov 2025

InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now. Learn more →

Index

What are some of the best open-source Tt projects in Python? This list will help you:

#	Project	Stars
1	Real-Time-Voice-Cloning	58,832
2	GPT-SoVITS	52,168
3	TTS	43,441
4	ChatTTS	38,144
5	MockingBird	36,745
6	OpenVoice	35,415
7	fish-speech	24,035
8	NeMo	16,065
9	ebook2audiobook	15,342
10	index-tts	15,250
11	PaddleSpeech	12,343
12	edge-tts	9,345
13	EmotiVoice	8,367
14	VALL-E-X	7,965
15	vits	7,654
16	StyleTTS2	6,033
17	Orpheus-TTS	5,716
18	voice-pro	5,015
19	DiffSinger	4,651
20	metavoice-src	4,191
21	TensorFlowTTS	3,982
22	Kokoro-FastAPI	3,930
23	abogen	3,840

Python Tts

Top 23 Python Tt Projects

Python Tts discussion

Python Tts related posts

Neural audio codecs: how to get audio into LLMs

2025 Voice AI Guide: How to Make Your Own Real-Time Voice Agent (Part-1)

CorentinJ: Real-Time Voice Cloning

Microsoft VibeVoice: A Frontier Open-Source Text-to-Speech Model

Abogen – Generate audiobooks from EPUBs, PDFs and text

Kitten TTS: 25MB CPU-Only, Open-Source Voice Model

Build Your Own Clone: Best Open-Source AI Tools

Index

Did you know that Python is the 2nd most popular programming language based on number of references?

Did you know that Python is
the 2nd most popular programming language
based on number of references?