Thanks to visit codestin.com
Credit goes to www.libhunt.com

Python text-to-speech

Open-source Python projects categorized as text-to-speech

Top 23 Python text-to-speech Projects

text-to-speech
  1. GPT-SoVITS

    1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

  2. Stream

    Stream - Scalable APIs for Chat, Feeds, Moderation, & Video. Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.

    Stream logo
  3. TTS

    🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

    Project mention: 2025 Voice AI Guide: How to Make Your Own Real-Time Voice Agent (Part-1) | dev.to | 2025-09-20

    XTTS-v2 — Zero-shot voice cloning, 17 languages, streaming support

  4. ChatTTS

    A generative speech model for daily dialogue.

  5. MockingBird

    🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

  6. OpenVoice

    Instant voice cloning by MIT and MyShell. Audio foundation model.

    Project mention: 5 must know open-source repositories to build cool AI apps | dev.to | 2025-10-29

    Star the Open Voice repository ⭐

  7. dia

    A TTS model capable of generating ultra-realistic dialogue in one pass.

    Project mention: Kitten TTS: 25MB CPU-Only, Open-Source Voice Model | news.ycombinator.com | 2025-08-05

    The best open one I've found so far is Dia - https://github.com/nari-labs/dia - it has some limitations, but i think it's really impressive and I can run it on my laptop.

  8. index-tts

    An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

    Project mention: IndexTTS2 Comprehensive Review: In-Depth Analysis of 2025's Most Powerful Emotional Speech Synthesis Model | dev.to | 2025-09-11

    # 1. Clone repository git clone https://github.com/index-tts/index-tts.git cd index-tts # 2. Install dependencies uv sync --all-extras # 3. Download model hf download IndexTeam/IndexTTS-2 --local-dir=checkpoints # 4. Launch web interface uv run webui.py

  9. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

    InfluxDB logo
  10. pyvideotrans

    Translate the video from one language to another and add dubbing.

  11. espnet

    End-to-End Speech Processing Toolkit

  12. Amphion

    Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

  13. edge-tts

    Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

    Project mention: Show HN: Voice Cloning and Multilingual TTS in One Click (Windows) | news.ycombinator.com | 2025-01-26

    There is a MIT license in the repo. In that sense it's open source.

    It's using "Edge TTS", which I believe means use API keys stolen [1] from Microsoft Edge and hope Microsoft doesn't sue you, non jolly-roger flying internet users beware.

    Can't speak to other models and their licenses, I stopped looking after I saw this since I don't feel the need to use this.

    [1] https://github.com/rany2/edge-tts/blob/ac41fb85ab2b2b48fef8a...

  14. EmotiVoice

    EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

  15. VALL-E-X

    An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/

  16. vits

    VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

  17. StyleTTS2

    StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

  18. voice-pro

    Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.

    Project mention: Voice-Pro: Ultimate AI Voice Conversion and Multilingual Translation Tool 🔊 | dev.to | 2025-02-10

    GitHub: https://github.com/abus-aikorea/voice-pro

  19. Awesome-Prompt-Engineering

    This repository contains a hand-curated resources for Prompt Engineering with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM etc

  20. DiffSinger

    DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

  21. metavoice-src

    Foundational model for human-like, expressive TTS

  22. TensorFlowTTS

    :stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

  23. abogen

    Generate audiobooks from EPUBs, PDFs and text with synchronized captions.

    Project mention: Abogen – Generate audiobooks from EPUBs, PDFs and text | news.ycombinator.com | 2025-08-09

    It's probably due to the unusual sound format, 24kHz PCM, and the fact that it was somehow forced into a WebM container, which only supports the Vorbis and Opus formats.

    It looks like they created it using the "higher quality" ffmpeg command line, except for the "webm" final extension, producing the opposite of what's described as "an MP4 file that's compatible with more devices".

    https://github.com/denizsafak/abogen/tree/main/demo#for-high...

  24. RealtimeTTS

    Converts text to speech in realtime

  25. WhisperLive

    A nearly-live implementation of OpenAI's Whisper.

    Project mention: FFmpeg 8.0 adds Whisper support | news.ycombinator.com | 2025-08-13

    You'll probably like Whisper Live and it's browser extensions: https://github.com/collabora/WhisperLive?tab=readme-ov-file#...

    Start playing a YouTube video in the browser, select "start recording" in the extension, and it starts writing subtitles in white text on a black background below the video. When you stop capturing you can download the subtitles as a standard .srt file.

  26. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python text-to-speech discussion

Log in or Post with

Python text-to-speech related posts

  • FFmpeg 8.0 adds Whisper support

    10 projects | news.ycombinator.com | 13 Aug 2025
  • Kitten TTS: 25MB CPU-Only, Open-Source Voice Model

    19 projects | news.ycombinator.com | 5 Aug 2025
  • Show HN: Automate final cut pro's XML language

    3 projects | news.ycombinator.com | 13 Jun 2025
  • Real-time Voice Chat at ~500ms Latency

    11 projects | news.ycombinator.com | 5 May 2025
  • Llasa: Llama-Based Speech Synthesis

    2 projects | news.ycombinator.com | 1 May 2025
  • Getting Started with ElevenLabs API

    1 project | dev.to | 29 Apr 2025
  • How to Run Dia-1.6B Locally: Your Ultimate Guide to Open Source TTS Freedom

    1 project | dev.to | 23 Apr 2025
  • A note from our sponsor - InfluxDB
    www.influxdata.com | 15 Nov 2025
    InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now. Learn more →

Index

What are some of the best open-source text-to-speech projects in Python? This list will help you:

# Project Stars
1 GPT-SoVITS 52,168
2 TTS 43,441
3 ChatTTS 38,144
4 MockingBird 36,745
5 OpenVoice 35,415
6 dia 18,792
7 index-tts 15,250
8 pyvideotrans 15,164
9 espnet 9,580
10 Amphion 9,496
11 edge-tts 9,345
12 EmotiVoice 8,367
13 VALL-E-X 7,965
14 vits 7,654
15 StyleTTS2 6,033
16 voice-pro 5,015
17 Awesome-Prompt-Engineering 4,976
18 DiffSinger 4,651
19 metavoice-src 4,191
20 TensorFlowTTS 3,982
21 abogen 3,820
22 RealtimeTTS 3,619
23 WhisperLive 3,574

Sponsored
Stream - Scalable APIs for Chat, Feeds, Moderation, & Video.
Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
getstream.io

Did you know that Python is
the 2nd most popular programming language
based on number of references?