This project is an independently maintained fork of the original MeloTTS by Wenliang Zhao, Xumin Yu, and Zengyi Qin.
The original work is licensed under the MIT License, and we thank the authors for their excellent research and contributions.
While the original MeloTTS is an impressive research project, this fork focuses on making it simple to run and integrate — with a working Docker image, included UI, and API support.
It’s designed so that you can:
- Pull the Docker image
- Run it instantly
- Start synthesizing speech via UI or API without hunting down dependencies
It is not a production-hardened system and may require additional work for deployment in critical environments.
✅ Offline Mode: Supported — provided that models are baked into the Docker image or mounted via a volume.
If running in a fully offline environment, ensure all required model files are available locally before starting the container.
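For a fully offline setup with externally stored models, a volume mount is the simplest route. The sketch below assumes the image honors the `MELOTTS_MODELS` variable introduced in v0.0.4 (see the changelog below); the `/models` mount point inside the container is an illustrative choice, not a fixed path.

```bash
# Sketch: run offline with models mounted from the host.
# Assumes MELOTTS_MODELS (see the v0.0.4 notes) points the app at the mounted folder.
docker run -p 8888:8888 --gpus all \
  -v /path/to/local/models:/models \
  -e MELOTTS_MODELS=/models \
  sensejworld/melotts:latest
```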
🤝 Contributions Welcome: If you find bugs, have ideas, or want to improve things, feel free to submit issues or pull requests. Every bit of help makes this project better for everyone.
If you encounter bugs, have feature requests, or need help using MeloTTS:
- Please open a new GitHub Issue with as much detail as possible
- Include error messages, logs, and reproduction steps if applicable
- For general questions or ideas, you can also use the Discussions tab
```bash
docker run -p 8888:8888 --gpus all sensejworld/melotts:latest
```

Then open: http://localhost:8888

```bash
curl -X POST "http://localhost:8888/api/tts" -F "text=Hello world!" -F "language=EN" -o output.wav
```
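The same endpoint can also be exercised with another language code. This is only a sketch: the `JP` value mirrors the `TTS_LANGUAGES=EN,JP` example in the v0.0.2 notes below, and it is an assumption that the API accepts it in the `language` field.

```bash
# Sketch: synthesize Japanese speech, assuming the API accepts the JP code
# shown in the TTS_LANGUAGES examples further down.
curl -X POST "http://localhost:8888/api/tts" \
  -F "text=こんにちは、世界" \
  -F "language=JP" \
  -o output_jp.wav
```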
- Pinned dependencies for reproducible builds
- Preloaded models for instant offline use (optional)
- GPU acceleration when available
- HTTP API + web UI in one container
You can explore all available MeloTTS container images on Docker Hub.
This is useful if you want to:
- Select a specific version of MeloTTS for compatibility (see the pull example after this list)
- Check the latest available builds before pulling
- Verify image tags for deployment
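For example, pinning a deployment to a known tag instead of `latest` is just a matter of pulling that tag explicitly (v0.0.4 is used here only because it is the newest version listed in the changelog below):

```bash
# Pull and run a specific tagged build instead of latest
docker pull sensejworld/melotts:v0.0.4
docker run -p 8888:8888 --gpus all sensejworld/melotts:v0.0.4
```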
- Add V2 models
- Add V3 models
- Create a new repo (Melotts-base) with an image containing the models, so builds have more space in the future
- Dependency updates for improved performance and stability.
- Full offline support — all required models are now baked into the image.
- Model overwrite option: set `MELOTTS_MODELS` to point to your custom model folder.
- Smaller image size via optimized multi-stage Docker build.
- Run with: `docker run -p 8888:8888 --gpus all sensejworld/melotts:v0.0.4`
- Optimized the Docker build to use layer caching, so rebuilds after the initial build are fast
- Expanded ping to include version and build
- Expanded UI with `sdp_ratio`, `noise_scale` and `noise_scale_w`
- Expanded API with `sdp_ratio`, `noise_scale` and `noise_scale_w` (see the example request below)
- Corrected faulty version dates
- Updated documentation
- Run with: `docker run -p 8888:8888 --gpus all sensejworld/melotts:v0.0.3`
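A hedged example of the expanded v0.0.3 API call: the sketch below passes `sdp_ratio`, `noise_scale`, and `noise_scale_w` as extra form fields alongside the fields from the quick-start example. The values are placeholders, and the form-field style is an assumption based on the `text`/`language` call shown earlier.

```bash
# Sketch: API request with the v0.0.3 synthesis parameters.
# Field names come from the changelog; passing them as form fields is an assumption.
curl -X POST "http://localhost:8888/api/tts" \
  -F "text=Hello world!" \
  -F "language=EN" \
  -F "sdp_ratio=0.2" \
  -F "noise_scale=0.6" \
  -F "noise_scale_w=0.8" \
  -o output.wav
```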
- Enabled API calls together with the UI
- Run with: `docker run -p 8888:8888 --gpus all sensejworld/melotts:v0.0.2`
- Run for English only: `docker run -p 8888:8888 -e TTS_LANGUAGES=EN sensejworld/melotts:v0.0.2`
- Run for English and Japanese: `docker run -p 8888:8888 -e TTS_LANGUAGES=EN,JP sensejworld/melotts:v0.0.2`
- Run for English with GPU support, as a container named `melotts_gpu_en`: `docker run -p 8888:8888 --gpus all -e TTS_LANGUAGES=EN --name melotts_gpu_en sensejworld/melotts:v0.0.2`
- Initial release
- Basic TTS functionality
- Support for English (Default, US, BR, India, AU)
- Docker support for both CPU and GPU
- Web interface on port 8888 (http://localhost:8888/)
- Pull with: `docker pull sensejworld/melotts:v0.0.1`
If you’re interested in building MeloTTS locally, testing changes, or working directly on the codebase, I have included additional technical details and tips in `notes.md`.
This file contains guidance for:
- Local environment setup
- Dependency management
- Testing workflows
- Build & Docker optimization notes
This fork is licensed under the MIT License.
Original work by Wenliang Zhao, Xumin Yu, and Zengyi Qin in MeloTTS.