ManaTTS is the largest open Persian speech dataset with 114+ hours of transcribed audio. Includes data collection pipeline and tools. Suitable for Persian text-to-speech models.
- Updated Jul 12, 2025
- Jupyter Notebook
A freely licensed Persian TTS dataset containing 6+ hours of audio-text pairs with subject
Tacotron2 Persian Text-to-Speech Model trained on ManaTTS, the largest open single-speaker Persian speech dataset with over 114 hours of high-quality audio.
This project focuses on implementing a Keyword Spotting (KWS) system for Persian (Farsi) conversational speech using a fine-tuned version of wav2vec2-xlsr-large.