Clone your voice in your native Indian language!
DVCloner (DesiVoiceCloner) is an open-source, privacy-focused web application that allows users to synthesize speech in major Indian languages using their own voice as a reference. Powered by Flask (Python), a modern JavaScript/CSS/HTML frontend, and the AI4Bharat IndicF5 model, it enables high-quality voice cloning for educational and experimental use.
- About
- Features
- Supported Languages
- How It Works
- Screenshots
- Demo
- Installation
- Usage
- Project Structure
- Tech Stack
- Contributing
- License
- Acknowledgements
- Contact
DesiVoiceCloner is designed for the Indian context, supporting 10+ major languages. Users can upload a short voice sample (10–30 seconds), provide matching reference text, and synthesize new speech in their own voice for any text in their chosen language.
Note: This project is strictly educational and experimental. All voice processing is privacy-focused and temporary.
- Voice cloning in 10+ Indian languages
- Modern, mobile-friendly web UI
- One-click voice generation and instant download
- Reference text matching for better accuracy
- Privacy-focused: old files are auto-deleted
- Based on AI4Bharat's IndicF5 open AI model
- Supports multiple audio formats (wav, mp3, m4a, etc.)
- Open source and easy to extend
- Kannada (
kn) - Hindi (
hi) - Tamil (
ta) - Telugu (
te) - Gujarati (
gu) - Malayalam (
ml) - Marathi (
mr) - Bengali (
bn) - Punjabi (
pa) - Odia (
or)
(More can be added with minor code changes.)
- Input: Enter text in your chosen Indian language.
- Sample: Upload a clear voice sample (10–30 seconds) reading a short passage.
- Reference: Enter exactly what was spoken in the audio sample.
- Select Language: Pick your language from the list.
- Generate: Click "Generate Voice"—your cloned voice will be synthesized and available for download.
All processing leverages the AI4Bharat IndicF5 backend for robust and accurate results.
Add screenshots of the UI to showcase the app (e.g., input form, results, contributors section, etc.).
- Live Demo (Coming Soon)
- Video Walkthrough (Optional)
- Python 3.8+
- pip (Python package manager)
- Node.js and npm (if customizing frontend assets)
ffmpeginstalled and accessible in your PATH
git clone https://github.com/mithun50/DVCloner.git
cd DVCloner
pip install -r requirements.txt
python app.pyVisit http://localhost:5000 in your browser.
-
Clone the repository
git clone https://github.com/mithun50/DVCloner.git cd DVCloner -
Install Python dependencies
pip install -r requirements.txt
- Flask
- gradio_client
- ffmpeg-python
-
Ensure ffmpeg is installed
- On Ubuntu:
sudo apt install ffmpeg - On Mac:
brew install ffmpeg
- On Ubuntu:
-
Run the app
python app.py
The app will be available at
http://localhost:5000. -
(Optional) For frontend development, customize
templates/index.htmlandstatic/css/style.css.
- Open the web app in your browser.
- Enter your text (Indian language only).
- Upload a voice sample (wav/mp3/m4a).
- Fill in the reference text that matches your recording.
- Select your language.
- Click "Generate Voice."
- Listen and download your cloned voice!
Note: English text is not supported in any input field.
DVCloner/
├── app.py # Flask backend & API routes
├── requirements.txt # Python dependencies
├── templates/
│ └── index.html # Main frontend UI (HTML)
├── static/
│ ├── css/
│ │ └── style.css # Main CSS styles
│ ├── script/
│ │ └── script.js # Frontend JS (optional)
│ └── outputs/ # Generated voice files
├── uploads/ # Temp audio uploads (auto-cleaned)
├── README.md # Project documentation
- Frontend: HTML, CSS (Poppins, Font Awesome), JavaScript
- Backend: Python, Flask
- AI Model: AI4Bharat IndicF5 (via Hugging Face/Gradio client)
- Audio Processing: ffmpeg-python
We welcome contributions! To contribute:
- Fork the repo and create your branch:
git checkout -b my-feature - Commit your changes:
git commit -am 'Add new feature' - Push to the branch:
git push origin my-feature - Open a pull request
See the Contributors section in the app for more info.
Distributed under the MIT License. See LICENSE for details.
- AI4Bharat for IndicF5 TTS/cloning model
- Gradio for seamless AI API integration
- Font Awesome for UI icons
- All contributors and testers!
Created by Mithun Gowda B
For issues or suggestions, open an issue or email via GitHub profile.
DesiVoiceCloner – Bringing voice cloning to every Indian language, for everyone.
Note: This project is for educational and experimental use only. Generated voices should not be used for impersonation or malicious purposes.