See the separate README.md file in the tts_data_pipeline subfolder for more details.
A Python CLI tool for processing structured audio datasets, calculating total duration, and pushing the dataset to the Hugging Face Hub. Ideal for preparing Vietnamese TTS datasets or other speech-related corpora.
```
dataset/
├── speaker_001/
│   ├── audio1.wav
│   ├── audio1.txt
│   ├── audio2.wav
│   └── audio2.txt
├── speaker_002/
│   ├── ...
```
Each `.wav` file must have a corresponding `.txt` transcription file with the same base name.
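As a quick sanity check before running the pipeline, a small script along the lines below can walk the speaker subfolders and pair each `.wav` file with its transcript. This helper (`collect_pairs`) is illustrative only and is not part of the tool itself.

```python
from pathlib import Path

def collect_pairs(dataset_dir: str):
    """Pair each .wav file with its same-named .txt transcript.

    Illustrative helper; the actual tool may organize this differently.
    """
    pairs = []
    for wav_path in sorted(Path(dataset_dir).glob("speaker_*/*.wav")):
        txt_path = wav_path.with_suffix(".txt")
        if not txt_path.exists():
            print(f"Warning: missing transcript for {wav_path}")
            continue
        text = txt_path.read_text(encoding="utf-8").strip()
        pairs.append({
            "audio": str(wav_path),
            "text": text,
            "speaker": wav_path.parent.name,
        })
    return pairs

if __name__ == "__main__":
    examples = collect_pairs("dataset")
    print(f"Found {len(examples)} audio/transcript pairs")
```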
```
pip install typer librosa datasets num2words
```

```
python tts_data_pipeline/push_to_hub.py
```

Options:

- `--input-dir`: Path to the input dataset directory containing speaker subdirectories.
- `--dataset-id`: Dataset ID on the Hugging Face Hub.
- `--commit`: Commit message for pushing to the Hugging Face Hub.
- `--private`: Make the dataset private on the Hugging Face Hub.
- `--push-to-hub`: Push the dataset to the Hugging Face Hub.
- `--calc-duration`: Calculate total duration and print it to the console.
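For reference, the operations behind `--calc-duration` and `--push-to-hub` can be sketched roughly as follows. This is a minimal illustration built on the `collect_pairs` helper above, not the script's actual implementation; the dataset ID in the usage comment is hypothetical.

```python
import librosa
from datasets import Dataset, Audio

def total_duration_seconds(pairs) -> float:
    """Sum the duration of every .wav file in the paired examples."""
    total = 0.0
    for ex in pairs:
        y, sr = librosa.load(ex["audio"], sr=None)  # keep native sample rate
        total += librosa.get_duration(y=y, sr=sr)
    return total

def push_dataset(pairs, dataset_id: str, commit: str, private: bool = False):
    """Build a Hugging Face dataset from the pairs and upload it."""
    ds = Dataset.from_list(pairs)          # columns: audio, text, speaker
    ds = ds.cast_column("audio", Audio())  # decode audio lazily on access
    ds.push_to_hub(dataset_id, private=private, commit_message=commit)

# Example usage (hypothetical dataset ID):
# pairs = collect_pairs("dataset")
# print(f"Total duration: {total_duration_seconds(pairs) / 3600:.2f} hours")
# push_dataset(pairs, "username/vietnamese-tts", "Initial upload", private=True)
```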
This project is licensed under the Apache License, Version 2.0.