Stars
PyTorch implementation of Google AI's 2018 BERT
Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)
🚀 AI voice cloning: clone your voice within 5 seconds and generate arbitrary speech content in real time
BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis
A programmer's guide to cooking at home (Simplified Chinese only).
PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Search and download music from NetEase Cloud Music, QQ Music, Kugou Music, Baidu Music, Xiami Music, Migu Music, and other services
A PyTorch implementation of EATS: End-to-End Adversarial Text-to-Speech
A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with PyTorch
Forced alignment and Goodness of Pronunciation (GOP) with DNN support. Based on Kaldi.
Agora Solo is an open source speech codec developed on top of Silk with BWE (Bandwidth Extension) and MDC (Multi Description Coding). With these technologies, Solo is able to resist weak net…
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
g2pC: A Context-aware Grapheme-to-Phoneme Conversion module for Chinese
An implementation of FastSpeech based on PyTorch.
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Voice Converter Using CycleGAN and Non-Parallel Data