Stars
GLM-TTS: Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning
The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention
AcademiCodec: An Open Source Audio Codec Model for Academic Research
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
华南理工大学硕博士学位论文模板(LaTeX)。Latex templates for the thesis of South China University of Technology
Materials accompanying the paper "Phonological features for 0-shot multilingual speech synthesis"
Tensors and Dynamic neural networks in Python with strong GPU acceleration
This is now the official location of the Merlin project.
A Flow-based Generative Network for Speech Synthesis
🥗 All-in-one professional pop-up dictionary and page translator which supports multiple search modes, page translations, new word notebook and PDF selection searching.
LevelDB is a fast key-value storage library written at Google that provides an ordered mapping from string keys to string values.