Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View yinhao0214's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report yinhao0214

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…

Python 7,955 990 Updated Feb 6, 2026

An instruct text-to-speech solution based on LLaSA and CosyVoice2 developed by the ASLP lab and collaborators.

Python 220 12 Updated Jan 20, 2026

VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning

Python 5,910 710 Updated Feb 11, 2026

A high quality and fast TTS repository

Python 501 42 Updated Dec 22, 2025

The best ChatGPT that $100 can buy.

Python 43,631 5,687 Updated Feb 19, 2026

A framework for efficient model inference with omni-modality models

Python 2,767 435 Updated Feb 16, 2026

A Lightweight and Streaming Zero-Shot Voice Conversion via Mean Flows

Python 226 16 Updated Jan 8, 2026

SoulX-Podcast is an inference codebase by the Soul AI team for generating high-fidelity podcasts from text.

Python 3,162 411 Updated Dec 11, 2025

We Speech Toolkit, LLM based Speech Toolkit for Speech Understanding, Generation, and Interaction

Python 178 14 Updated Feb 3, 2026
Python 173 15 Updated Aug 25, 2025

We Speech Transcript based on LLM, in 300 lines of code.

Python 183 18 Updated Jun 20, 2025

Text-audio foundation model from Boson AI

Python 7,914 604 Updated Jan 18, 2026

Production First and Production Ready End-to-End Text-to-Speech Toolkit

Python 415 62 Updated Nov 20, 2025

Phonetisaurus G2P

Shell 507 129 Updated Jun 1, 2024

Text Normalization & Inverse Text Normalization

Python 726 97 Updated Feb 3, 2026

The Triton TensorRT-LLM Backend

920 135 Updated Feb 18, 2026

Count the MACs / FLOPs of your PyTorch model.

Python 5,081 535 Updated Jul 8, 2024

Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec"

Python 213 17 Updated Sep 19, 2024

LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis

Python 654 52 Updated Jan 21, 2026

Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis

Python 349 48 Updated Jul 21, 2025

基于SparkTTS、OrpheusTTS等模型,提供高质量中文语音合成与声音克隆服务。

Python 587 76 Updated May 18, 2025

A Conversational Speech Generation Model

Python 14,495 1,460 Updated May 27, 2025

Spark-TTS Inference Code

Python 10,915 1,167 Updated Apr 9, 2025

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 3,916 322 Updated Aug 14, 2025
Python 204 20 Updated Sep 24, 2024

Running the F5-TTS by ONNX Runtime

Python 191 31 Updated Jan 7, 2026

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)

Python 9,003 879 Updated Feb 6, 2026

AAAI 2025: Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model

Python 291 23 Updated Oct 12, 2025

GLM-4-Voice | 端到端中英语音对话模型

Python 3,140 274 Updated Dec 5, 2024
Next