yinhao0214

🎯

Focusing

Hao Yin yinhao0214

🎯

Focusing

My research interest is Machine Learning, Deep Learning and NLP、Speech.

414 followers · 51 following

Soochow University
suzhou
http://www.jianshu.com/u/52c593425488

Achievements

Lists (1)

Sort

✨ Inspiration

Stars

QwenLM / Qwen3-TTS

Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…

Python 7,955 990 Updated Feb 6, 2026

ASLP-lab / VoiceSculptor

An instruct text-to-speech solution based on LLaSA and CosyVoice2 developed by the ASLP lab and collaborators.

Python 220 12 Updated Jan 20, 2026

OpenBMB / VoxCPM

VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning

Python 5,910 710 Updated Feb 11, 2026

ysharma3501 / MiraTTS

A high quality and fast TTS repository

Python 501 42 Updated Dec 22, 2025

karpathy / nanochat

The best ChatGPT that $100 can buy.

Python 43,631 5,687 Updated Feb 19, 2026

vllm-project / vllm-omni

A framework for efficient model inference with omni-modality models

Python 2,767 435 Updated Feb 16, 2026

ASLP-lab / MeanVC

A Lightweight and Streaming Zero-Shot Voice Conversion via Mean Flows

Python 226 16 Updated Jan 8, 2026

Soul-AILab / SoulX-Podcast

SoulX-Podcast is an inference codebase by the Soul AI team for generating high-fidelity podcasts from text.

Python 3,162 411 Updated Dec 11, 2025

wenet-e2e / west

We Speech Toolkit, LLM based Speech Toolkit for Speech Understanding, Generation, and Interaction

Python 178 14 Updated Feb 3, 2026

FunAudioLLM / CV3-Eval

Python 173 15 Updated Aug 25, 2025

wenet-e2e / wesr

We Speech Transcript based on LLM, in 300 lines of code.

Python 183 18 Updated Jun 20, 2025

boson-ai / higgs-audio

Text-audio foundation model from Boson AI

Python 7,914 604 Updated Jan 18, 2026

wenet-e2e / wetts

Production First and Production Ready End-to-End Text-to-Speech Toolkit

Python 415 62 Updated Nov 20, 2025

BytedanceSpeech / seed-tts-eval

Python 1,530 142 Updated Jun 14, 2024

AdolfVonKleist / Phonetisaurus

Phonetisaurus G2P

Shell 507 129 Updated Jun 1, 2024

wenet-e2e / WeTextProcessing

Text Normalization & Inverse Text Normalization

Python 726 97 Updated Feb 3, 2026

triton-inference-server / tensorrtllm_backend

The Triton TensorRT-LLM Backend

920 135 Updated Feb 18, 2026

Lyken17 / pytorch-OpCounter

Count the MACs / FLOPs of your PyTorch model.

Python 5,081 535 Updated Jul 8, 2024

Aria-K-Alethia / BigCodec

Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec"

Python 213 17 Updated Sep 19, 2024

zhenye234 / LLaSA_training

LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis

Python 654 52 Updated Jan 21, 2026

zhenye234 / X-Codec-2.0

Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis

Python 349 48 Updated Jul 21, 2025

HuiResearch / FlashTTS

基于SparkTTS、OrpheusTTS等模型，提供高质量中文语音合成与声音克隆服务。

Python 587 76 Updated May 18, 2025

SesameAILabs / csm

A Conversational Speech Generation Model

Python 14,495 1,460 Updated May 27, 2025

SparkAudio / Spark-TTS

Spark-TTS Inference Code

Python 10,915 1,167 Updated Apr 9, 2025

modelscope / ClearerVoice-Studio

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 3,916 322 Updated Aug 14, 2025

xinchen-ai / Westlake-Omni

Python 204 20 Updated Sep 24, 2024

DakeQQ / F5-TTS-ONNX

Running the F5-TTS by ONNX Runtime

Python 191 31 Updated Jan 7, 2026

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)

Python 9,003 879 Updated Feb 6, 2026

zhenye234 / xcodec

AAAI 2025: Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model

Python 291 23 Updated Oct 12, 2025

zai-org / GLM-4-Voice

GLM-4-Voice | 端到端中英语音对话模型

Python 3,140 274 Updated Dec 5, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Hao Yin yinhao0214

Achievements

Achievements

Block or report yinhao0214

Lists (1)

✨ Inspiration

Stars

QwenLM / Qwen3-TTS

ASLP-lab / VoiceSculptor

OpenBMB / VoxCPM

ysharma3501 / MiraTTS

karpathy / nanochat

vllm-project / vllm-omni

ASLP-lab / MeanVC

Soul-AILab / SoulX-Podcast

wenet-e2e / west

FunAudioLLM / CV3-Eval

wenet-e2e / wesr

boson-ai / higgs-audio

wenet-e2e / wetts

BytedanceSpeech / seed-tts-eval

AdolfVonKleist / Phonetisaurus

wenet-e2e / WeTextProcessing

triton-inference-server / tensorrtllm_backend

Lyken17 / pytorch-OpCounter

Aria-K-Alethia / BigCodec

zhenye234 / LLaSA_training

zhenye234 / X-Codec-2.0

HuiResearch / FlashTTS

SesameAILabs / csm

SparkAudio / Spark-TTS

modelscope / ClearerVoice-Studio

xinchen-ai / Westlake-Omni

DakeQQ / F5-TTS-ONNX

OpenRLHF / OpenRLHF

zhenye234 / xcodec

zai-org / GLM-4-Voice