Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View Cocii's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Zhipu Ai
  • Beijing

Block or report Cocii

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

GLM-TTS: Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning

Python 748 91 Updated Dec 17, 2025

An Open Phone Agent Model & Framework. Unlocking the AI Phone for Everyone

Python 18,225 2,852 Updated Dec 19, 2025

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 18,123 2,013 Updated Dec 17, 2025
Python 425 28 Updated Nov 27, 2025
Python 1,456 152 Updated Nov 15, 2025

TASU: A New Style of Alignment of Speech LLM with only Text Training Data, zero-shot on ASR and Other SU tasks

Python 20 1 Updated Nov 17, 2025

💼【AI找工作助手】全平台自动投简历脚本:(boss、前程无忧、猎聘、智联招聘)

Java 5,505 712 Updated Nov 27, 2025

语音方向实验室/公司/资源/实习等,欢迎推荐或自荐

590 69 Updated Nov 13, 2024

[EMNLP 2025 Demo] PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/MCP/Docker/Zotero

Python 30,702 2,749 Updated Nov 25, 2025

Flops counter for neural networks in pytorch framework

Python 2,958 308 Updated Aug 20, 2025

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 3,140 192 Updated Oct 9, 2025

MiMo-Audio: Audio Language Models are Few-Shot Learners

Python 905 87 Updated Sep 20, 2025

MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining

Python 1,805 75 Updated Jun 5, 2025

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 16,870 2,029 Updated Dec 2, 2025

MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows

Python 116 11 Updated Sep 2, 2025

Agent OS is a system for better planning and executing software development tasks with your AI agents.

Shell 2,923 534 Updated Dec 11, 2025

Production-ready Claude subagents collection with 100+ specialized AI agents for full-stack development, DevOps, data science, and business operations.

6,076 650 Updated Dec 17, 2025

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

Python 685 97 Updated Apr 26, 2024

A book about Text-to-Speech (TTS) in Chinese.

TeX 613 80 Updated Apr 19, 2022

Recommend new arxiv papers of your interest daily according to your Zotero libarary.

Python 4,267 3,773 Updated Dec 17, 2025

MOSS-TTSD is a spoken dialogue generation model that enables expressive dialogue speech synthesis in both Chinese and English, supporting zero-shot multi-speaker voice cloning, and long-form speech…

Python 1,060 95 Updated Dec 8, 2025

Text-audio foundation model from Boson AI

Python 7,754 577 Updated Sep 15, 2025

The TTSDS benchmark evaluates synthetic speech quality by considering prosody, speaker identity, and intelligibility, comparing these factors with real speech and noise datasets.

Python 72 5 Updated Sep 29, 2025

Implementing DeepSeek R1's GRPO algorithm from scratch

Python 1,716 81 Updated Apr 18, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,649 2,859 Updated Dec 20, 2025

[CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis

Python 2,017 237 Updated Nov 30, 2025

中文汉语拼音辞典,汉字拼音字典,词典,成语词典,常用字、多音字字典数据库

701 159 Updated Feb 4, 2025

GLM-4.6V/4.5V/4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Python 2,060 139 Updated Dec 18, 2025

中文分词 词性标注 命名实体识别 依存句法分析 成分句法分析 语义依存分析 语义角色标注 指代消解 风格转换 语义相似度 新词发现 关键词短语提取 自动摘要 文本分类聚类 拼音简繁转换 自然语言处理

Python 36,001 10,882 Updated Nov 15, 2025
Python 815 74 Updated Jun 7, 2024
Next