Starred repositories

GLM-ASR-Nano: A robust, open-source speech recognition model with 1.5B parameters

Python 584 51 Updated Dec 12, 2025

"Paper2Slides: From Paper to Presentation in One Click"

Python 2,387 325 Updated Dec 19, 2025

MOSS-Speech is a true speech-to-speech large language model without text guidance.

Python 113 5 Updated Dec 4, 2025

MOSS-TTSD is a spoken dialogue generation model that enables expressive dialogue speech synthesis in both Chinese and English, supporting zero-shot multi-speaker voice cloning, and long-form speech…

Python 1,060 95 Updated Dec 8, 2025
Python 14 1 Updated Dec 6, 2025

We Speech Toolkit, LLM based Speech Toolkit for Speech Understanding, Generation, and Interaction

Python 167 11 Updated Dec 16, 2025

Automatic Korean word spacing with Python

Python 424 115 Updated Jul 4, 2024

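Live-stream recording software with continuous looped monitoring and simultaneous multi-streamer recording, supporting 40+ platforms including Douyin, TikTok, YouTube, Kuaishou, Huya, Douyu, Bilibili, Xiaohongshu, pandatv, sooplive, flextv, popkontv, twitcasting, winktv, Baidu, Weibo, Kugou, 17Live, Twitch, Acfun, CHZZK, and shopee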

Python 8,935 1,176 Updated Nov 3, 2025

Dolphin is a multilingual, multitask ASR model jointly trained by DataoceanAI and Tsinghua University.

Python 680 60 Updated Nov 27, 2025

Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)

Python 3,175 382 Updated Jun 11, 2025

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Python 10,933 1,218 Updated Dec 20, 2025

https://hrl.boyuai.com/

Jupyter Notebook 4,314 775 Updated Nov 22, 2022

My solutions to DLFC - Deep Learning: Foundations and Concepts

94 17 Updated Mar 30, 2025

[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling

Python 1,246 104 Updated Mar 2, 2025

This repository contains a series of works on diffusion-based speech tokenizers, including the official implementation of the paper: "TaDiCodec: Text-aware Diffusion Speech Tokenizer for Speech Lan…

Python 196 13 Updated Sep 21, 2025

Learning formula

Jupyter Notebook 296 60 Updated Jul 24, 2021

My Own Solution Manual of PRML

1,001 123 Updated Apr 5, 2021

Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice

Python 476 64 Updated Dec 18, 2025

Efficient audio understanding with general audio captions

Python 390 39 Updated Nov 3, 2025

FOSS Korean dictionary by the National Institute of Korean Language (국립국어원)

Python 108 19 Updated Aug 25, 2021

datasets resource

127 13 Updated Jul 1, 2025

A fast and lightweight python-based CTC beam search decoder for speech recognition.

Python 466 97 Updated Jul 13, 2023

T-one is a high-performance streaming ASR pipeline for Russian, specialized for the telephony domain.

Python 229 24 Updated Nov 12, 2025

PyTorch implementation of Audio Flamingo: Series of Advanced Audio Understanding Language Models

919 73 Updated Dec 15, 2025

✨✨Latest Advances on Multimodal Large Language Models

17,030 1,095 Updated Dec 12, 2025

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 43,946 5,857 Updated Aug 16, 2024

Open STT

Python 815 85 Updated Mar 11, 2022

Text-audio foundation model from Boson AI

Python 7,754 577 Updated Sep 15, 2025

Chinese speech pretrained models

Shell 1,183 88 Updated Aug 23, 2024

Code for DeSTA2.5-Audio

Python 127 7 Updated Dec 10, 2025