Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View AI-X-King's full-sized avatar

Block or report AI-X-King

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A toolkit for processing speech data and creating speech datasets

Python 181 36 Updated Sep 29, 2025

wenet_LLM_from_ASLP

Python 14 1 Updated Nov 26, 2024

An open-source implementation of Whisper

Python 451 41 Updated Oct 29, 2025

Microsoft.Recognizers.Text provides recognition and resolution of numbers, units, date/time, etc. in multiple languages (ZH, EN, FR, ES, PT, DE, IT, TR, HI, NL. Partial support for JA, KO, AR, SV).…

C# 1,752 434 Updated Feb 19, 2025

[NeurIPS 2025] OmniTalker: Real-Time Text-Driven Talking Head Generation with In-Context Audio-Visual Style Replication

JavaScript 391 28 Updated Sep 19, 2025

SkyReels-V2: Infinite-length Film Generative model

Python 4,840 679 Updated Aug 11, 2025

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Python 1,060 160 Updated Oct 13, 2025

A tool used to obfuscate python scripts, bind obfuscated scripts to fixed machine or expire obfuscated scripts.

Python 4,692 336 Updated Oct 30, 2025

A tool for generating .pex (Python EXecutable) files, lock files and venvs.

Python 4,108 304 Updated Oct 29, 2025

🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!

Python 5,133 535 Updated Oct 30, 2025

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 32,038 3,691 Updated Oct 30, 2025

structured outputs for llms

Python 11,718 878 Updated Oct 29, 2025

🦛 CHONK docs with Chonkie ✨ — The no-nonsense RAG library

Python 3,080 195 Updated Oct 29, 2025

Concurrent Python made simple

Python 1,506 29 Updated Feb 4, 2025

https://hf.co/hexgrad/Kokoro-82M

JavaScript 4,666 523 Updated Aug 6, 2025

A generative speech model for daily dialogue.

Python 38,054 4,127 Updated Jul 6, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 19,476 3,206 Updated Oct 30, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 61,403 10,889 Updated Oct 30, 2025

LLM inference in C/C++

C++ 88,478 13,456 Updated Oct 30, 2025

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 18,710 1,619 Updated Jul 6, 2025

Sample codes for my CUDA programming book

Cuda 1,914 374 Updated Feb 15, 2025

Open language modeling toolkit based on PyTorch

Python 152 22 Updated Oct 29, 2025

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 7,204 656 Updated Oct 29, 2025

Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …

Python 7,085 810 Updated Mar 5, 2025

Tesseract Open Source OCR Engine (main repository)

C++ 70,593 10,335 Updated Oct 13, 2025

Collection of training data management explorations for large language models

335 31 Updated Aug 2, 2024

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)

Python 743 127 Updated Apr 11, 2024

Modeling, training, eval, and inference code for OLMo

Python 6,063 664 Updated Oct 24, 2025
Next