AI-X-King

1 follower · 0 following

Stars

NVIDIA / NeMo-speech-data-processor

A toolkit for processing speech data and creating speech datasets

Python 181 36 Updated Sep 29, 2025

gengxuelong / wenet_LLM_from_ASLP

wenet_LLM_from_ASLP

Python 14 1 Updated Nov 26, 2024

allenai / OLMoASR

An open-source implementation of Whisper

Python 451 41 Updated Oct 29, 2025

microsoft / Recognizers-Text

Microsoft.Recognizers.Text provides recognition and resolution of numbers, units, date/time, etc. in multiple languages (ZH, EN, FR, ES, PT, DE, IT, TR, HI, NL. Partial support for JA, KO, AR, SV).…

C# 1,752 434 Updated Feb 19, 2025

HumanAIGC / omnitalker

[NeurIPS 2025] OmniTalker: Real-Time Text-Driven Talking Head Generation with In-Context Audio-Visual Style Replication

JavaScript 391 28 Updated Sep 19, 2025

SkyworkAI / SkyReels-V2

SkyReels-V2: Infinite-length Film Generative model

Python 4,840 679 Updated Aug 11, 2025

wenet-e2e / wespeaker

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Python 1,060 160 Updated Oct 13, 2025

dashingsoft / pyarmor

A tool used to obfuscate Python scripts, bind obfuscated scripts to a fixed machine, or expire obfuscated scripts.

Python 4,692 336 Updated Oct 30, 2025

pex-tool / pex

A tool for generating .pex (Python EXecutable) files, lock files and venvs.

Python 4,108 304 Updated Oct 29, 2025

jingyaogong / minimind-v

🚀 "Large models": train a 26M-parameter multimodal vision-language model (VLM) from scratch in just 1 hour! 🌏

Python 5,133 535 Updated Oct 30, 2025

jingyaogong / minimind

🚀🚀 "Large models": train a small 26M-parameter GPT entirely from scratch in just 2 hours! 🌏

Python 32,038 3,691 Updated Oct 30, 2025

567-labs / instructor

structured outputs for llms

Python 11,718 878 Updated Oct 29, 2025

chonkie-inc / chonkie

🦛 CHONK docs with Chonkie ✨ — The no-nonsense RAG library

Python 3,080 195 Updated Oct 29, 2025

pyper-dev / pyper

Concurrent Python made simple

Python 1,506 29 Updated Feb 4, 2025

hexgrad / kokoro

https://hf.co/hexgrad/Kokoro-82M

JavaScript 4,666 523 Updated Aug 6, 2025

2noise / ChatTTS

A generative speech model for daily dialogue.

Python 38,054 4,127 Updated Jul 6, 2025

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 19,476 3,206 Updated Oct 30, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 61,403 10,889 Updated Oct 30, 2025

ggml-org / llama.cpp

LLM inference in C/C++

C++ 88,478 13,456 Updated Oct 30, 2025

nari-labs / dia

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 18,710 1,619 Updated Jul 6, 2025

brucefan1983 / CUDA-Programming

Sample codes for my CUDA programming book

Cuda 1,914 374 Updated Feb 15, 2025

Infatoshi / cuda-course

Cuda 1,812 341 Updated Oct 13, 2025

eole-nlp / eole

Open language modeling toolkit based on PyTorch

Python 152 22 Updated Oct 29, 2025

snakers4 / silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 7,204 656 Updated Oct 29, 2025

Zyphra / Zonos

Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …

Python 7,085 810 Updated Mar 5, 2025

tesseract-ocr / tesseract

Tesseract Open Source OCR Engine (main repository)

C++ 70,593 10,335 Updated Oct 13, 2025

ZigeW / data_management_LLM

Collection of training data management explorations for large language models

335 31 Updated Aug 2, 2024

TaoRuijie / ECAPA-TDNN

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when trained only on Vox2)

Python 743 127 Updated Apr 11, 2024

leto19 / MultiMetricGANplusplus

Python 6 2 Updated Sep 8, 2023

allenai / OLMo

Modeling, training, eval, and inference code for OLMo

Python 6,063 664 Updated Oct 24, 2025
