Stars
A high-throughput and memory-efficient inference and serving engine for LLMs
The definitive Web UI for local AI, with powerful features and easy setup.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
A generative speech model for daily dialogue.
SGLang is a high-performance serving framework for large language models and multimodal models.
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
Large Language Model Text Generation Inference
ModelScope: bring the notion of Model-as-a-Service to life.
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
A Python-based Xiaozhi AI for users who want the full Xiaozhi experience without owning specialized hardware.
4-bit quantization of LLaMA using GPTQ
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
API and WebSocket server for SenseVoice. It inherits some enhanced features, such as voice activity detection (VAD), real-time streaming recognition, and speaker verification.
Download metadata directly from the DHT network.