Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View 78's full-sized avatar
🏠
Working from home
🏠
Working from home

Block or report 78

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
20 stars written in Python
Clear filter

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 68,382 12,859 Updated Jan 24, 2026

The definitive Web UI for local AI, with powerful features and easy setup.

Python 45,922 5,880 Updated Jan 15, 2026

Universal memory layer for AI Agents

Python 45,888 5,028 Updated Jan 13, 2026

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 39,381 4,772 Updated Jun 2, 2025

A generative speech model for daily dialogue.

Python 38,580 4,198 Updated Jan 18, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 22,667 4,169 Updated Jan 24, 2026

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 14,629 1,531 Updated Jan 7, 2026

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

Python 13,021 1,221 Updated Sep 26, 2025

Large Language Model Text Generation Inference

Python 10,739 1,253 Updated Jan 8, 2026

ModelScope: bring the notion of Model-as-a-Service to life.

Python 8,662 905 Updated Jan 19, 2026

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 7,553 649 Updated Jan 23, 2026
Python 4,603 373 Updated Dec 19, 2025

A Python-based Xiaozhi AI for users who want the full Xiaozhi experience without owning specialized hardware.

Python 3,127 651 Updated Jan 7, 2026

4 bits quantization of LLaMA using GPTQ

Python 3,077 457 Updated Jul 13, 2024

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Python 2,748 242 Updated Dec 8, 2025

API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition, and speaker verification.

Python 539 88 Updated Oct 23, 2024

Naive Bayes-based Context Extension

Python 326 22 Updated Dec 9, 2024

Download metadata from DHT network directly.

Python 54 41 Updated May 15, 2015

Django storage for qcloud.com 对象存储服务

Python 18 12 Updated Mar 16, 2018

A pure python implemented QUIC HTTP/3 Client

Python 2 Updated Nov 28, 2025