Thanks to visit codestin.com
Credit goes to Github.com

Skip to content
View controlRun's full-sized avatar

Block or report controlRun

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 3,867 296 Updated Jan 29, 2026

Development repository for the Triton language and compiler

MLIR 18,280 2,531 Updated Jan 29, 2026

High Performance LLM Inference Operator Library

C++ 569 49 Updated Jan 28, 2026

A Python-level JIT compiler designed to make unmodified PyTorch programs faster.

Python 1,072 131 Updated Apr 17, 2024

TVM Documentation in Chinese Simplified / TVM 中文文档

TypeScript 3,138 601 Updated Nov 21, 2025

Push acceptor for ephemeral and batch jobs.

Go 3,285 500 Updated Jan 27, 2026

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…

Python 34,730 5,523 Updated Jan 29, 2026

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 6,131 806 Updated Jan 16, 2026

🚀 The fast, Pythonic way to build MCP servers and clients

Python 22,425 1,688 Updated Jan 29, 2026

Yet Another Document Translator

Python 7,591 592 Updated Jan 22, 2026

[EMNLP 2025 Demo] PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/MCP/Docker/Zotero

Python 31,568 2,846 Updated Nov 25, 2025

Production-ready platform for agentic workflow development.

TypeScript 127,964 19,930 Updated Jan 29, 2026

An AI agent development platform with all-in-one visual tools, simplifying agent creation, debugging, and deployment like never before. Coze your way to AI Agent creation.

TypeScript 19,709 2,806 Updated Jan 20, 2026

A powerful flow control component enabling reliability, resilience and monitoring for microservices. (面向云原生微服务的高可用流控防护组件)

Java 23,053 8,167 Updated Jan 26, 2026

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)

Python 8,898 861 Updated Jan 22, 2026

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 20,545 2,170 Updated Jan 27, 2026

OS-Level Memory Layer for LLMs, AI Agents & Multi-Agent Systems with long-term, working, and external memory.

Python 4,811 436 Updated Jan 29, 2026

Langflow is a powerful tool for building and deploying AI-powered agents and workflows.

Python 144,345 8,373 Updated Jan 29, 2026

The official NGINX Open Source repository.

C 29,199 7,736 Updated Jan 26, 2026

Model Context Protocol Servers

TypeScript 77,473 9,382 Updated Jan 27, 2026

Train transformer language models with reinforcement learning.

Python 17,186 2,458 Updated Jan 29, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 18,800 3,130 Updated Jan 29, 2026

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 2,646 376 Updated Jan 29, 2026

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

Python 13,090 1,233 Updated Jan 27, 2026

No fortress, purely open ground. OpenManus is Coming.

Python 53,880 9,467 Updated Jan 5, 2026

A framework for few-shot evaluation of language models.

Python 11,308 2,994 Updated Jan 27, 2026

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 6,615 732 Updated Jan 22, 2026

MLX: An array framework for Apple silicon

C++ 23,683 1,480 Updated Jan 29, 2026

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 18,053 1,565 Updated Jan 4, 2026

😎 Awesome lists about all kinds of interesting topics

432,957 32,974 Updated Jan 28, 2026
Next