Starred repositories
⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Python tool for converting files and office documents to Markdown.
An open-source AI agent that brings the power of Gemini directly into your terminal.
这是一个画大图的Agent的系统提示词,绘制静态的实体关系拓扑图,动态的时序图。可以用来做业务需求分析,代码逻辑分析。
mcp-use is the easiest way to interact with mcp servers with custom agents
CLI tool for configuring and monitoring Claude Code
A configuration framework that enhances Claude Code with specialized commands, cognitive personas, and development methodologies.
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data…
A powerful GUI app and Toolkit for Claude Code - Create custom agents, manage interactive Claude Code sessions, run secure background agents, and more.
A simple yet powerful agent framework that delivers with open-source models
Eko (Eko Keeps Operating) - Build Production-ready Agentic Workflow with Natural Language - eko.fellou.ai
Automate browser-based workflows with LLMs and Computer Vision
A research prototype of a human-centered web agent
Open-source, vision-first browser agent
A curated list of awesome Claude Code Sub-Agents
Intelligent automation and multi-agent orchestration for Claude Code
VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo
Use Claude Code as the foundation for coding infrastructure, allowing you to decide how to interact with the model while enjoying updates from Anthropic.
DeepResearchAgent is a hierarchical multi-agent system designed not only for deep research tasks but also for general-purpose task solving. The framework leverages a top-level planning agent to coo…
slime is an LLM post-training framework for RL Scaling.
End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.