Build, run, manage multi-agent systems.
Agno is a framework, runtime, and control plane for multi-agent systems.
| Layer | What it does |
|---|---|
| Framework | Build agents, teams, and workflows with memory, knowledge, guardrails, and 100+ integrations |
| AgentOS Runtime | Run your system in production with a stateless, secure FastAPI backend |
| Control Plane | Test, monitor, and manage your system using the AgentOS UI |
- Private by design. AgentOS runs in your cloud. The control plane connects directly to your runtime from your browser. No retention costs, no vendor lock-in, no compliance headaches.
- Production-ready on day one. Pre-built FastAPI runtime with SSE endpoints, ready to deploy.
- Fast. 529× faster instantiation than LangGraph. 24× lower memory. See benchmarks →
An agent with MCP tools, persistent state, served via FastAPI:
from agno.agent import Agent
from agno.db.sqlite import SqliteDb
from agno.models.anthropic import Claude
from agno.os import AgentOS
from agno.tools.mcp import MCPTools
agno_agent = Agent(
name="Agno Agent",
model=Claude(id="claude-sonnet-4-5"),
db=SqliteDb(db_file="agno.db"),
tools=[MCPTools(transport="streamable-http", url="https://docs.agno.com/mcp")],
add_history_to_context=True,
markdown=True,
)
agent_os = AgentOS(agents=[agno_agent])
app = agent_os.get_app()
if __name__ == "__main__":
agent_os.serve(app="agno_agent:app", reload=True)Run this and connect to the AgentOS UI:
Agno.AgentOS.UI.mp4
Core
- Model-agnostic: OpenAI, Anthropic, Google, local models
- Type-safe I/O with
input_schemaandoutput_schema - Async-first, built for long-running tasks
- Natively multimodal (text, images, audio, video, files)
Memory & Knowledge
- Persistent storage for session history and state
- User memory across sessions
- Agentic RAG with 20+ vector stores, hybrid search, reranking
- Culture: shared long-term memory across agents
Orchestration
- Human-in-the-loop (confirmations, approvals, overrides)
- Guardrails for validation and security
- Pre/post hooks for the agent lifecycle
- First-class MCP and A2A support
- 100+ built-in toolkits
Production
- Ready-to-use FastAPI runtime
- Integrated control plane UI
- Evals for accuracy, performance, latency
- Durable execution for resumable workflows
- RBAC and per-agent permissions
- Follow the getting started guide
- Browse the cookbook for real-world examples
- Read the docs to go deeper
Agent workloads spawn hundreds of instances. Stateless, horizontal scalability isn't optional.
| Metric | Agno | LangGraph | PydanticAI | CrewAI |
|---|---|---|---|---|
| Instantiation | 3μs | 1,587μs (529×) | 170μs (57×) | 210μs (70×) |
| Memory | 6.6 KiB | 161 KiB (24×) | 29 KiB (4×) | 66 KiB (10×) |
Apple M4 MacBook Pro, Oct 2025. Run benchmarks yourself →
Agno.Performance.Benchmark.mp4
Add our docs to your AI-enabled editor:
Cursor: Settings → Indexing & Docs → Add https://docs.agno.com/llms-full.txt
Also works with VSCode, Windsurf, and similar tools.
We welcome contributions. See the contributing guide.
Agno logs which model providers are used to prioritize updates. Disable with AGNO_TELEMETRY=false.