Agno is a multi-agent framework, runtime and control plane. Built for speed, privacy, and scale.
It provides a rich set of primitives for building:
- Agents with persistent state, knowledge retrieval, memory, and advanced features like human-in-the-loop, guardrails, dynamic context management and best-in-class MCP support.
- Multi-Agent Teams that operate autonomously under a team leader that maintains shared state and context.
- Step-based Workflows for controlled, deterministic execution. Steps can be Agents, Teams, or a regular python functions and can run sequentially, in parallel, in loops, branches, or conditionally.
Agno also provides a FastAPI-powered runtime for serving multi-agent systems in production, covering the entire {build → run → deploy} lifecycle. Building Agents is easy, running them is hard, and that's where Agno shines.
If you're new to Agno, follow our quickstart to build your first Agent and chat with it using the AgentOS UI.
After that, checkout the examples gallery and build real-world applications with Agno.
- Docs: docs.agno.com
- Cookbook: Cookbook
- Community forum: community.agno.com
- Discord: discord
Here’s an example of an Agent that connects to an MCP server, manages conversation state in a database, and is served using a FastAPI application that you can interact with using the AgentOS UI.
from agno.agent import Agent
from agno.db.sqlite import SqliteDb
from agno.models.anthropic import Claude
from agno.os import AgentOS
from agno.tools.mcp import MCPTools
# ************* Create Agent *************
agno_agent = Agent(
    name="Agno Agent",
    model=Claude(id="claude-sonnet-4-5"),
    # Add a database to the Agent
    db=SqliteDb(db_file="agno.db"),
    # Add the Agno MCP server to the Agent
    tools=[MCPTools(transport="streamable-http", url="https://docs.agno.com/mcp")],
    # Add the previous session history to the context
    add_history_to_context=True,
    markdown=True,
)
# ************* Create AgentOS *************
agent_os = AgentOS(agents=[agno_agent])
# Get the FastAPI app for the AgentOS
app = agent_os.get_app()
# ************* Run AgentOS *************
if __name__ == "__main__":
    agent_os.serve(app="agno_agent:app", reload=True)Agno is a remarkably feature-rich framework, purpose-built for large-scale multi-agent deployments.
| Category | Feature | Description | 
|---|---|---|
| Core Intelligence | Model Agnostic | Works with any model provider so you can use your favorite LLMs. | 
| Type Safe | Enforce structured I/O through input_schemaandoutput_schemafor predictable, composable behavior. | |
| Dynamic Context Engineering | Inject variables, state, and retrieved data on the fly into context. Perfect for dependency-driven agents. | |
| Memory, Knowledge, and Persistence | Persistent Storage | Give your Agents, Teams, and Workflows a database to persist session history, state, and messages. | 
| User Memory | Built-in memory system that allows Agents to recall user-specific context across sessions. | |
| Agentic RAG | Connect to 20+ vector stores (called Knowledge in Agno) with hybrid search + reranking out of the box. | |
| Culture (Collective Memory) | Shared knowledge that compounds across agents and time. | |
| Execution & Control | Human-in-the-Loop | Native support for confirmations, manual overrides, and external tool execution. | 
| Guardrails | Built-in safeguards for validation, security, and prompt protection. | |
| Agent Lifecycle Hooks | Pre- and post-hooks to validate or transform inputs and outputs. | |
| MCP Integration | First-class support for the Model Context Protocol (MCP) to connect Agents with external systems. | |
| Toolkits | 100+ built-in toolkits with thousands of tools, ready for use across data, code, web, and enterprise APIs. | |
| Runtime & Evaluation | Runtime | Pre-built FastAPI based runtime with SSE compatible endpoints, ready for production on day 1. | 
| Control Plane (UI) | Integrated interface to visualize, monitor, and debug agent activity in real time. | |
| Natively Multimodal | Agents can process and generate text, images, audio, video, and files. | |
| Evals | Measure your Agents' Accuracy, Performance, and Reliability. | |
| Security & Privacy | Private by Design | Runs entirely in your cloud. The UI connects directly to your AgentOS from your browser, no data is ever sent externally. | 
| Data Governance | Your data lives securely in your Agent database, no external data sharing or vendor lock-in. | |
| Access Control | Role-based access (RBAC) and per-agent permissions to protect sensitive contexts and tools. | 
Every part of Agno is built for real-world deployment — where developer experience meets production performance.
AgentOS is Agno's high-performance runtime for serving multi-agent systems in production. Key features include:
- 
Pre-built FastAPI Runtime: AgentOS ships with a ready-to-use FastAPI app for orchestrating your agents, teams, and workflows. This provides a major head start when building an AI product. 
- 
Integrated Control Plane: The AgentOS UI connects directly to your runtime, letting you test, monitor, and manage your system in real time. This gives you unmatched visibility and control over your system. 
- 
Private by Design: AgentOS runs entirely in your cloud, ensuring complete data privacy. No data ever leaves your system. This is ideal for security-conscious enterprises. 
Here's what the AgentOS UI looks like in action:
Agno.AgentOS.UI.mp4
For companies building agents, Agno provides the complete agentic solution:
- The fastest framework for building agents, multi-agent teams and agentic workflows.
- A ready-to-use FastAPI app that gets you building AI products on day one.
- A control plane for testing, monitoring and managing your system.
Agno brings a novel architecture that no other framework provides, your AgentOS runs securely in your cloud, and the control plane connects directly to it from your browser. You don't need to send data to any external services or pay retention costs, you get complete privacy and control.
For LLMs and AI assistants to understand and navigate Agno's documentation, we provide an llms.txt or llms-full.txt file. This file is built for AI systems to efficiently parse and reference our documentation.
When building Agno agents, using Agno documentation as a source in your IDE is a great way to speed up your development. Here's how to integrate with Cursor:
- In Cursor, go to the "Cursor Settings" menu.
- Find the "Indexing & Docs" section.
- Add https://docs.agno.com/llms-full.txtto the list of documentation URLs.
- Save the changes.
Now, Cursor will have access to the Agno documentation. You can do the same with other IDEs like VSCode, Windsurf etc.
If you're building with Agno, you're guaranteed best-in-class performance by default. Our obsession with performance is necessary because even simple AI workflows can spawn hundreds of Agents and because many tasks are long-running -- stateless, horizontal scalability is key for success.
At Agno, we optimize performance across 3 dimensions:
- Agent performance: We optimize static operations (instantiation, memory footprint) and runtime operations (tool calls, memory updates, history management).
- System performance: The AgentOS API is async by default and has a minimal memory footprint. The system is stateless and horizontally scalable, with a focus on preventing memory leaks. It handles parallel and batch embedding generation during knowledge ingestion, metrics collection in background tasks, and other system-level optimizations.
- Agent reliability and accuracy: Monitored through evals, which we’ll explore later.
Let's measure the time it takes to instantiate an Agent and the memory footprint of an Agent. Here are the numbers (last measured in Oct 2025, on an Apple M4 MacBook Pro):
- Agent instantiation: ~3μs on average
- Memory footprint: ~6.6Kib on average
We'll show below that Agno Agents instantiate 529× faster than Langgraph, 57× faster than PydanticAI, and 70× faster than CrewAI. Agno Agents also use 24× lower memory than Langgraph, 4× lower than PydanticAI, and 10× lower than CrewAI.
Note
Run time performance is bottlenecked by inference and hard to benchmark accurately, so we focus on minimizing overhead, reducing memory usage, and parallelizing tool calls.
Let's measure instantiation time for an Agent with 1 tool. We'll run the evaluation 1000 times to get a baseline measurement. We'll compare Agno to LangGraph, CrewAI and Pydantic AI.
Note
The code for this benchmark is available here. You should run the evaluation yourself on your own machine, please, do not take these results at face value.
# Setup virtual environment
./scripts/perf_setup.sh
source .venvs/perfenv/bin/activate
# Agno
python cookbook/evals/performance/instantiate_agent_with_tool.py
# LangGraph
python cookbook/evals/performance/comparison/langgraph_instantiation.py
# CrewAI
python cookbook/evals/performance/comparison/crewai_instantiation.py
# Pydantic AI
python cookbook/evals/performance/comparison/pydantic_ai_instantiation.pyLangGraph is on the right, let's start it first and give it a head start. Then CrewAI and Pydantic AI follow, and finally Agno. Agno obviously finishes first, but let's see by how much.
Agno.Performance.Benchmark.mp4
To measure memory usage, we use the tracemalloc library. We first calculate a baseline memory usage by running an empty function, then run the Agent 1000x times and calculate the difference. This gives a (reasonably) isolated measurement of the memory usage of the Agent.
We recommend running the evaluation yourself on your own machine, and digging into the code to see how it works. If we've made a mistake, please let us know.
Taking Agno as the baseline, we can see that:
| Metric | Agno | Langgraph | PydanticAI | CrewAI | 
|---|---|---|---|---|
| Time (seconds) | 1× | 529× slower | 57× slower | 70× slower | 
| Memory (MiB) | 1× | 24× higher | 4× higher | 10× higher | 
Exact numbers from the benchmark:
| Metric | Agno | Langgraph | PydanticAI | CrewAI | 
|---|---|---|---|---|
| Time (seconds) | 0.000003 | 0.001587 | 0.000170 | 0.000210 | 
| Memory (MiB) | 0.006642 | 0.161435 | 0.028712 | 0.065652 | 
Note
Agno agents are designed for performance and while we share benchmarks against other frameworks, we should be mindful that accuracy and reliability are more important than speed.
We welcome contributions, read our contributing guide to get started.
Agno logs which model an agent used so we can prioritize updates to the most popular providers. You can disable this by setting AGNO_TELEMETRY=false in your environment.