Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Timesi/agent_learning

Β 
Β 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

65 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

Agent Learning Roadmap

πŸ€– Learn Agent Development from Scratch

A systematic, comprehensive, and practice-oriented AI Agent development guide

Daily auto-tracking of arXiv frontier papers β€” content stays cutting-edge, always.


License: MIT Stars PRs Welcome mdBook Daily arXiv


Read Online ChineseΒ Β Β Read Online English


πŸ› Report Issues Β· πŸ’¬ Discussions Β· πŸ‡¨πŸ‡³ δΈ­ζ–‡η‰ˆ README


πŸš€ Auto-Tracking Frontier: Daily arXiv Paper Updates

πŸ€– This repository automatically searches arXiv for the latest AI Agent-related papers every day and updates the content accordingly β€” ensuring you always stay at the cutting edge of research!

  • πŸ“‘ Daily Automated Search: A scheduled pipeline scans arXiv daily for new papers on Agent architectures, tool use, memory systems, multi-agent collaboration, reinforcement learning for agents, and more.
  • πŸ“ Auto-Updated Content: Relevant findings are automatically integrated into the corresponding chapters, keeping the book's frontier sections fresh and up-to-date.
  • πŸ”” Never Miss a Breakthrough: No need to manually track dozens of research feeds β€” this repo does it for you, so you can focus on learning and building.

πŸ’‘ This means the content you read here is not static β€” it evolves continuously with the latest advances in the AI Agent field.


✨ Key Features

  • 🎯 Step by Step: From LLM fundamentals to multi-Agent systems, each chapter has a clear knowledge progression
  • πŸ’» Code First: Every core concept comes with runnable Python code examples
  • 🎨 Rich Illustrations: 120+ hand-drawn SVG architecture diagrams / flowcharts / sequence diagrams for intuitive understanding
  • 🎬 Interactive Animations: 5 built-in interactive HTML animations (Perceive-Think-Act cycle, ReAct reasoning, Function Calling, RAG flow, GRPO sampling)
  • πŸ”¬ Paper Reviews: Key chapters include frontier paper deep-dives (ReAct, Reflexion, MemGPT, GRPO, etc.)
  • πŸ—οΈ Complete Projects: 3 comprehensive hands-on projects (AI Coding Assistant, Intelligent Data Analysis Agent, Multimodal Agent)
  • πŸ›‘οΈ Production Ready: Covers security, evaluation, deployment, and other production essentials
  • πŸ§ͺ Cutting Edge: Covers Context Engineering, Agentic-RL (GRPO/DPO/PPO), MCP/A2A/ANP, and other 2025–2026 latest advances
  • πŸ“ Formula Support: KaTeX-rendered math formulas for clear reading of policy gradient, KL divergence derivations in RL chapters
  • πŸ”„ Continuously Updated: Tracking the latest changes in LangChain, LangGraph, MCP, and other frameworks

πŸ“Έ Selected Content Preview

Below are selected showcases from the book's 120+ hand-drawn SVG illustrations, all original to this book.

🧠 Agent Core Architecture

Perceive-Think-Act Loop (Chapter 1)

Perceive-Think-Act Loop

Agent's core mechanism: Perceive environment β†’ LLM reasoning β†’ Execute action β†’ Loop until goal achieved

ReAct Reasoning Framework (Chapter 6)

ReAct Reasoning Framework

Thought β†’ Action β†’ Observation alternating loop, enabling Agents to think while acting

πŸ› οΈ Tool Calling & RAG

Function Calling Complete Flow (Chapter 4)

Function Calling Flow

6-step complete flow from user input to tool invocation to final response, with message structure illustration

RAG Retrieval-Augmented Generation (Chapter 7)

RAG Workflow

Offline indexing + Online retrieval dual-phase architecture, making LLM answers evidence-based

πŸ’Ύ Memory System & Context Engineering

Three-Layer Memory Architecture (Chapter 5)

Three-Layer Memory Architecture

Working memory β†’ Short-term memory β†’ Long-term memory, with important info sinking down and semantic retrieval pulling up

Prompt Engineering vs Context Engineering (Chapter 8)

Prompt Engineering vs Context Engineering

From "how to say it" to "what the LLM sees" β€” the paradigm shift of the Agent era

🀝 Multi-Agent & Communication Protocols

Three Multi-Agent Communication Patterns (Chapter 14)

Multi-Agent Communication Patterns

Message Queue (async decoupling) / Shared Blackboard (data sharing) / Direct Call (real-time collaboration)

MCP / A2A / ANP Protocol Comparison (Chapter 15)

Three Protocol Comparison

Three-layer protocol stack: ANP for discovery β†’ A2A for task collaboration β†’ MCP for tool invocation

πŸ§ͺ Reinforcement Learning & Frameworks

GRPO Training Architecture (Chapter 10)

GRPO Training Architecture

No Critic model needed, computes advantage via intra-group normalization, only 1.5Γ— model size in VRAM

LangGraph Three Core Concepts (Chapter 12)

LangGraph Core Concepts

State (shared state) Β· Node (processing unit) Β· Edge (execution flow control)

πŸ“– The above is just a selected preview β€” For the full 120+ architecture diagrams + 5 interactive animations, please read online


🎬 Interactive Animations

This book includes 5 interactive HTML animations to help you intuitively understand the dynamic processes of core concepts:

Animation Chapter Description
πŸ”„ Perceive-Think-Act Cycle Chapter 1 Dynamic demonstration of Agent's core loop
πŸ’‘ ReAct Reasoning Process Chapter 6 Shows the alternating Thought β†’ Action β†’ Observation process
πŸ”§ Function Calling Chapter 4 Complete tool invocation flow animation
πŸ“š RAG Retrieval Flow Chapter 7 From document chunking to vector retrieval to answer generation
🎯 GRPO Sampling Process Chapter 10 Visualization of intra-group multi-output sampling and reward normalization

πŸ’‘ Interactive animations are only available in the online e-book. Local builds can also preview them.


πŸ”₯ Core Topics at a Glance

🧠 Agent Core Architecture

  • Perceive β†’ Think β†’ Act Loop
  • ReAct Reasoning Framework
  • Task Decomposition & Planning
  • Reflection & Self-Correction

πŸ› οΈ Tools & Skills

  • Function Calling Mechanism
  • Custom Tool Design
  • Skill System Construction
  • Tool Description Best Practices

πŸ§ͺ Reinforcement Learning Training

  • SFT + LoRA Basic Training
  • PPO / DPO / GRPO Algorithm Deep-Dive
  • Complete Training Pipeline Hands-on
  • 2025–2026 Latest Research Advances

πŸ’Ύ Memory, Knowledge & Context

  • Short-term / Long-term / Working Memory
  • Vector Databases (Chroma / FAISS)
  • RAG Retrieval-Augmented Generation
  • Context Engineering & Attention Budget

🀝 Multi-Agent Collaboration & Communication

  • MCP / A2A / ANP Protocol Stack
  • Supervisor vs Decentralized Patterns
  • CrewAI / AutoGen Frameworks
  • LangGraph Stateful Agents

πŸ›‘οΈ Production Full Pipeline

  • Evaluation Benchmarks (GAIA / SWE-bench)
  • Security Defense & Sandbox Isolation
  • Containerized Deployment & Streaming
  • Observability & Cost Optimization

πŸš€ Quick Start

Local Build

# Install mdBook (choose one)
cargo install mdbook
# Or macOS: brew install mdbook

# Install mdbook-katex plugin (for math formula rendering)
cargo install mdbook-katex

# Clone the repository
git clone https://github.com/Haozhe-Xing/agent_learning.git
cd agent_learning

# Build both Chinese and English versions and start unified server (default port 3000)
./serve.sh

After starting, visit:

  • 🌐 Language Selection Home: http://localhost:3000
  • πŸ‡¨πŸ‡³ Chinese Version: http://localhost:3000/zh/
  • πŸ‡ΊπŸ‡Έ English Version: http://localhost:3000/en/

Environment Setup (For Code Practice)

# Python 3.11+
python -m venv venv
source venv/bin/activate  # Windows: venv\Scripts\activate

# Install core dependencies
pip install langchain langchain-openai langgraph openai anthropic

# Configure API Key
export OPENAI_API_KEY="your-key-here"

πŸ“Š Technology Stack

Python LangChain LangGraph OpenAI Anthropic FastAPI Docker Chroma FAISS mdBook KaTeX


🀝 Contributing

All forms of contribution are welcome!

  • πŸ› Found a bug: Submit an Issue
  • πŸ’‘ Content suggestions: Start a Discussion
  • πŸ“ Improve content: Fork β†’ Edit β†’ Submit PR
  • ⭐ Support the project: Give this repo a Star!

Contributing Guide

# Fork and clone
git clone https://github.com/YOUR_USERNAME/agent_learning.git

# Create a feature branch
git checkout -b feature/improve-chapter-4

# Local preview
./serve.sh

# Commit and push
git commit -m "feat: improve Chapter 4 tool calling code examples"
git push origin feature/improve-chapter-4

Content Organization Conventions

  • Each chapter is placed in a separate directory src/zh/chapter_xxx/ (Chinese) or src/en/chapter_xxx/ (English)
  • Chapter overview goes in README.md, sections are numbered as 01_xxx.md, 02_xxx.md
  • Chinese SVG illustrations go in src/zh/svg/, English versions in src/en/svg/, naming format: chapter_xxx_description.svg
  • Chinese interactive animations go in src/zh/animations/, English versions in src/en/animations/

πŸ“„ License

This project is open-sourced under the MIT License.


⭐ Star History

If this project helps you, please give it a Star ⭐ β€” it's the greatest encouragement for the author!


Built with ❀️, so that every developer can master AI Agent development

⬆ Back to Top

About

πŸ€– δ»Žι›ΆεΌ€ε§‹ε­¦ Agent - Agentic RL | Agent Memory | Agent 前沿进展

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • HTML 70.5%
  • CSS 10.3%
  • Handlebars 10.2%
  • Python 4.2%
  • Shell 4.0%
  • JavaScript 0.8%