🚀 Sage Multi-Agent Framework

A production-ready, modular, and intelligent multi-agent orchestration framework for complex problem solving

Sage is an advanced multi-agent system that intelligently breaks down complex tasks into manageable subtasks through seamless agent collaboration. Built with enterprise-grade reliability and extensibility in mind, it provides Deep Research Mode for comprehensive analysis and Rapid Execution Mode for quick task completion.

✨ Key Highlights

🧠 Intelligent Task Decomposition - Automatically breaks complex problems into manageable subtasks with dependency tracking
🔄 Agent Orchestration - Seamless coordination between specialized agents with robust error handling
🛠️ Extensible Tool System - Plugin-based architecture with MCP server support and auto-discovery
⚡ Dual Execution Modes - Choose between thorough analysis or rapid execution based on your needs
🌐 Interactive Web Interface - Beautiful Streamlit-based UI with real-time streaming visualization
📊 Advanced Token Tracking - Comprehensive usage statistics and cost monitoring across all agents
⚙️ Rich Configuration - Environment variables, config files, CLI options, and runtime updates
🔧 Developer Friendly - Clean APIs, comprehensive docs, examples, and extensive error handling
🎯 Production Ready - Robust error recovery, logging, retry mechanisms, and performance optimization

🤖 Supported Models

Sage has been extensively tested with the following language models:

✅ Officially Tested Models

🔥 DeepSeek-V3 - deepseek-chat - Excellent performance for complex reasoning
🌟 Qwen-3 - qwen-turbo, qwen-plus - Outstanding Chinese and English capabilities
🧠 GPT-4.1 - gpt-4-turbo, gpt-4o - Premium performance for all tasks
⚡ Claude-3.5 Sonnet - claude-3-5-sonnet-20241022 - Exceptional reasoning abilities

🌐 Compatible Providers

OpenAI - Direct API integration
OpenRouter - Access to 200+ models
Anthropic - Claude family models
Google AI - Gemini series
DeepSeek - Native API support
Alibaba Cloud - Qwen series
Mistral AI - All Mistral models

💡 Note: While Sage is optimized for the models listed above, it's designed to work with any OpenAI-compatible API endpoint.

🏗️ Architecture Overview

graph TD
    A[🔍 User Input] --> B(📋 Task Analysis Agent)
    B --> C{✅ Analysis Complete?}
    C -->|Yes| D[🎯 Task Decompose Agent]
    C -->|No| B
    D --> E[📝 Planning Agent]
    E --> F[⚡ Executor Agent]
    F --> G[👁️ Observation Agent]
    G --> H{🎪 Task Complete?}
    H -->|Yes| I[📄 Summary Agent]
    H -->|No| E
    I --> J[🎉 Final Output]
    
    subgraph "🛠️ Tool Ecosystem"
        F --> K[🔧 Tool Manager]
        K -->|Local| L[📱 Built-in Tools]
        K -->|Remote| M[🌐 MCP Servers]
        M --> N[🔌 External APIs]
        L --> O[📊 Results]
        N --> O
        O --> G
    end
    
    subgraph "📊 Token Tracking"
        B --> P[💰 Usage Monitor]
        D --> P
        E --> P
        F --> P
        G --> P
        I --> P
        P --> Q[📈 Cost Analytics]
    end
    
    style A fill:#e1f5fe
    style J fill:#e8f5e8
    style K fill:#fff3e0
    style P fill:#f3e5f5

🚀 Quick Start

Installation

git clone https://github.com/ZHangZHengEric/Sage.git
cd Sage
pip install -r requirements.txt

🎮 Interactive Web Demo

Experience Sage through our beautiful web interface with real-time agent visualization:

# Using DeepSeek-V3 (Recommended)
streamlit run examples/sage_demo.py -- \
  --api_key YOUR_DEEPSEEK_API_KEY \
  --model deepseek-chat \
  --base_url https://api.deepseek.com/v1

# Using OpenRouter (Multiple Models)
streamlit run examples/sage_demo.py -- \
  --api_key YOUR_OPENROUTER_API_KEY \
  --model deepseek/deepseek-chat \
  --base_url https://openrouter.ai/api/v1

# Using GPT-4
streamlit run examples/sage_demo.py -- \
  --api_key YOUR_OPENAI_API_KEY \
  --model gpt-4o \
  --base_url https://api.openai.com/v1

🌐 Modern Web Application (FastAPI + React)

Experience Sage through our cutting-edge web application featuring a modern React frontend with FastAPI backend:

Features:

🤖 Multi-Agent Collaboration - Visual workflow with decomposition, planning, execution, observation, and summary
🧠 Deep Thinking Mode - Expandable thought bubbles showing agent reasoning process
🚀 FastAPI Backend - High-performance async API server with streaming support
⚛️ React Frontend - Modern responsive UI with Ant Design components
📡 Real-time Communication - WebSocket + SSE dual support for live updates
🎨 Beautiful Interface - Collapsible deep thinking bubbles with modern design
🔧 Tool Management - Automatic tool discovery and management
📱 Responsive Design - Adapts to all screen sizes
🔧 TypeScript Support - Full type safety throughout

Quick Start:

cd examples/fastapi_react_demo

# Backend setup
python start_backend.py

# Frontend setup (new terminal)
cd frontend
npm install
npm run dev

Access the application at http://localhost:8080. For detailed setup instructions, see the FastAPI React Demo README.

💻 Command Line Usage

from agents.agent.agent_controller import AgentController
from agents.tool.tool_manager import ToolManager
from openai import OpenAI

# Example with DeepSeek-V3
model = OpenAI(
    api_key="your-deepseek-api-key", 
    base_url="https://api.deepseek.com/v1"
)
tool_manager = ToolManager()
controller = AgentController(model, {
    "model": "deepseek-chat",
    "temperature": 0.7,
    "max_tokens": 4096
})

# Execute task with comprehensive tracking
messages = [{"role": "user", "content": "Analyze the current trends in AI and provide actionable insights"}]

# Use system_context to provide additional runtime information
system_context = {
    "task_priority": "high",
    "deadline": "2024-01-15",
    "target_audience": "technical team"
}

result = controller.run(
    messages, 
    tool_manager, 
    deep_thinking=True, 
    summary=True,
    system_context=system_context
)

# Access results and usage statistics
print("Final Output:", result['final_output']['content'])
print("Token Usage:", result['token_usage'])
print("Execution Time:", result['execution_time'])

🎯 Core Features

🤖 Multi-Agent Collaboration (v0.9)

Task Analysis Agent: Enhanced deep understanding with context awareness and unified system prompt management
Task Decompose Agent: New intelligent task breakdown with dependency analysis and parallel execution planning
Planning Agent: Strategic decomposition with dependency management and optimal tool selection
Executor Agent: Intelligent tool execution with error recovery, retry mechanisms, and parallel processing
Observation Agent: Advanced progress monitoring with completion detection and quality assessment
Summary Agent: Comprehensive result synthesis with structured output and actionable insights

🛠️ Advanced Tool System

Plugin Architecture: Hot-reloadable tool development with automatic registration and versioning
MCP Server Support: Seamless integration with Model Context Protocol servers and remote APIs
Auto-Discovery: Intelligent tool detection from directories, modules, and remote endpoints
Type Safety: Comprehensive parameter validation with schema enforcement and runtime checks
Error Handling: Robust error recovery, timeout management, retry strategies, and detailed logging
Performance Monitoring: Tool execution time tracking, bottleneck detection, and optimization suggestions

📊 Token Usage & Cost Monitoring

Real-time Tracking: Monitor token consumption across all agents and operations
Detailed Analytics: Input, output, cached, and reasoning token breakdown
Cost Estimation: Calculate costs based on model pricing and usage patterns
Performance Metrics: Track execution time, success rates, and efficiency
Export Capabilities: CSV, JSON export for further analysis

# Get comprehensive token statistics
stats = controller.get_comprehensive_token_stats()
print(f"Total Tokens: {stats['total_tokens']}")
print(f"Total Cost: ${stats['estimated_cost']:.4f}")
print(f"Agent Breakdown: {stats['agent_breakdown']}")

# Print detailed statistics
controller.print_comprehensive_token_stats()

⚙️ Rich Configuration System

Environment Variables: SAGE_DEBUG, OPENAI_API_KEY, SAGE_MAX_LOOP_COUNT, etc.
Config Files: YAML/JSON configuration with validation and hot-reload
Runtime Updates: Dynamic configuration changes without restart
CLI Options: Comprehensive command-line interface with help system
Profile Management: Save and load configuration profiles

🔄 Execution Modes

Deep Research Mode (Recommended for Complex Tasks)

# Enhanced with system_context support
result = controller.run(
    messages, 
    tool_manager,
    deep_thinking=True,    # Enable comprehensive task analysis
    summary=True,          # Generate detailed summary with insights
    deep_research=True,    # Full multi-agent pipeline with decomposition
    system_context={       # Unified system context management
        "project_context": "AI research project",
        "constraints": ["time: 2 hours", "resources: limited"],
        "preferences": {"output_format": "detailed_report"}
    }
)

# Streaming version with real-time updates
for chunk in controller.run_stream(
    messages, 
    tool_manager,
    deep_thinking=True,
    summary=True,
    deep_research=True,
    system_context=system_context  # Consistent system context across streaming
):
    for message in chunk:
        print(f"[{message['type']}] {message['role']}: {message['show_content']}")

Standard Execution Mode (Balanced Performance)

result = controller.run(
    messages, 
    tool_manager,
    deep_thinking=True,    # Enable task analysis
    summary=True,          # Generate summary
    deep_research=False,   # Skip detailed decomposition phase
    system_context=system_context  # Runtime context support
)

Rapid Execution Mode (Maximum Speed)

result = controller.run(
    messages,
    tool_manager, 
    deep_thinking=False,   # Skip analysis
    deep_research=False,   # Direct execution
    system_context=system_context  # Even rapid mode supports context
)

📊 Real-time Streaming & Monitoring

Watch your agents work in real-time with detailed progress tracking and performance metrics:

import time

start_time = time.time()
token_count = 0

# Enhanced streaming with system context
system_context = {
    "monitoring_level": "detailed",
    "progress_tracking": True,
    "performance_metrics": True
}

for chunk in controller.run_stream(messages, tool_manager, system_context=system_context):
    for message in chunk:
        # Display agent activity
        print(f"🤖 {message['role']}: {message['show_content']}")
        
        # Track progress
        if 'usage' in message:
            token_count += message['usage'].get('total_tokens', 0)
        
        # Real-time statistics
        elapsed = time.time() - start_time
        print(f"⏱️  Time: {elapsed:.1f}s | 🪙 Tokens: {token_count}")

🔧 Advanced Tool Development

Create sophisticated custom tools with full framework integration:

from agents.tool.tool_base import ToolBase
from typing import Dict, Any, Optional
import requests

class DataAnalysisTool(ToolBase):
    """Advanced data analysis tool with caching and validation"""
    
    @ToolBase.tool()
    def analyze_data(self, 
                    data_source: str, 
                    analysis_type: str,
                    options: Optional[Dict[str, Any]] = None) -> Dict[str, Any]:
        """
        Perform comprehensive data analysis with visualization
        
        Args:
            data_source: URL or path to data source
            analysis_type: Type of analysis (statistical/trend/correlation)
            options: Additional analysis options
        """
        try:
            # Your implementation here
            result = self._perform_analysis(data_source, analysis_type, options)
            
            return {
                "success": True,
                "data": result,
                "metadata": {
                    "execution_time": self.execution_time,
                    "data_size": len(result.get("records", [])),
                    "analysis_type": analysis_type
                }
            }
        except Exception as e:
            return {
                "success": False,
                "error": str(e),
                "error_type": type(e).__name__
            }
    
    def _perform_analysis(self, source, analysis_type, options):
        # Implementation details
        pass

🛡️ Error Handling & Reliability

Sage includes comprehensive error handling and recovery mechanisms:

from agents.utils.exceptions import SageException, with_retry, exponential_backoff

# Automatic retry with exponential backoff
@with_retry(exponential_backoff(max_attempts=3, base_delay=1.0))
def robust_execution():
    return controller.run(messages, tool_manager)

# Custom error handling
try:
    result = controller.run(messages, tool_manager)
except SageException as e:
    print(f"Sage Error: {e}")
    print(f"Error Code: {e.error_code}")
    print(f"Recovery Suggestions: {e.recovery_suggestions}")

📈 Performance Monitoring

Monitor and optimize your agent performance:

# Enable detailed performance tracking
controller.enable_performance_monitoring()

# Execute with monitoring
result = controller.run(messages, tool_manager)

# Analyze performance
perf_stats = controller.get_performance_stats()
print(f"Execution Time: {perf_stats['total_time']:.2f}s")
print(f"Agent Breakdown: {perf_stats['agent_times']}")
print(f"Tool Usage: {perf_stats['tool_stats']}")
print(f"Bottlenecks: {perf_stats['bottlenecks']}")

# Export performance data
controller.export_performance_data("performance_report.json")

🔌 MCP Server Integration

Seamlessly integrate with Model Context Protocol servers:

# Start MCP servers
python mcp_servers/weather_server.py &
python mcp_servers/database_server.py &

# Use in your application
tool_manager.register_mcp_server("weather", "http://localhost:8001")
tool_manager.register_mcp_server("database", "http://localhost:8002")

# Tools are automatically available
result = controller.run([{
    "role": "user", 
    "content": "Get weather for Tokyo and save to database"
}], tool_manager)

📚 Documentation

Quick Start Guide - Get up and running in 5 minutes
Architecture Overview - Detailed system design
API Reference - Complete API documentation
Tool Development - Create custom tools
Configuration Guide - Advanced configuration options
Examples - Real-world usage examples

🎯 Production Deployment

Sage is production-ready with enterprise features:

from agents.config.settings import get_settings, update_settings

# Configure for production
update_settings(
    debug=False,
    max_loop_count=5,
    tool_timeout=30,
    enable_logging=True,
    log_level="INFO"
)

# Initialize with production settings
controller = AgentController.from_config("production.yaml")

🔄 Recent Updates (v0.9)

✨ New Features

🎯 Task Decompose Agent: New specialized agent for intelligent task breakdown and dependency management
🔧 Unified System Prompt Management: Centralized system context handling with system_context parameter across all agents
📊 Enhanced Token Tracking: Comprehensive usage statistics with detailed cost monitoring and optimization suggestions
🛡️ Robust Error Handling: Advanced error recovery, retry mechanisms, and comprehensive exception handling
⚡ Performance Optimization: 50% faster execution with improved resource management and parallel processing
🌐 Modern Web Application: Complete FastAPI + React web application with TypeScript support and real-time collaboration

🔧 Technical Improvements

🏗️ Agent Architecture: Added Task Decompose Agent to the workflow for better task breakdown
💬 System Context API: New system_context parameter for unified runtime information management
📝 System Prompt Organization: Centralized system prompt management with SYSTEM_PREFIX_DEFAULT constants
💾 Memory Management: Optimized memory usage for long-running tasks and large-scale deployments
🌐 Streaming Enhancement: Improved real-time updates with better UI feedback and WebSocket reliability
📊 Token Analytics: Comprehensive usage tracking with cost optimization suggestions and budget management

🐛 Bug Fixes

Fixed streaming response interruption issues
Resolved tool execution timeout problems
Improved session management and cleanup
Enhanced error message clarity and debugging information
Fixed memory leaks in long-running sessions

📋 API Changes

New Parameter: system_context added to run() and run_stream() methods for unified context management
Workflow Enhancement: Added Task Decompose Agent between Task Analysis and Planning phases
System Prompt: All agents now use unified system prompt management with SYSTEM_PREFIX_DEFAULT constants
Backward Compatibility: All existing APIs remain fully compatible

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

OpenAI for the powerful language models
DeepSeek for the exceptional V3 model
Alibaba Cloud for the Qwen series
The open-source community for inspiration and tools
All contributors who help make Sage better

_{Built with ❤️ by the Sage team}

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
.github/workflows		.github/workflows
agents		agents
assets		assets
docs		docs
examples		examples
mcp_servers		mcp_servers
.gitignore		.gitignore
README.md		README.md
README_CN.md		README_CN.md
requirements.txt		requirements.txt

ericsmall44/Sage

Folders and files

Latest commit

History

Repository files navigation