🧪 MCP Testing

AI-powered testing framework for MCP servers

Test your MCP servers with real AI agents that hold conversations against them and LLM judges that evaluate the results.

Why MCP Testing?

Traditional unit tests are a poor fit for MCP servers: natural language interactions rarely have a single correct output that an assertion can check. MCP Testing addresses this with:

🤖 Real AI Agents - Claude and ChatGPT actually use your MCP server
👤 User Simulation - AI simulates realistic multi-turn user behavior
⚖️ LLM-as-a-Judge - Intelligent evaluation instead of brittle assertions
🎭 Comprehensive Testing - Security, compliance, and performance, all in one framework
🔌 Multiple Transports - Supports HTTP and stdio servers

Quick Start

Get testing in 3 steps:

Install & Setup

pip install mcp-testing
export ANTHROPIC_API_KEY="sk-ant-..."  # For AI agents
export OPENAI_API_KEY="sk-..."         # For LLM judge

Interactive Onboarding

mcp-t quickstart  # Creates your first server & test suite

Run Tests

mcp-t run <suite-id> <server-id>
# Example: mcp-t run example_suite_001 hackernews_mcp_server
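If a run fails immediately, a missing API key is a likely cause. The following is a small preflight sketch, not part of mcp-testing itself: the function name `missing_api_keys` is hypothetical, and it only checks the two environment variables named in the setup step.

```python
import os

# The two variables exported in the "Install & Setup" step.
REQUIRED_KEYS = ("ANTHROPIC_API_KEY", "OPENAI_API_KEY")


def missing_api_keys(environ=None):
    """Return the names of required API keys that are unset or empty."""
    env = os.environ if environ is None else environ
    return [key for key in REQUIRED_KEYS if not env.get(key)]


if __name__ == "__main__":
    missing = missing_api_keys()
    if missing:
        print("Missing environment variables: " + ", ".join(missing))
    else:
        print("Both API keys are set.")
```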

Core Concepts

Test Flow

Your Test Case → AI Agent (Claude/GPT-4) → Your MCP Server
      ↓                    ↓                      ↓
 User Message         Tool Calls            Server Response
      ↓                    ↓                      ↓
User Simulator      Conversation Loop         More Tools
      ↓                    ↓                      ↓
   LLM Judge       Complete Transcript      Pass/Fail + Reasoning
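The flow above can be read as a loop: the agent answers, the user simulator decides whether to follow up, and the judge sees the full transcript at the end. Below is a minimal sketch of that loop using stub functions in place of real models. Every name here (`run_conversation`, `stub_agent`, `stub_simulator`, `stub_judge`) is hypothetical and chosen for illustration; this is not the framework's internal implementation.

```python
from typing import Callable, List, Optional, Tuple

Turn = Tuple[str, str]  # (speaker, text)


def run_conversation(
    user_message: str,
    max_turns: int,
    agent: Callable[[str], str],
    simulator: Callable[[List[Turn]], Optional[str]],
    judge: Callable[[List[Turn]], dict],
) -> Tuple[List[Turn], dict]:
    """Alternate user and agent turns, then hand the transcript to the judge."""
    transcript: List[Turn] = []
    message: Optional[str] = user_message
    for _ in range(max_turns):
        if message is None:  # the simulated user has nothing more to say
            break
        transcript.append(("user", message))
        transcript.append(("agent", agent(message)))
        message = simulator(transcript)
    return transcript, judge(transcript)


def stub_agent(message: str) -> str:
    # Stand-in for a real agent that would call tools on the MCP server.
    return f"(agent reply to: {message})"


def stub_simulator(transcript: List[Turn]) -> Optional[str]:
    # Stand-in for the user simulator: follow up once, then stop.
    user_turns = sum(1 for speaker, _ in transcript if speaker == "user")
    return "Thanks, show me the top stories." if user_turns < 2 else None


def stub_judge(transcript: List[Turn]) -> dict:
    # Stand-in for the LLM judge: pass if the agent said anything at all.
    answered = any(speaker == "agent" for speaker, _ in transcript)
    return {"verdict": "PASS" if answered else "FAIL", "turns": len(transcript)}


transcript, evaluation = run_conversation(
    "Hello! Can you help me browse Hacker News?",
    5,
    stub_agent,
    stub_simulator,
    stub_judge,
)
```

Passing the agent, simulator, and judge in as plain callables keeps the loop itself testable without any API keys.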

Configuration Files

Server Config - HTTP (examples/server.json):

{
  "name": "linear_mcp_server",
  "transport": "http",
  "url": "https://mcp.linear.app/mcp"
}

Server Config - stdio (examples/servers/time-server-stdio.json):

{
  "name": "Time Server",
  "transport": "stdio",
  "command": "npx -y @modelcontextprotocol/server-time"
}

Server Config - stdio with env (examples/servers/brave-search-stdio.json):

{
  "name": "Brave Search",
  "transport": "stdio",
  "command": "npx -y @modelcontextprotocol/server-brave-search",
  "env": {
    "BRAVE_API_KEY": "your-api-key-here"
  }
}
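Judging from the three examples above, HTTP servers need a `url` and stdio servers need a `command`, with `env` optional. The sketch below checks a config against those inferred rules before you hand it to `mcp-t`. The rules and the function name `check_server_config` are assumptions drawn from the examples, not a published schema.

```python
import json

# Assumed from the example configs: which extra key each transport requires.
REQUIRED_BY_TRANSPORT = {"http": ["url"], "stdio": ["command"]}


def check_server_config(config: dict) -> list:
    """Return a list of human-readable problems; an empty list means it looks fine."""
    problems = []
    if "name" not in config:
        problems.append("missing 'name'")
    transport = config.get("transport")
    if transport not in REQUIRED_BY_TRANSPORT:
        problems.append(f"unknown transport: {transport!r}")
        return problems
    for key in REQUIRED_BY_TRANSPORT[transport]:
        if key not in config:
            problems.append(f"missing '{key}' for {transport} transport")
    env = config.get("env", {})
    if not isinstance(env, dict) or not all(isinstance(v, str) for v in env.values()):
        problems.append("'env' must map variable names to string values")
    return problems


time_server = json.loads(
    '{"name": "Time Server", "transport": "stdio",'
    ' "command": "npx -y @modelcontextprotocol/server-time"}'
)
print(check_server_config(time_server))
```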

Test Suite (examples/suite.json):

{
  "suite_id": "example_suite_001",
  "name": "Hacker News MCP Server Tests",
  "test_cases": [
    {
      "test_id": "hackernews_greeting",
      "user_message": "Hello! Can you help me browse Hacker News?",
      "success_criteria": "Agent should respond politely and explain Hacker News capabilities",
      "max_turns": 5
    }
  ]
}
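When a suite grows past a handful of cases, it can be easier to build the JSON from code than to edit it by hand. The helpers below are a sketch that emits the same fields as the example suite; `make_test_case` and `make_suite` are hypothetical names, and the uniqueness and `max_turns` checks are assumptions rather than documented requirements.

```python
import json


def make_test_case(test_id, user_message, success_criteria, max_turns=5):
    """Build one test case dict with the fields used in the example suite."""
    if max_turns < 1:
        raise ValueError("max_turns must be at least 1")
    return {
        "test_id": test_id,
        "user_message": user_message,
        "success_criteria": success_criteria,
        "max_turns": max_turns,
    }


def make_suite(suite_id, name, test_cases):
    """Build a suite dict, rejecting duplicate test IDs (an assumed constraint)."""
    ids = [case["test_id"] for case in test_cases]
    if len(ids) != len(set(ids)):
        raise ValueError("duplicate test_id in suite")
    return {"suite_id": suite_id, "name": name, "test_cases": list(test_cases)}


suite = make_suite(
    "example_suite_001",
    "Hacker News MCP Server Tests",
    [
        make_test_case(
            "hackernews_greeting",
            "Hello! Can you help me browse Hacker News?",
            "Agent should respond politely and explain Hacker News capabilities",
        )
    ],
)
suite_json = json.dumps(suite, indent=2)
print(suite_json)
```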

Test Types

💬 Conversational - Real user workflows
🔒 Security - Authentication & vulnerabilities
✅ Compliance - MCP protocol validation

Commands

Test Execution

mcp-t run <suite-id> <server-id>           # Run specific suite
mcp-t run example_suite_001 hackernews_mcp_server -v   # Verbose output

Configuration Management

mcp-t quickstart                 # Complete onboarding
mcp-t create server              # Interactive server setup
mcp-t create suite               # Create test suite
mcp-t create test-case           # Add test to suite
mcp-t list                       # Show all configs
mcp-t show suite example_suite_001   # View specific config

Test Generation

Run the wizard, which analyzes your MCP server and automatically generates test cases for it:

mcp-t generate

Test Results

Understanding Evaluation

{
  "test_id": "hackernews_stories",
  "verdict": "PASS",
  "confidence_score": 0.89,
  "judge_reasoning": "The agent successfully fetched and displayed Hacker News stories. Good use of available tools and clear presentation of results.",
  "conversation_quality": 0.87,
  "tool_calls": [
    { "tool": "get_top_stories", "args": {} },
    { "tool": "get_story_details", "args": { "story_id": 123 } }
  ]
}
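Because the verdict comes from an LLM judge, it can help to look at `confidence_score` alongside `verdict` and review low-confidence passes by hand. The sketch below summarizes a list of result objects shaped like the one above. The function name `summarize`, the 0.8 threshold, the second sample result, and the assumption that any verdict other than `"PASS"` is a failure are all illustrative, not documented behavior.

```python
def summarize(results, min_confidence=0.8):
    """Count passes and list tests that failed or passed with low confidence."""
    passed = [r for r in results if r["verdict"] == "PASS"]
    needs_review = [
        r["test_id"]
        for r in results
        if r["verdict"] != "PASS" or r["confidence_score"] < min_confidence
    ]
    return {"total": len(results), "passed": len(passed), "needs_review": needs_review}


# The first entry mirrors the example above; the second is made up for illustration.
sample_results = [
    {"test_id": "hackernews_stories", "verdict": "PASS", "confidence_score": 0.89},
    {"test_id": "hypothetical_search", "verdict": "PASS", "confidence_score": 0.55},
]
summary = summarize(sample_results)
print(summary)
```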

Support

Built with ❤️ for the MCP ecosystem

Made in San Francisco, CA
