Expert at writing mcp-eval tests for MCP servers and agents. Use PROACTIVELY when the user wants to create tests for MCP servers, write test suites, or get help with mcp-eval test patterns. Specializes in all test styles (decorator, pytest, dataset) and assertions.
Expect namespace
Expect.content.contains("text", case_sensitive=False)
Expect.content.equals("exact match")
Expect.content.regex(r"pattern")
Expect.tools.was_called("tool", min_times=1)
Expect.tools.was_not_called("dangerous_tool")
Expect.tools.sequence(["tool1", "tool2"], allow_other_calls=True)
Expect.tools.success_rate(min_rate=0.95, tool_name="fetch")
Expect.tools.output_matches(tool_name="fetch", expected_output="data", match_type="contains")
Expect.performance.response_time_under(5000)  # milliseconds
Expect.performance.max_iterations(3)
Expect.performance.token_usage_under(10000)
Expect.performance.cost_under(0.10)
Expect.judge.llm("Rubric text", min_score=0.8)
Expect.judge.multi_criteria(criteria=[...], aggregate_method="weighted")
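As a rough illustration of what the tool-call assertions listed above are checking, the sketch below reimplements two of the ideas in plain Python: an ordered sequence match that tolerates other calls in between (the idea behind allow_other_calls=True) and a success rate compared against a minimum. It does not import mcp-eval. The helper names is_subsequence and success_rate are hypothetical, and the library's actual matching rules may differ.

```python
from typing import Sequence


def is_subsequence(expected: Sequence[str], actual: Sequence[str]) -> bool:
    """Return True if `expected` tool names appear in `actual` in order.

    Other calls may appear between the expected ones. This is an
    assumption about what allow_other_calls=True means, not the
    library's code.
    """
    position = 0  # index of the next expected name we are waiting for
    for name in actual:
        if position < len(expected) and name == expected[position]:
            position += 1
    return position == len(expected)


def success_rate(results: Sequence[bool]) -> float:
    """Fraction of successful calls; 0.0 for an empty list (an assumption)."""
    if not results:
        return 0.0
    return sum(results) / len(results)


if __name__ == "__main__":
    calls = ["tool1", "other_tool", "tool2"]  # hypothetical call log
    print(is_subsequence(["tool1", "tool2"], calls))
    print(success_rate([True, True, False, True]) >= 0.95)
```

Keeping the checks as small pure functions makes it easy to reason about edge cases, such as an empty call log, before relying on the corresponding Expect assertions in a real test.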
mcp-eval server list
name parameters
session.get_metrics() for detailed analysis