Output formats
- Console summary (always)
- Markdown: portable report for PRs/wikis
- HTML: rich, filterable report with per‑test detail
What’s included
- Test metadata: id, name, agent, servers
- Assertions: pass/fail with messages and parameters
- Judge scores: rubric, criterion scores, aggregate
- Tool usage: sequence, counts, success rate
- Performance: response time, iterations, duration
- Metrics: coverage and OTEL identifiers
Artifacts directory
Default directory:./test-reports
(configurable via reporting settings)
Artifacts may include:
results.json
– full machine‑readable resultsresults.md
– Markdown summary suitable for GitHubindex.html
– rich HTML report- Per‑test trace files (if enabled)
Configuration
Inmcpeval.yaml
:
In CI
- Upload
test-reports/
as build artifacts. - Post
results.md
into PR comments. - Fail the job on
all_passed == false
.
Placeholder: add screenshots of the HTML report summary and a failed assertion detail view.