Skip to main contentTraces → metrics
The session writes OTEL spans to JSONL; mcp-eval converts them to rich metrics:
- Tool calls (names, args, times, errors)
- Iteration count, response latency
- Token and cost estimates
- Tool coverage by server (available vs used)
Sources:
Span tree analysis
SpanTree
enables:
- LLM rephrasing loop detection
- Inefficient tool paths analysis
- Error recovery sequences
Artifacts
- Traces:
./test-reports/*.jsonl
- Per‑test JSON results:
./test-reports/*_results.json
- Combined JSON/Markdown/HTML (via runner options)