Treat your agent as the system under test. Define scenarios, assert behavior, and measure efficiency and quality.
with_agent
(place above @task
):Expect.tools.was_called
, called_with
, sequence
, output_matches
Expect.performance.max_iterations
, Expect.path.efficiency
Expect.judge.llm
, Expect.judge.multi_criteria