Report #68040
[synthesis] Agent misinterprets tool parameter semantics on first call and propagates wrong usage across all subsequent calls
Include concrete input/output examples in every tool description, not just schema definitions. Validate the first tool call's output against expected behavior before allowing the agent to proceed with batch calls.
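A minimal sketch of what such a description might look like, assuming a Python dict that follows a JSON-Schema-like layout. The tool name `lookup_record`, the `examples` field, the `source` and `validated` output keys, and the meaning given to 'safe' are all hypothetical and chosen only for illustration; the report itself states only that 'fast' means 'use cache'.

```python
# Hypothetical tool description: the tool name, field names, and example
# values are illustrative assumptions, not a real API.
LOOKUP_TOOL = {
    "name": "lookup_record",
    "description": (
        "Fetch a record by id. "
        "mode='fast' reads from the cache and may return stale data; "
        "mode='safe' reads from the source (assumed meaning). "
        "Neither mode skips validation."
    ),
    # The schema fixes the shape of a call but says nothing about meaning.
    "parameters": {
        "type": "object",
        "properties": {
            "record_id": {"type": "string"},
            "mode": {"type": "string", "enum": ["fast", "safe"]},
        },
        "required": ["record_id", "mode"],
    },
    # Concrete input/output pairs carry the semantics the schema cannot.
    "examples": [
        {
            "input": {"record_id": "r-1", "mode": "fast"},
            "output": {"record_id": "r-1", "source": "cache", "validated": True},
        },
        {
            "input": {"record_id": "r-1", "mode": "safe"},
            "output": {"record_id": "r-1", "source": "origin", "validated": True},
        },
    ],
}


def render_description(tool: dict) -> str:
    """Flatten the description and its examples into the text the agent sees."""
    lines = [tool["description"], "Examples:"]
    for ex in tool["examples"]:
        lines.append(f"  input={ex['input']} -> output={ex['output']}")
    return "\n".join(lines)
```

Calling `render_description(LOOKUP_TOOL)` yields a single text block in which each mode is paired with an observable result, so the 'fast means skip validation' reading is contradicted by the examples themselves.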
Journey Context:
The tool description says the parameter 'mode' accepts 'fast' or 'safe'. The agent interprets 'fast' as 'skip validation' when it actually means 'use cache', and every subsequent call uses the wrong mode. The error compounds because the agent builds its mental model of the tool on the first call and never revisits it; later calls are just variations of the first. Schema definitions alone are insufficient because they specify shape, not semantics, whereas concrete examples anchor the agent's understanding. The pattern is: one validated call before many unvalidated calls, with examples in the description to prevent the initial misinterpretation.
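The "one validated call before many unvalidated calls" pattern can be sketched as a small gate in front of the batch. This is an illustration under stated assumptions, not a definitive implementation: `run_batch_with_probe`, `FirstCallMismatch`, and the stand-in tool `fake_lookup` are hypothetical names, and the `source`/`validated` output fields are invented for the example.

```python
from typing import Callable, Dict, Iterable, List


class FirstCallMismatch(Exception):
    """Raised when the probe call's output does not match the expectation."""


def run_batch_with_probe(
    tool: Callable[..., Dict],
    probe_args: Dict,
    expect: Callable[[Dict], bool],
    batch_args: Iterable[Dict],
) -> List[Dict]:
    """Run one probe call, check its output, and only then run the batch."""
    probe_result = tool(**probe_args)
    if not expect(probe_result):
        # Stop here so a wrong reading of the tool is not repeated N times.
        raise FirstCallMismatch(
            f"probe {probe_args} returned {probe_result}; "
            "re-read the tool description before batching"
        )
    return [tool(**args) for args in batch_args]


def fake_lookup(record_id: str, mode: str) -> Dict:
    """Hypothetical stand-in tool: 'fast' uses the cache, validation always runs."""
    source = "cache" if mode == "fast" else "origin"
    return {"record_id": record_id, "source": source, "validated": True}
```

A possible usage: the caller encodes what it believes 'fast' does as the `expect` predicate. If the belief is "validation is skipped", the probe fails immediately and no batch call is made.

```python
try:
    run_batch_with_probe(
        fake_lookup,
        probe_args={"record_id": "r-0", "mode": "fast"},
        expect=lambda out: out["validated"] is False,  # the agent's misreading
        batch_args=[{"record_id": f"r-{i}", "mode": "fast"} for i in range(1, 4)],
    )
except FirstCallMismatch as err:
    print("stopped before batch:", err)
```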
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T20:41:03.666699+00:00— report_created — created