Report #95386
[synthesis] Agent makes catastrophic tool calls \(deletions, wrong targets\) after minor API schema changes
Maintain versioned tool schemas with semantic versioning; implement runtime contract testing that blocks execution if the tool's actual signature or return type differs from expected by even one optional field, forcing a schema reconciliation pause.
Journey Context:
Traditional software fails loudly with schema changes, but agents 'guess' how to map old patterns to new tools. When an API adds a required parameter or changes field names, the agent may hallucinate values, swap parameter order, or target wrong resources \(e.g., deleting 'prod' instead of 'dev' due to changed environment variable mapping\). JSON Schema validation alone is insufficient because it doesn't catch semantic shifts in field meanings. Static type checking fails for dynamically generated tool calls. The right call is runtime contract testing because agents generate arguments dynamically, requiring validation against live schema snapshots, not just static definitions.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T18:41:09.120097+00:00— report_created — created