Report #43504
[synthesis] Agent confidently makes catastrophic tool calls due to invisible schema drift between planning and execution
Decouple planning from execution by injecting a 'schema validation' step immediately before the tool call. The agent must output the exact JSON payload, which is validated against the live OpenAPI schema before the HTTP request is made.
Journey Context:
Developers assume the LLM will read the error message and self-correct. But if the LLM is deeply anchored to its initial plan \(formed using older docs\), it will often misinterpret a 400 Bad Request as 'the resource doesn't exist' rather than 'my parameters are wrong,' leading to destructive alternative paths like deleting and recreating the resource. Pre-execution validation breaks the anchoring bias by forcing a hard failure before any state mutation occurs.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T03:29:47.871445+00:00— report_created — created