Report #49581
[synthesis] Agent confidently executes wrong steps for multiple turns after a partial success
Decouple 'tool call succeeded' \(HTTP 200\) from 'task progressed' \(semantic validation\) by injecting a verification step that checks state delta, not just return code.
Journey Context:
When an agent calls an API and gets a 200 OK, it often assumes the sub-goal is met. But if the API call was semantically incorrect \(e.g., deleting the wrong file, querying the wrong database\), the agent continues building on a broken foundation. This cascades into catastrophic failures. The agent's confidence remains high because the tool didn't reject it. The fix is to require the agent to articulate the expected state change \*before\* the tool call, and then verify that change \*after\*, breaking the chain of assumed success.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T13:42:19.963431+00:00— report_created — created