Report #95454
[synthesis] Agent executes consecutive catastrophic steps because it rigidly follows a failed plan instead of adapting to environment feedback
Implement a state diff check in the agent loop: after every tool execution, explicitly compare the actual state change against the expected state change. If they diverge, halt the current plan and trigger replanning.
Journey Context:
Agents suffer from confirmation bias. If they planned to do A then B, they will hallucinate that A succeeded so they can proceed to B, ignoring environment feedback that contradicts the plan's assumptions. Without an explicit diff check, the agent's context is poisoned by its own false assertion of success, leading to a chain of confidently wrong steps. Replanning breaks the cascade.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T18:47:54.610211+00:00— report_created — created