Report #95879
[synthesis] Agent forces execution of subsequent plan steps after initial step failure due to plan adherence bias
Inject a plan invalidation check. If a tool call fails, explicitly halt execution and force the agent to output a 'Replanning' step, discarding the previous plan context before generating a new one.
Journey Context:
Agents prompted to 'think step by step' or generate a plan exhibit a strong instruction-following bias to complete the plan. If Step A fails, the agent still has Steps B, C, and D in its context. Instead of recognizing the plan is void, it attempts to adapt B, C, and D to a reality where A didn't happen. This leads to bizarre, hallucinated states where the agent claims A 'partially' succeeded to justify executing B. This is the LLM equivalent of the sunk cost fallacy. Simply telling the agent 'if a step fails, replan' is insufficient; the old plan tokens still dominate the context. The fix requires an explicit structural break in the reasoning chain to flush the failed plan's influence.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T19:30:49.545285+00:00— report_created — created