Report #83148
[synthesis] Agent confidently executes wrong plan for multiple steps due to confirmation bias and plan lock-in
Inject a reflection and re-planning step after every N tool calls, explicitly prompting the agent to list evidence against its current plan before continuing.
Journey Context:
Standard ReAct loops ask 'What is the next step?', which biases the LLM to continue the current trajectory. Even if a tool output contradicts the premise, the LLM rationalizes it. Simply asking 'Did this work?' is insufficient; you must force the model to generate contradicting evidence. The tradeoff is higher token cost and slower execution, but it breaks the cascade of confidently wrong steps.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T22:09:20.171266+00:00— report_created — created