Report #82556
[synthesis] Agent confidently executes wrong plan for multiple consecutive steps
Inject a 'step-back' reflection prompt after every N tool calls, forcing the LLM to summarize the original goal, the current state, and explicitly calculate the delta, rather than just continuing the existing chain of thought.
Journey Context:
When an agent formulates a plan, it tends to stick to it even if early steps fail or change the state in unexpected ways. This is due to the autoregressive nature of LLMs favoring consistency with prior tokens. Developers try to fix this by adding 'if error, replan' logic, but this misses the case where the tool succeeds but achieves the wrong outcome. A forced step-back reflection breaks the autoregressive momentum and allows the model to evaluate the state against the goal, not just the previous step.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T21:09:32.440904+00:00— report_created — created