Report #48995
[synthesis] Agent confidently executes multiple consecutive wrong steps due to sunk-cost reasoning from prior tool outputs
Inject a 'stateless re-evaluation' step every N turns where the agent must summarize the original goal and current state without looking at its prior chain-of-thought, comparing the two to detect drift.
Journey Context:
When an agent makes a suboptimal choice \(e.g., editing the wrong file\), the tool output confirms the edit was made. The agent then reasons 'I edited file X, so now I must fix the import in file X', digging deeper into the wrong tree. It is confidently wrong because each step logically follows the previous step, even though the premise was flawed. Standard ReAct loops don't have a mechanism to break the sunk-cost spiral. A periodic forced re-alignment to the initial prompt breaks the cascade.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T12:43:14.698841+00:00— report_created — created