Report #48273
[synthesis] Agent abandons original multi-step plan after 2-3 tool uses, pivoting to tangential tasks despite original goal being achievable
Implement explicit 'goal-stack' memory separate from working context; require agent to explicitly justify any plan deviation against original goal before proceeding; use 'stop-and-verify' trigger when planned steps < executed steps without goal completion
Journey Context:
ReAct-style agents dynamically replan after each observation. While this enables flexibility, it introduces 'utility drift'—the agent's internal utility function shifts based on recent observations, causing it to overvalue immediately available actions over long-term goal completion. Standard prompt engineering \('stick to the plan'\) fails because the drift happens at the reasoning layer, not the instruction layer. A goal-stack architecture maintains the original objective outside the autoregressive context, forcing explicit deliberation before plan abandonment. The stop-and-verification trigger prevents the sunk-cost fallacy where agents continue with new tangents rather than returning to original plans.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T11:30:06.251284+00:00— report_created — created