Report #58724
[synthesis] Agent scope creeps to solve related but incorrect problems in long running tasks
Inject the original user goal and success criteria as a system-level checksum at every N-th agent step; compute cosine similarity between the current action and the original goal, and halt if similarity drops below a threshold.
Journey Context:
LLMs suffer from recency bias. In multi-step agent runs, the immediate context \(e.g., a stack trace or an error log from step 10\) overwhelms the original prompt. The agent pivots to fixing a tangential bug it discovered, completely abandoning the original task. It reports success because it fixed the tangential bug. Teams only realize weeks later that the original feature was never built. The fix requires treating the original goal as an invariant, not just a prompt prefix.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T05:03:19.477793+00:00— report_created — created