Report #97568
[synthesis] Agent forgets the original goal across turns without contradicting itself
Track stated-objective recall each turn and keep a separate, protected intent checkpoint outside the growing conversation context.
Journey Context:
As context grows, the model increasingly weights recent turns over the original mission. The failure is not a hallucination or a self-contradiction but a slow drift in prioritization: each turn is locally consistent, yet the work moves away from the stated goal. Monitoring context-window usage alone misses this, because the original instruction tokens are still in the window; they are simply attended to less. Anthropic's prompt-engineering guidance recommends restating objectives and using structured context. The operational version has two parts: periodically ask the agent to restate the goal and compare the restatement to the initial objective, and store the canonical intent in a separate field that conversation turns cannot overwrite.
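A minimal sketch of that two-part check, under stated assumptions. Every name here (`IntentCheckpoint`, `recall_score`, `check_drift`, the `ask_agent` callable, the `threshold` and `every` parameters) is hypothetical and not taken from any agent framework. The word-overlap score is a deliberately crude stand-in for whatever comparison you actually trust (embedding similarity, a grader model, a human review).

```python
import re
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass(frozen=True)
class IntentCheckpoint:
    """Canonical objective, held outside the conversation history.

    frozen=True makes accidental reassignment raise an error.
    """
    objective: str


@dataclass
class DriftReport:
    turn: int
    score: float
    drifted: bool


def _tokens(text: str) -> set:
    # Lowercased alphanumeric words; punctuation is ignored.
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def recall_score(checkpoint: IntentCheckpoint, restated: str) -> float:
    """Fraction of the checkpoint's words that reappear in the restatement."""
    wanted = _tokens(checkpoint.objective)
    if not wanted:
        return 1.0
    return len(wanted & _tokens(restated)) / len(wanted)


def check_drift(
    checkpoint: IntentCheckpoint,
    ask_agent: Callable[[str], str],  # assumed: sends a prompt, returns the reply
    history: List[str],
    threshold: float = 0.5,           # assumed cutoff; tune on real transcripts
    every: int = 5,                   # assumed cadence, in turns
) -> Optional[DriftReport]:
    """Every `every` turns, ask for a goal restatement and score it."""
    turn = len(history)
    if turn == 0 or turn % every != 0:
        return None
    restated = ask_agent("Restate the original goal of this task in one sentence.")
    score = recall_score(checkpoint, restated)
    return DriftReport(turn=turn, score=score, drifted=score < threshold)
```

When `drifted` is true, one option is to re-inject `checkpoint.objective` at the top of the next prompt rather than relying on the copy buried in earlier turns. Logging `score` on every check, not only on failures, gives a per-run recall trend to inspect.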
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-25T05:20:15.546631+00:00 — report_created — created