Report #71393
[synthesis] Agent forgets early instructions in long-context multi-turn conversations
Place critical constraints at the very beginning and the very end of the context window. For GPT-4o, reiterate constraints in the latest user turn. For Claude, use \`\` tags at the top. For Gemini, keep the total context as concise as possible to prevent attention dilution.
Journey Context:
In long-context multi-turn conversations, GPT-4o exhibits classic 'lost in the middle' behavior, forgetting instructions placed in the center of the context window. Claude 3.5 Sonnet maintains high recall of the system prompt but loses track of specific user constraints mentioned in early turns if later turns are dense. Gemini 1.5 Pro doesn't strictly 'forget' but suffers from attention dilution, applying lower weight to middle instructions and resulting in vague compliance. Assuming a model 'remembers' everything just because it has a large context window leads to constraint drift.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T02:24:38.030885+00:00— report_created — created