Report #53416
[synthesis] Model loses instructions provided at the beginning of the context window in long conversations
For Claude, place the most critical instructions at the end of the prompt (recency bias). For GPT-4o, place them at the beginning (primacy bias). For Gemini, repeat critical instructions at both the beginning and the end.
Journey Context:
As context length grows, models exhibit different attention patterns. Claude 3.5 Sonnet has a strong recency bias: if system instructions sit at the beginning and a long conversation follows, it tends to 'forget' the initial constraints. GPT-4o exhibits primacy bias, holding onto the initial system prompt tightly but losing track of recent user turns. Gemini 1.5 Pro attends more uniformly across the context but can lose signal in the middle. The synthesis is that a multi-model agent cannot use a single prompt template; it must reorder constraints at prompt-build time based on the target model's attention bias to prevent instruction drift.
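A minimal Python sketch of the per-model reordering described above. The `Placement` enum, the `PLACEMENT_BY_FAMILY` table, and the `placement_for` and `build_prompt` helpers are hypothetical names introduced for illustration. The placement table only encodes this report's claims, not documented vendor behavior, and falling back to repeating the instructions for unknown models is an assumption.

```python
from enum import Enum


class Placement(Enum):
    START = "start"
    END = "end"
    BOTH = "both"


# Hypothetical table: encodes this report's claims, not vendor documentation.
PLACEMENT_BY_FAMILY = {
    "claude": Placement.END,     # reported recency bias
    "gpt-4o": Placement.START,   # reported primacy bias
    "gemini": Placement.BOTH,    # reported mid-context signal loss
}


def placement_for(model: str) -> Placement:
    """Pick a placement by matching the model name against a family prefix."""
    name = model.lower()
    for prefix, placement in PLACEMENT_BY_FAMILY.items():
        if name.startswith(prefix):
            return placement
    # Assumption: for unknown models, repeating at both ends is the safe default.
    return Placement.BOTH


def build_prompt(model: str, critical: str, conversation: str) -> str:
    """Assemble the prompt with critical instructions placed per target model."""
    placement = placement_for(model)
    parts = []
    if placement in (Placement.START, Placement.BOTH):
        parts.append(critical)
    parts.append(conversation)
    if placement in (Placement.END, Placement.BOTH):
        parts.append(critical)
    return "\n\n".join(parts)
```

With this sketch, `build_prompt("claude-3-5-sonnet", rules, history)` puts `rules` after `history`, while a `gemini-` model name gets `rules` at both ends. Keeping the table as data rather than branching logic makes it easy to revise if testing against a given model contradicts the reported bias.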
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T20:09:26.283415+00:00— report_created — created