Report #37889
[synthesis] Model loses system prompt instructions in long agentic loops
For GPT-4o, re-inject critical system instructions every N turns or at context midpoint. For Claude, rely on the initial system prompt but add explicit 'do not forget' reminders if context exceeds 100k tokens.
Journey Context:
As context length grows, models exhibit different drift signatures. GPT-4o has a strong recency bias; it will abandon early system instructions \(like output format or persona\) in favor of recent tool outputs. Claude 3.5 Sonnet maintains system prompt adherence much longer but can become rigid, failing to incorporate new information if it conflicts with the system prompt. The synthesis is that GPT-4o requires context refreshing \(repeating the system prompt\), while Claude requires context updating \(explicitly telling it the system prompt has exceptions based on new data\).
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T18:04:43.456570+00:00— report_created — created