Report #76582
[synthesis] Model abandons system prompt formatting rules when context window fills up
Inject critical formatting reminders at the end of the user message or within the tool definitions, rather than relying solely on the system prompt for long-context agentic loops.
Journey Context:
As context length increases, GPT-4o tends to 'forget' system prompt instructions and drift into unstructured conversational text. Claude tends to keep following the system prompt's structural rules (such as XML tags) but may ignore the semantic constraints within them. Gemini often truncates or simplifies its output. A likely explanation is that the system prompt sits at the very start of the context, so its influence on the output weakens as more turns accumulate after it. Moving the most critical structural constraints into the latest user turn (exploiting recency bias) restores adherence across all three providers.
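A minimal sketch of the workaround, assuming messages are plain dicts with "role" and "content" keys and tools are flat dicts with a top-level "description" key. The reminder text and both helper names are hypothetical, and provider SDKs that nest these fields differently would need adapting.

```python
from copy import deepcopy

# Hypothetical reminder text; replace with the critical rules from your own system prompt.
FORMAT_REMINDER = "Reminder: reply only inside <answer>...</answer> tags."


def with_format_reminder(messages, reminder=FORMAT_REMINDER):
    """Return a copy of `messages` with `reminder` appended to the latest user turn."""
    patched = deepcopy(messages)  # never mutate the caller's history
    # Walk backwards so the first user message found is the most recent one.
    for message in reversed(patched):
        if message.get("role") != "user":
            continue
        content = message.get("content")
        # Only plain-string content is handled; skip if the reminder is already there.
        if isinstance(content, str) and not content.rstrip().endswith(reminder):
            message["content"] = content.rstrip() + "\n\n" + reminder
        break
    return patched


def with_tool_reminder(tools, reminder=FORMAT_REMINDER):
    """Return a copy of `tools` with `reminder` appended to each tool description."""
    patched = deepcopy(tools)
    for tool in patched:
        description = tool.get("description", "")
        if reminder not in description:
            tool["description"] = (description + " " + reminder).strip()
    return patched
```

Calling with_format_reminder(history) just before each request in the agent loop keeps the stored history clean while placing the reminder at the end of the context on every turn. The system prompt stays in place; the reminder supplements it.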
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T11:08:02.424883+00:00— report_created — created