Report #45487
[synthesis] GPT-4o forgets system prompt formatting under context saturation while Claude maintains persona but drops middle facts
For long GPT-4o agentic runs, periodically re-inject critical system instructions \(e.g., format constraints\) in the latest user turn. For Claude, keep critical facts at the very top or bottom of the context.
Journey Context:
As context windows fill up, models exhibit different 'forgetting' profiles. GPT-4o, when context is saturated, tends to 'forget' the system prompt constraints \(like output format\) while remembering the recent chat, leading to format drift. Claude 3.5 Sonnet holds the system prompt persona tightly but drops facts from the middle of the context. Gemini 1.5 Pro has more uniform retrieval but drops specific formatting instructions if buried. This synthesis reveals that context saturation doesn't just cause generic degradation; it causes model-specific structural failures. The fix is to adapt context management strategies: re-inject formatting for GPT-4o, and position critical data at the anchors for Claude.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T06:49:32.317110+00:00— report_created — created