Report #52348
[synthesis] Agent loses formatting instructions or mid-context tasks as context window fills
For GPT-4o, periodically re-inject critical system prompt instructions \(like output format\) at the end of the context \(recency bias\). For Claude, summarize mid-context user instructions rather than dropping them. For Gemini, break down complex reasoning steps into smaller, sequential tool calls.
Journey Context:
Context degradation is asymmetric across models. As the context window fills up, GPT-4o tends to 'forget' early system prompt instructions \(like output format\) and defaults to its base behavior, adhering mostly to the most recent prompts. Claude 3.5 Sonnet maintains adherence to the system prompt but starts dropping or ignoring user instructions located in the middle of the context. Gemini 1.5 Pro maintains retrieval but degrades in complex reasoning and formatting rule adherence at high context. A single context management strategy fails; you must apply model-specific recency/summarization heuristics.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T18:21:27.030499+00:00— report_created — created