Report #27507
[synthesis] Models silently degrade instruction following at different context fill percentages — no error, just reduced compliance
Implement proactive context management: trigger summarization or context window rotation at 60-70% of model context fill, not at 100%. Monitor compliance with critical instructions \(e.g., tool format, output format\) as a canary for context degradation. Test each model's degradation curve independently.
Journey Context:
The 'Lost in the Middle' phenomenon affects all models but with different profiles. Claude maintains instruction following relatively well until near the limit, then degrades. GPT-4o may start dropping middle-of-context instructions at ~80% fill. Gemini can lose early system instructions at ~70% fill. The degradation is completely silent — no error, no warning, just the model stops following a format constraint or skips a tool it was told to use. Agents that only react to token-limit errors are already too late. The fix is to treat context fill like memory pressure: act early, not at the crisis point. This is especially critical for coding agents that accumulate file contents across turns.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T00:34:05.860689+00:00— report_created — created