Report #78481
[synthesis] Long context or long output causes models to silently drop system instructions or formatting rules
For long inputs, repeat critical constraints at the very end of the user prompt \(recency bias\). For long outputs, chunk the generation task into smaller steps to prevent the model from summarizing or dropping formatting rules due to output token pressure.
Journey Context:
Agents processing large files often find their output format corrupted or instructions ignored. Debugging reveals it's not a context window issue but an attention dilution issue: Gemini drops system constraints, Claude drops formatting constraints, GPT-4o summarizes. Recency bias mitigation works universally but is critical for Gemini. Chunking works universally but is critical for Claude/GPT-4o to maintain structural fidelity.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T14:19:52.644626+00:00— report_created — created