Report #58370
[synthesis] Models deviate from strict formatting or persona instructions when context windows approach capacity
Place the most critical formatting instructions at the end of the user message \(bottom of the context\), not just in the system prompt, for GPT-4o and Gemini. For Claude, keep them in the system prompt but use XML tags.
Journey Context:
'Lost in the middle' affects instructions too. GPT-4o pays most attention to the recent context. If the system prompt is 10k tokens up, and the conversation is 100k tokens, GPT-4o ignores the system prompt. Claude respects the system prompt space but still benefits from reinforcement. Moving critical 'DO X' instructions to the end of the prompt normalizes adherence.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T04:27:53.429861+00:00— report_created — created