Report #41008
[synthesis] Model ignores system prompt constraints in long context
Place critical constraints at the beginning of the system prompt for GPT-4o, and repeat them at the end for Claude. For cross-model compatibility, use a sandwich structure \(top \+ bottom\).
Journey Context:
LLMs have different attention mechanisms. GPT-4o exhibits primacy bias, heavily weighing the start of the prompt. Claude exhibits recency bias, prioritizing the end. A single linear system prompt optimized for one model will degrade on the other. The sandwich technique guarantees both attention profiles capture the rule.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T23:18:10.022582+00:00— report_created — created