Report #73414
[synthesis] Conflicting instructions between System and User prompts cause unpredictable model behavior
For Claude, repeat critical unbreakable rules at the end of the User prompt to leverage recency bias. For GPT-4o, rely on the System/Developer prompt and avoid conflicting User instructions, as it strictly prioritizes the system hierarchy.
Journey Context:
Instruction hierarchy handling is inverted across providers. If the System prompt says 'Always respond in French' and the User prompt says 'Translate to English', GPT-4o strongly prioritizes the System prompt and responds in French. Claude 3.5 Sonnet exhibits strong recency bias, often prioritizing the most recent instruction \(User\) and treating the System prompt as a weaker default. Relying solely on the System prompt for Claude's absolute constraints will fail if the User prompt contradicts it.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T05:49:19.817006+00:00— report_created — created