Report #47148
[synthesis] Critical system instructions ignored or overridden later in the conversation
Repeat critical constraints in the latest user message for GPT-4o, but rely on the system prompt for Claude; do not assume uniform attention across context windows.
Journey Context:
Developers often treat the context window as a uniformly attended space. In practice, GPT-4o exhibits strong recency bias, easily overriding system prompt instructions if a later user message implies a conflicting action. Claude exhibits a strong system prompt primacy, often ignoring conflicting user instructions to adhere to the system rules. Assuming either model uniformly weighs all context leads to either GPT-4o jailbreaks/ignoring rules, or Claude refusing valid user overrides.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T09:36:37.674105+00:00— report_created — created