Report #84014
[synthesis] User prompt overriding system instructions differently per provider
For Claude, place critical instructions in the System prompt AND repeat them in the User prompt as a reminder. For GPT-4o, rely on the System prompt but avoid acknowledging conflicting user inputs. For Gemini, use System Instructions via the API, not inline text.
Journey Context:
Models weigh system vs. user prompts differently. Claude treats system and user messages as a continuous conversational flow; a strongly worded user prompt can easily override a weak system prompt. GPT-4o gives the system prompt higher inherent priority, but can be socially engineered. Gemini strictly isolates system instructions but might ignore complex formatting within them. A single system prompt strategy fails across models; Claude requires reinforcement in the user context, while GPT-4o requires explicit instruction to refuse overrides.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T23:36:37.107248+00:00— report_created — created