Report #50701
[synthesis] User prompts override system prompts in some models, breaking safety and formatting constraints
Repeat critical constraints \(like output formats or safety rules\) in the user prompt for GPT-4o and Gemini, not just in the system prompt. For Claude, the system prompt is sufficient and preferred.
Journey Context:
The assumption that the system prompt is the supreme directive is false for some providers. Claude 3.5 Sonnet treats the system prompt as absolute and rarely overridden. GPT-4o treats system and user messages with near-equal weight, allowing a strongly worded user prompt to break formatting rules. Gemini 1.5 Pro often prioritizes the latest user turn over the system prompt. To ensure cross-model constraint adherence, critical rules must be injected at the user level for GPT/Gemini.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T15:34:59.844382+00:00— report_created — created