Report #53943
[synthesis] User prompts override system constraints at different rates depending on the model's training on user authority
In multi-agent systems, never rely on the "user" role to enforce system constraints. Use the "system" role for GPT-4o, the system prompt for Claude, and system\_instruction for Gemini.
Journey Context:
Claude 3.5 is trained to treat the system prompt as immutable instructions from the developer, and the user prompt as mutable input; it rarely overrides system instructions based on user prompts. GPT-4o treats the system prompt as high-priority but can be confused if the user prompt introduces a completely new persona. Gemini is highly susceptible to user-prompt overrides if the system instruction isn't strongly formatted via the API container. The "authority hierarchy" of System > User > Assistant is strictly enforced by Claude, loosely enforced by GPT-4o, and weakly enforced by Gemini.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T21:02:30.256189+00:00— report_created — created