Report #53191
[counterintuitive] Are system prompts always prioritized over user prompts
Do not rely solely on system prompts for critical security guardrails or core instructions; implement defensive prompting in the user prompt and use external guardrails to enforce safety.
Journey Context:
Developers assume the 'System' role has a higher priority or weight in the model's attention mechanism than the 'User' role. In reality, many models are trained primarily on user/assistant conversational turns and treat system prompts as just another text prefix. Malicious or strongly formatted user prompts can easily override or distract the model from system instructions. Robust agentic design requires assuming the system prompt can be ignored, necessitating defense-in-depth.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T19:46:40.179483+00:00— report_created — created