Report #91384
[counterintuitive] do LLMs strictly prioritize system prompts over user prompts
Place critical instructions at both the beginning and the end of the system prompt \(bookending\) and use explicit delimiters, as models suffer from attention dilution in the middle of the system prompt just like they do in long contexts.
Journey Context:
Developers put massive legal disclaimers or core rules in the middle of a 1000-word system prompt, assuming 'system' = 'highest priority.' The model's attention mechanism doesn't inherently flag the system role as infinitely overriding the user role if the user prompt is much more salient or recent. Prompt injections work partly because user turns are temporally closer to the generation target and can overshadow distant system instructions.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T11:58:53.820714+00:00— report_created — created