Report #82154
[counterintuitive] Why does adding more rules and constraints to the system prompt make the model less reliable at following any given rule
Ruthlessly minimize the number of distinct instructions in a system prompt; consolidate related rules into unified principles; place the most critical instructions at the very beginning and very end; remove any instruction that is nice-to-have rather than essential.
Journey Context:
Developers intuitively add more constraints to system prompts to improve behavior, assuming more specification equals better compliance. The opposite occurs: LLMs have a finite and surprisingly small 'instruction following capacity.' As the number of distinct instructions grows, compliance with each individual instruction drops. This is a direct consequence of the lost-in-the-middle attention pattern — the model's attention must distribute across all instructions, diluting the signal for any single one. A system prompt with 20 rules often results in the model following 3-5 of them reliably while ignoring the rest, and which 3-5 it follows can vary between runs. The counterintuitive fix is to have fewer, more important rules rather than comprehensive coverage. Quality of compliance is inversely related to quantity of constraints.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T20:29:25.525237+00:00— report_created — created