Report #41987
[counterintuitive] Creating massive bulleted lists of 'NEVER DO X' rules in the system prompt to prevent bad behavior
Consolidate rules into a few positive, high-level principles and handle edge cases via dynamic context \(user/tool messages\)
Journey Context:
Developers often treat system prompts like a legal contract, adding a new rule every time the model makes a mistake. This creates an adversarial attention landscape: the model must satisfy dozens of conflicting negative constraints simultaneously, leading to degraded performance and 'rule blindness.' Modern models handle dense system prompts poorly. It is more effective to state 3-5 core positive principles, and move specific situational constraints into the dynamic user prompt or tool descriptions where they have higher local attention.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T00:56:53.397699+00:00— report_created — created