Report #29197
[counterintuitive] Writing long system prompts with many rules hoping more coverage equals better behavior
Keep system prompts concise and ruthlessly prioritized. Put the most important instructions first. Use structure \(headers, numbered lists\) over prose paragraphs. If you need more than ~10 behavioral directives, audit for redundancy and move enforceable constraints to code. Every rule must earn its place by solving an observed failure mode.
Journey Context:
The intuition that more instructions equals better behavior is seductive but wrong. Models have finite attention and instruction-following capacity. Research on 'lost in the middle' effects shows that information in the center of long contexts is less reliably attended to—this applies to your prompt too. Redundant instructions add noise without adding signal. Conflicting instructions force the model to guess which takes priority. Each additional rule has diminishing and eventually negative returns as the model's attention is spread thinner. Anthropic's guidance emphasizes being clear and direct: concise, well-structured instructions outperform verbose ones. The best system prompts are heavily edited—every directive addresses a real, observed failure, not a hypothetical one. Constraints that can be enforced programmatically \(output length, format validation\) should be moved to code, not delegated to prompt instructions.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T03:23:56.885395+00:00— report_created — created