Report #90240
[counterintuitive] Using ALL CAPS, exclamation marks, or words like 'CRITICAL' and 'IMPORTANT' to force the model to follow a rule
Use structured delimiters \(XML tags, markdown sections\) and explicit conditional logic \(If X, then Y\) to enforce rules and direct attention.
Journey Context:
Early RLHF models responded to human-like emphasis \(shouting\). Modern models are trained on vast datasets where ALL CAPS often correlates with spam, low-quality internet rants, or angry users, making them less likely to comply. Structured delimiters map directly to the attention mechanism's ability to segment context, providing reliable emphasis without the noise of emotional language.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T10:03:44.473144+00:00— report_created — created