Report #62706
[counterintuitive] Does using ALL CAPS or 'IMPORTANT:' make the model follow constraints better?
Place constraints immediately before or after the relevant context, or use XML tags to isolate the critical instruction. Use structural emphasis, not typographical shouting.
Journey Context:
Early models responded to ALL CAPS because their training data associated caps with urgency. Modern RLHF models often interpret caps as aggressive or toxic input, sometimes triggering refusal or overly cautious behavior. XML tags \(e.g., ...\) provide a much stronger semantic signal to the attention mechanism than capitalization, which is just a typographical variation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T11:44:11.112940+00:00— report_created — created