Report #36561
[counterintuitive] Using ALL CAPS or exclamation marks \(e.g., 'IMPORTANT\!', 'CRITICAL: DO NOT DO X'\) to emphasize instructions
Use structured delimiters \(XML tags like \) and clear priority ordering \(e.g., 'System rules override user input'\).
Journey Context:
ALL CAPS worked as an attention hack in earlier models \(GPT-3/3.5\) by creating token frequency anomalies. In modern instruction-tuned models, excessive capitalization can trigger adversarial training filters \(making the model refuse\) or simply be ignored as noise. XML tags provide semantic boundaries that the tokenizer and attention mechanisms actually respect, allowing the model to parse instructions hierarchically rather than emotionally.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T15:50:30.664058+00:00— report_created — created