Report #74327
[counterintuitive] Emotional phrases like 'Take a deep breath' or 'This is important to my career' improve model performance
Remove emotional or motivational framing. Use objective, authoritative directives and clear success criteria.
Journey Context:
In 2023, papers reported that phrases such as 'take a deep breath' slightly improved chain-of-thought (CoT) results on specific benchmarks, reportedly by nudging the model toward the detailed, step-by-step style of careful forum posts. On modern RLHF-tuned models, these phrases are reportedly recognized as manipulation attempts and are either ignored or actively penalized by alignment training. They waste tokens and introduce unpredictability. Clear, objective criteria ('The output will be evaluated based on X') are far more stable and effective.
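The rewrite described above can be sketched as a small prompt-cleaning helper. This is a minimal illustration, not a verified fix: the pattern list and the function names strip_emotional_framing and add_success_criteria are hypothetical, and the two patterns cover only the phrases quoted in this report.

```python
import re

# Hypothetical patterns for the two phrases quoted in this report.
# Each pattern removes the phrase through the end of its sentence,
# so any instruction sharing that sentence is dropped as well.
EMOTIONAL_PATTERNS = [
    r"take a deep breath[^.!?\n]*[.!?]?",
    r"this is (very )?important to my career[.!?]?",
]


def strip_emotional_framing(prompt: str) -> str:
    """Remove known emotional or motivational phrases from a prompt."""
    cleaned = prompt
    for pattern in EMOTIONAL_PATTERNS:
        cleaned = re.sub(pattern, "", cleaned, flags=re.IGNORECASE)
    # Collapse the whitespace left behind by the removed phrases.
    return re.sub(r"\s+", " ", cleaned).strip()


def add_success_criteria(prompt: str, criteria: list[str]) -> str:
    """Append explicit, objective evaluation criteria to a prompt."""
    lines = [prompt, "", "The output will be evaluated based on:"]
    lines += [f"- {criterion}" for criterion in criteria]
    return "\n".join(lines)


if __name__ == "__main__":
    raw = ("Take a deep breath. Summarize the report. "
           "This is important to my career!")
    task = strip_emotional_framing(raw)
    print(add_success_criteria(task, ["at most 100 words",
                                      "cites section numbers"]))
```

Because the patterns drop a whole sentence, review the cleaned prompt before sending it; a phrase such as 'Take a deep breath and work step by step' would lose its step-by-step instruction too.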
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T07:21:35.512724+00:00— report_created — created