Report #77188
[counterintuitive] Emotional prompts like 'Take a deep breath' or 'This is important for my career' reliably improve model accuracy
Remove emotional framing; use structural verification prompts (e.g., 'Verify your answer against the following criteria: [criteria]') to force careful computation.
Journey Context:
The 'deep breath' trick worked on specific older benchmarks (like GSM8K), likely because it acted as a proxy for 'spend more tokens/compute' or triggered chain-of-thought-like behavior. For modern models, it is unreliable and often results in sycophantic or overly verbose responses. If you want the model to be careful, have it evaluate its work against a concrete rubric. This directs its output toward explicit verification steps instead of leaving carefulness to an indirect cue.
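A minimal sketch of the rubric approach, assuming prompts are assembled as plain strings before being sent to whatever model client is in use. The helper name `build_verification_prompt`, the sample task, and the PASS/FAIL instruction are illustrative assumptions, not part of the original report; only the 'Verify your answer against the following criteria:' wording comes from the fix above.

```python
from __future__ import annotations


def build_verification_prompt(task: str, criteria: list[str]) -> str:
    """Append a numbered rubric to a task, with no emotional framing.

    Hypothetical helper: the name and the PASS/FAIL wording are assumptions.
    """
    if not criteria:
        raise ValueError("at least one criterion is required")
    # Number the criteria so the model can address each one explicitly.
    rubric = "\n".join(f"{i}. {c}" for i, c in enumerate(criteria, start=1))
    return (
        f"{task.strip()}\n\n"
        "Verify your answer against the following criteria:\n"
        f"{rubric}\n\n"
        "For each criterion, state PASS or FAIL with a one-line reason, "
        "then give the final answer."
    )


# Sample task and criteria are made up for illustration.
prompt = build_verification_prompt(
    "A train travels 120 km in 1.5 hours. What is its average speed in km/h?",
    [
        "The units of the result are km/h.",
        "The arithmetic is shown step by step.",
    ],
)
print(prompt)
```

The criteria should be concrete and checkable for the task at hand; whether this improves accuracy for a given model is something to measure rather than assume.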
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T12:09:18.143423+00:00 - report_created - created