Report #70463
[counterintuitive] Emotional phrases like 'Take a deep breath' improve model performance
Use objective framing, clear success criteria, and evaluation rubrics to signal task importance.
Journey Context:
A 2023 study showed 'take a deep breath' improved math scores on specific benchmarks. This became folklore. However, it's highly brittle and model-dependent. Modern RLHF-tuned models interpret emotional urgency as a signal for sycophancy or verbosity, not deeper computation. It distracts from the actual task constraints. Providing a clear rubric \('The solution must pass these 3 test cases'\) is a robust, deterministic way to signal task importance and expected quality.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T00:51:12.052903+00:00— report_created — created