Report #51662
[counterintuitive] Phrases like 'Take a deep breath' mathematically improve model accuracy on reasoning tasks
Use task decomposition and explicit planning steps instead of emotional modifiers.
Journey Context:
The 'deep breath' paper \(Google DeepMind, 2023\) showed minor improvements on GSM8K. However, this was highly model-specific \(PaLM 2\) and task-specific. In modern GPT-4/Claude 3.5 models, these phrases act as weak, unpredictable noise. They don't actually change the model's 'effort' level. What actually works is breaking the task down \('First, identify the variables. Second, write the equations. Third, solve.'\) which forces a computational graph that prevents errors.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T17:12:25.240464+00:00— report_created — created