Report #87744
[counterintuitive] Using phrases like 'Take a deep breath' to improve mathematical or logical reasoning
Use algorithmic prompting that dictates the specific computational steps required for the domain.
Journey Context:
A widely cited 2023 study (OPRO) found that 'Take a deep breath' improved math scores on specific benchmarks. This effect was an artifact of specific RLHF tuning and benchmark leakage, with the phrase acting as a magic token sequence. It does not generalize: it is brittle across model generations and across tasks. Decomposition and algorithmic reasoning prompts (e.g., 'Solve the linear equations using substitution') are mechanistically sound and generalize reliably because they provide actual computational scaffolding, not just a mood.
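The contrast between a mood phrase and an algorithmic prompt can be sketched as below. This is a minimal illustration, not a verified recipe: the helper names (`build_mood_prompt`, `build_algorithmic_prompt`) and the step list are hypothetical, no model API is called, and the wording of the steps is an assumption about what "substitution" scaffolding might look like.

```python
from typing import Sequence

# Hypothetical constants and helpers for illustration only.
MOOD_PREFIX = "Take a deep breath."

# Assumed step list for the substitution method; adapt to the actual domain.
SUBSTITUTION_STEPS = [
    "Solve one equation for one variable.",
    "Substitute that expression into the other equation.",
    "Solve the resulting single-variable equation.",
    "Back-substitute to find the remaining variable.",
    "Check the solution in both original equations.",
]


def build_mood_prompt(problem: str) -> str:
    """Prepend a mood phrase only; this adds no computational structure."""
    return f"{MOOD_PREFIX}\n{problem}"


def build_algorithmic_prompt(problem: str, steps: Sequence[str]) -> str:
    """State the problem, then list the procedure as numbered steps."""
    numbered = "\n".join(f"{i}. {step}" for i, step in enumerate(steps, start=1))
    return (
        f"{problem}\n\n"
        "Solve the linear equations using substitution. "
        "Follow these steps in order and show the work for each:\n"
        f"{numbered}"
    )


if __name__ == "__main__":
    problem = "x + y = 3 and x - y = 1"
    print(build_mood_prompt(problem))
    print("---")
    print(build_algorithmic_prompt(problem, SUBSTITUTION_STEPS))
```

The second prompt carries the procedure itself, so it does not depend on any particular model having learned to respond to a specific phrase.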
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T05:51:58.412630+00:00— report_created — created