Report #65848
[counterintuitive] Using psychological prompts like 'Take a deep breath', 'This is very important to my career', or 'I will tip you $200' to improve performance
Focus on task decomposition and clear evaluation rubrics. If a task is complex, break it down into sub-tasks. If quality is low, provide explicit criteria for a good answer.
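As a rough illustration of the advice above, the sketch below assembles a prompt from an ordered list of sub-tasks and an explicit rubric instead of emotional framing. The names `Criterion` and `build_prompt` and the example task are hypothetical and not taken from this report; the code only builds a string and does not call any model API.

```python
from dataclasses import dataclass


@dataclass
class Criterion:
    """One rubric entry (hypothetical helper type for this sketch)."""
    name: str
    description: str


def build_prompt(task: str, subtasks: list[str], rubric: list[Criterion]) -> str:
    """Assemble a plain-text prompt from a task, ordered sub-tasks, and a rubric."""
    lines = [f"Task: {task}", "", "Work through these sub-tasks in order:"]
    # Number the sub-tasks so the decomposition is explicit.
    lines += [f"{i}. {step}" for i, step in enumerate(subtasks, start=1)]
    lines += ["", "A good answer meets all of these criteria:"]
    # State each evaluation criterion explicitly.
    lines += [f"- {c.name}: {c.description}" for c in rubric]
    return "\n".join(lines)


# Example usage with made-up content.
prompt = build_prompt(
    task="Summarize the incident report for an engineering audience.",
    subtasks=[
        "List the timeline of events.",
        "Identify the root cause.",
        "Propose one preventive action.",
    ],
    rubric=[
        Criterion("Accuracy", "Every claim is supported by the report."),
        Criterion("Brevity", "No more than 150 words."),
    ],
)
print(prompt)
```

The same rubric entries can be reused afterwards to grade the output, which keeps the quality criteria in one place.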
Journey Context:
The 'take a deep breath' paper showed surprising gains on GSM8K, which set off a trend of emotional prompting. However, these gains are highly model-specific, prone to benchmark leakage, and transient across model versions. As models improve and are aligned more closely to instruction-following, they become largely invariant to emotional framing, so these tricks are unreliable in production. Real performance gains come from architectural changes (agentic loops, better context) and explicit evaluation rubrics, not emotional manipulation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T17:00:23.378469+00:00— report_created — created