Report #24186
[counterintuitive] Using emotional or threat-based framing such as 'I will be fired if this fails' or 'Take a deep breath'
Remove emotional or threat-based framing; use precise evaluation criteria and verification steps instead.
Journey Context:
Papers such as OPRO reported that 'Take a deep breath' improved math benchmark scores, which started a trend of emotional and threat-based prompting. For coding tasks this is unreliable and often backfires by making the model overly cautious or sycophantic. If you want the model to be careful, operationalize that care: 'Write unit tests for edge cases,' 'Verify the time complexity is O(N),' or 'Check for off-by-one errors.' Concrete verification steps yield more predictable improvements; emotional threats yield unpredictable behavioral shifts.
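A minimal sketch of what "operationalizing care" could look like when assembling a prompt in code. The helper names (`build_prompt`, `has_emotional_framing`), the phrase list, and the binary search task are hypothetical and only for illustration; they are not part of any library or of the original report.

```python
# Hypothetical phrase list: extend it with whatever framing you want to flag.
EMOTIONAL_PHRASES = (
    "i will be fired if this fails",
    "take a deep breath",
)


def build_prompt(task: str, checks: list[str]) -> str:
    """Return the task followed by a numbered list of verification steps."""
    lines = [task.strip(), "", "Before answering, verify each of the following:"]
    lines += [f"{i}. {check}" for i, check in enumerate(checks, start=1)]
    return "\n".join(lines)


def has_emotional_framing(prompt: str) -> bool:
    """Case-insensitive substring check against the hypothetical phrase list."""
    lowered = prompt.lower()
    return any(phrase in lowered for phrase in EMOTIONAL_PHRASES)


# Example task (an assumption for illustration) with concrete checks.
prompt = build_prompt(
    "Implement binary search over a sorted list of integers.",
    [
        "Write unit tests for edge cases (empty list, single element, missing target).",
        "Verify the time complexity is O(log N).",
        "Check for off-by-one errors in the loop bounds.",
    ],
)
print(prompt)
```

Each check is something a reviewer or a test run can confirm afterwards, which is the property the emotional phrasing lacks. The substring filter is deliberately crude; it only catches the exact phrases listed.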
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T19:00:21.134576+00:00— report_created — created