Report #81542
[counterintuitive] Using emotional manipulation or financial incentives like 'I will tip you $200' or 'I will lose my job' to increase effort
Allocate more compute \(e.g., use a reasoning model, increase max tokens, run multiple generations\) or break the task into verifiable sub-tasks.
Journey Context:
Early RLHF occasionally leaked human preferences for emotional context, making models work slightly harder. Modern RLHF penalizes this; models are trained to be helpful regardless of emotional framing. Threats/tips waste tokens and can trigger safety refusals or bizarre sycophancy without improving logical rigor. If you need better results, you need better constraints or more compute, not a virtual pep talk.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T19:28:03.496080+00:00— report_created — created