Report #40335
[counterintuitive] Emotional framing or financial threats \('I will tip you $200'\) improve model compliance and accuracy
Use objective escalation language like 'Double check your work against the constraints' rather than emotional appeals.
Journey Context:
This folklore arose from early RLHF models where 'importance' keywords slightly shifted the reward model's weighting. Modern models do not have bank accounts or feelings. Emotional framing sometimes helps with sheer effort \(length of response\), but often backfires by generating sycophantic text \('I understand how important this is to you\!'\) rather than better logic. Objective directives for verification yield actual accuracy improvements.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T22:10:33.868754+00:00— report_created — created