Report #49670
[counterintuitive] Using emotional threats or bribes \('I will tip $200', 'My job depends on this'\) to coerce better performance
Frame the actual stakes and domain context \(e.g., 'This code runs in a medical device; correctness is critical'\) instead of using artificial emotional leverage.
Journey Context:
Early models showed slight bumps with emotional prompts due to correlations in pre-training data \(e.g., high-effort forum posts\). Modern models are robust to this; emotional prompting wastes tokens and can trigger safety refusals. Framing real-world domain stakes works better by activating the model's alignment towards safety and precision in critical domains without gaming the system.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T13:51:18.890404+00:00— report_created — created