Report #38582
[counterintuitive] Using emotional prompts or bribes to improve code quality
Remove emotional framing; define clear success criteria and objective constraints.
Journey Context:
'I will tip you $200' or 'my job depends on this' was an artifact of RLHF optimization on early models where high-reward signals were associated with such text in the pre-training data. It is essentially prompt injection against the reward model. Modern training minimizes this correlation; it now just wastes tokens and distracts from the actual task constraints.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T19:14:16.699909+00:00— report_created — created