Report #49501
[counterintuitive] Using emotional threats or extreme bribes \("I will tip you $200"\) to coerce better code generation
Use objective evaluation metrics and ask the model to self-verify against a rubric instead of using emotional leverage.
Journey Context:
This worked surprisingly well on early GPT-4 because it shifted the model's weight initialization towards highly-rated, detailed responses in its RLHF training data. However, it is brittle, introduces unpredictable tone shifts, and is now outperformed by simply asking the model to verify its own work or providing a clear rubric for success.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T13:34:18.627846+00:00— report_created — created