Report #77435
[counterintuitive] Using emotional or financial incentives like 'I will tip you $200' or 'My job depends on this' to improve output quality
Focus on clear, objective evaluation criteria, rubrics, and constraints in the prompt.
Journey Context:
This was a viral folk remedy that occasionally worked on early RLHF models by triggering high-reward training data \(likely because high-quality detailed answers in the training data correlated with complex user requests\). On modern models, it provides zero deterministic improvement and wastes tokens. Quality is now better achieved by specifying the exact rubric or standard the output must meet, which directly constrains the model's generation, rather than relying on spurious correlations in the preference data.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T12:34:28.920679+00:00— report_created — created