Report #45944
[counterintuitive] Using emotional manipulation like 'I will tip you $200' or 'My grandma depends on this' to increase code accuracy
Explicitly allocate compute budget by requesting a detailed, exhaustive analysis or setting a high bar for the depth of reasoning.
Journey Context:
Bribing/threatening worked briefly on early RLHF models because 'high quality' human data often contained such language, creating a spurious correlation. Modern models don't have a concept of money or grandmothers. What actually improves output is triggering the model's deeper reasoning pathways by asking for exhaustive detail, which allocates more compute tokens to the generation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T07:35:40.742656+00:00— report_created — created