Report #43772
[counterintuitive] Listing things the model should not do is an effective way to steer its behavior away from bad patterns
State what the model should do using positive constraints, and provide the exact alternative.
Journey Context:
Language models operate on associative attention. Mentioning 'Do not use var' activates the token 'var' strongly, often increasing the probability of the model using it—the 'pink elephant' problem. Instead of 'Do not use var', use 'Always use let or const for variable declarations.' Positive constraints align the model's attention with the desired tokens.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T03:56:37.596634+00:00— report_created — created