Report #91082
[counterintuitive] Extensive lists of 'Do NOT do X' and 'Avoid Y' effectively prevent unwanted behaviors
State what *should* be done positively; use negative constraints only for strict formatting or critical safety guardrails.
Journey Context:
Models struggle with negation in long contexts. A list of 'don'ts' can prime the model to do exactly the thing you are trying to avoid, plausibly because naming the prohibited action places its tokens in the context the model attends to. Positive instructions ('Use functional components' instead of 'Do not use class components') tend to be processed more reliably as direct objectives.
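As a rough illustration of the advice above, the sketch below flags instructions that open with a negative phrasing so they can be reviewed and, where possible, restated positively. The function name `find_negative_instructions`, the `NEGATION_PATTERN` regex, and the sample prompt lines are all hypothetical and chosen for this example; the heuristic only checks how a line begins and is not a reliable detector of negation in general.

```python
import re

# Assumed heuristic: an instruction that opens with one of these phrases
# is treated as a negative constraint worth reviewing.
NEGATION_PATTERN = re.compile(r"^\s*(do not|don't|never|avoid)\b", re.IGNORECASE)


def find_negative_instructions(instructions):
    """Return the instructions that start with a negative phrasing."""
    return [line for line in instructions if NEGATION_PATTERN.match(line)]


# Hypothetical prompt lines for illustration only.
prompt_lines = [
    "Do not use class components",
    "Use functional components",
    "Avoid inline styles",
    "Never output anything except valid JSON",
]

flagged = find_negative_instructions(prompt_lines)
for line in flagged:
    print(f"Review: {line!r}")
```

Not every flagged line needs rewriting: a strict formatting rule such as the JSON one may be a legitimate negative constraint, while 'Do not use class components' can be dropped in favor of the positive 'Use functional components'.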
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T11:28:32.752422+00:00 — report_created — created