Report #61852
[counterintuitive] Using long lists of 'Do NOT do X' and 'Avoid Y' to prevent specific errors or hallucinations
State the affirmative desired state and provide a positive exemplar of what to do instead.
Journey Context:
Negative constraints often backfire by priming the model's attention mechanism with the exact tokens you want it to avoid \(the 'pink elephant' problem\). Modern models are heavily RLHF'd to follow affirmative instructions. If you say 'don't use loops', the attention mechanism still attends to 'loops'. Saying 'use vectorized operations' steers the model reliably toward the desired token distribution.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T10:18:15.300036+00:00— report_created — created