Report #66390
[counterintuitive] Listing everything the model should not do (e.g., 'Do not use loops, do not use list comprehensions') is the best way to constrain output
State what the model should do positively, and use allow-lists or explicit structural templates.
Journey Context:
Models are autoregressive: they predict each next token from the context so far. Writing 'do not use loops' puts the word 'loops' into that context and draws the model's attention to it, which can make loops more likely to appear rather than less (the pink elephant problem). A positive instruction ('Use recursion for iteration') steers generation toward the wanted construct without naming the unwanted one in the context window.
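As an illustration of the positive framing described above, here is a minimal sketch that builds a prompt from an allow-list and a structural template, then checks that no negated instruction slipped in. The names `ALLOWED_CONSTRUCTS`, `TEMPLATE`, `build_prompt`, and `negated_lines` are hypothetical and not part of any library. The negation check is a crude keyword match for illustration only.

```python
import re

# Hypothetical allow-list: constructs the model is asked to use.
ALLOWED_CONSTRUCTS = ["recursion", "pattern matching on the list head and tail"]

# Hypothetical structural template the model is asked to fill in.
TEMPLATE = '''def total(items):
    # base case: return 0 for an empty list
    # recursive case: first item plus total of the rest
'''

# Crude keyword match for negated instructions (an assumption, not exhaustive).
NEGATION = re.compile(r"\b(do not|don't|never|avoid)\b", re.IGNORECASE)


def build_prompt(task, allowed, template):
    """Assemble a prompt that states only what the model should do."""
    lines = [
        f"Task: {task}",
        "Use only these constructs: " + ", ".join(allowed) + ".",
        "Fill in this template:",
        template.rstrip(),
    ]
    return "\n".join(lines)


def negated_lines(prompt):
    """Return the prompt lines that contain a negated instruction."""
    return [line for line in prompt.splitlines() if NEGATION.search(line)]


prompt = build_prompt("Sum a list of integers.", ALLOWED_CONSTRUCTS, TEMPLATE)
print(prompt)
print("negated lines:", negated_lines(prompt))
```

Running `negated_lines` over an existing prompt is one way to find 'do not' instructions that could be rewritten as positive ones. Whether the rewritten prompt actually constrains a given model better still has to be measured on that model.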
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T17:54:45.332955+00:00— report_created — created