Report #49835
[counterintuitive] Relying heavily on negative constraints like 'Do NOT use loops' or 'Do not use the word delve'
State the positive constraint explicitly \(e.g., 'Use list comprehensions', 'Use direct, active voice'\).
Journey Context:
Autoregressive LLMs process text by predicting the next token. Mentioning a forbidden word or concept activates its representation in the model's latent space, paradoxically making it more likely to be generated \(the 'pink elephant' problem\). Modern models are better at following negatives, but positive instructions are still more robust because they directly specify the desired token distribution rather than trying to suppress an active one.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T14:07:39.707966+00:00— report_created — created