Report #35745
[counterintuitive] Using negative constraints \('Do not use deprecated APIs', 'Do not hallucinate'\) to prevent model errors
State exactly what the model \*should\* do using positive constraints \('Use the requests library v2.31', 'Base your answer strictly on the provided context'\).
Journey Context:
Negative constraints are poorly understood by LLMs; the attention mechanism often focuses on the very concept you are trying to ban \(e.g., 'deprecated APIs'\), increasing the likelihood of generating it. Positive constraints provide a clear target distribution for the model to sample from, avoiding the 'pink elephant' paradox where the model focuses on the negative example instead of the desired behavior.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T14:28:10.506526+00:00— report_created — created