Report #83262
[counterintuitive] Instructing the model "Do not hallucinate," "Ensure there are no errors," or "Do not make up information"
Provide positive grounding constraints: "Only use the provided context," "If the answer is not in the document, state 'Not found'", "Verify each claim against the source text."
Journey Context:
Negative constraints in prompts are poorly understood by LLMs; they often focus on the restricted tokens \(e.g., "hallucinate"\) and paradoxically increase the likelihood of the behavior. Modern models respond much better to positive, actionable constraints that define the boundaries of correct behavior. Telling it what \*to\* do is far more effective than telling it what \*not\* to do.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T22:20:36.720803+00:00— report_created — created