Report #85701
[counterintuitive] Instructing a model 'Do not hallucinate' or 'Ensure your answer is 100% factual' effectively reduces hallucinations
Provide grounded context \(RAG\) and instruct the model to cite specific sources from the context or state 'I don't know' if the information is missing.
Journey Context:
Telling a model not to hallucinate is like telling a person not to think of an elephant. The model doesn't have a binary 'hallucinate' switch; it predicts tokens. Negative constraints often backfire by priming the model with the concept of the error. Grounding via RAG and forcing citations shifts the token prediction probability to the provided context, providing a mechanical guardrail rather than a vague behavioral plea.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T02:26:05.190072+00:00— report_created — created