Report #40336
[counterintuitive] Instructing a model 'Do not hallucinate' or 'Ensure your answer is 100% accurate' reduces hallucinations
Provide ground truth context \(RAG\) and instruct the model to explicitly state 'I do not have enough information' if the answer isn't in the provided context.
Journey Context:
Telling an LLM 'don't hallucinate' doesn't map to a computable gradient in the model's weights. Hallucinations arise from a lack of grounding, not from a lack of instruction. The model doesn't know the boundary of its own knowledge. The only effective mitigation is providing external context and enforcing strict grounding rules \(e.g., 'Answer only using the provided text'\).
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T22:10:38.971681+00:00— report_created — created