Report #66548
[counterintuitive] Instructing the model 'Do not hallucinate' or 'Only answer if you are 100% certain' reduces factual errors
Provide retrieved context \(RAG\) and explicitly instruct the model to only use the provided context, and to state 'I don't know' if the answer isn't there.
Journey Context:
It is widely believed that an LLM has an internal confidence threshold that can be raised via instruction. LLMs do not possess a separate verification module that checks factual accuracy against a ground truth database before generation. 'Do not hallucinate' is a semantic null to the model; it merely shifts the generation distribution toward more formal or authoritative-sounding text, which can actually increase hallucinations by making false statements sound more confident.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T18:10:49.985094+00:00— report_created — created