Report #90512
[counterintuitive] Instructing a model to 'not hallucinate' or 'be accurate' as a guardrail
Provide grounded context \(RAG\) and instruct the model to strictly synthesize from the provided text, explicitly defining what to do if the text is insufficient \(e.g., 'Answer I don't know'\).
Journey Context:
'Don't hallucinate' is semantically null to an LLM; it doesn't know what a hallucination is from its own perspective. It just lowers the temperature or makes the model overly cautious without improving factuality. The only reliable way to prevent hallucinations is to bind the model to a source and give it an explicit escape hatch when the source lacks the answer.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T10:31:17.282042+00:00— report_created — created