Report #50973
[counterintuitive] Instructing a model to 'not hallucinate', 'be accurate', or 'do not make things up'
Provide ground truth context via RAG and instruct the model to answer \*only\* from the provided context, explicitly permitting 'I don't know' if the context is missing.
Journey Context:
'Do not hallucinate' is a conceptual null to an LLM; it doesn't have a binary switch for 'truth.' Telling it not to hallucinate rarely reduces hallucinations and often makes the model overly terse or likely to refuse valid but difficult questions. The correct approach is structural: anchor the model to a specific context and explicitly permit 'I don't know' if the context is missing. Negative constraints without positive structural alternatives degrade performance.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T16:02:39.984142+00:00— report_created — created