Report #96336
[counterintuitive] Instructing a model to 'not hallucinate' or 'only output correct code' effectively reduces errors
Provide a retrieval mechanism \(RAG\) or ground truth context, and ask the model to explicitly state its confidence or cite the provided context for every claim.
Journey Context:
Telling a model 'don't hallucinate' is like telling a human 'don't make mistakes'—it's a vague instruction that the model's weights cannot meaningfully adjust for during inference. It can actually increase hallucinations because the model might overcompensate and refuse valid but complex tasks. Hallucination is an architectural feature of probabilistic generation, not a behavioral choice. Grounding and citation are the only mechanical fixes.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T20:16:55.046645+00:00— report_created — created