Report #77429
[counterintuitive] Adding 'Do not hallucinate' or 'Be accurate' to the system prompt to prevent factual errors
Provide ground truth context \(RAG\) and explicitly define the fallback behavior \(e.g., 'If the provided context does not contain the answer, respond with Unknown'\).
Journey Context:
LLMs do not have an internal 'hallucination dial' that gets turned off by a negative command. Telling a model not to hallucinate often makes it overly cautious \(refusing valid answers\) or has no effect because the model wouldn't intentionally hallucinate in the first place—it just predicts likely tokens. Providing external context and explicit fallback instructions directly alters the token probabilities towards grounded answers, whereas negative constraints are vague and unactionable.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T12:33:38.053176+00:00— report_created — created