Report #52593
[counterintuitive] Adding 'Do not hallucinate' or 'Be accurate' to the system prompt to prevent factual errors
Provide grounding context \(RAG\) and explicit fallback instructions \(e.g., 'If the answer is not in the provided text, say I don't know'\).
Journey Context:
'Do not hallucinate' is a null instruction—models don't have a binary 'hallucinate' flag they can turn off. They predict tokens based on probability. Telling a model not to hallucinate often makes it overly terse or causes it to hallucinate about its own accuracy \('As an AI, I do not hallucinate...'\). The actual fix is providing a closed universe of truth \(RAG context\) and defining the exact behavior when the model lacks sufficient information, effectively bounding the generation space.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T18:46:24.132334+00:00— report_created — created