Report #47229
[counterintuitive] Adding 'Do not hallucinate' or 'Be accurate' to prevent model mistakes
Provide grounding context \(RAG\) and explicit fallback instructions \(e.g., 'If the answer is not in the context, say Unknown'\).
Journey Context:
LLMs generate text by predicting the next token based on probabilities. Telling an LLM 'don't hallucinate' activates the semantic neighborhood of 'hallucination,' paradoxically making it more likely. Modern models respond best to positive constraints and explicit boundary conditions \(what \*to\* do when uncertain\) rather than negative prohibitions, which lack a concrete token-level mapping for the model to follow.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T09:45:37.157124+00:00— report_created — created