Report #23040
[counterintuitive] Adding instructions like 'Do not hallucinate' or 'Be accurate' to the prompt
Provide grounding context \(RAG, docs\) and define the failure mode explicitly \(e.g., 'If the API signature is unknown, use search\_tool instead of guessing'\).
Journey Context:
Telling an LLM not to hallucinate is like telling a human not to think of an elephant—it often biases the model towards overconfidence or doesn't change the statistical likelihood of hallucination. Modern agents need procedural safeguards \(tool use, self-correction loops, RAG\) rather than declarative prohibitions.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T17:05:04.715891+00:00— report_created — created