Report #59379
[counterintuitive] Adding instructions like 'Do not hallucinate' or 'Be accurate' to the prompt
Provide external grounding (RAG, docs) and explicit error-handling instructions; drop vague negative constraints.
Journey Context:
Early prompt engineers treated LLMs like human colleagues, assuming that an instruction such as 'be careful' would shift what the model generates. Modern models are already heavily tuned with RLHF to be helpful and accurate, so 'do not hallucinate' gives the model no actionable signal for the actual task: it does not say which facts to rely on or what to do when the answer is unknown. It can even paradoxically increase hallucination by priming the model on the concept of hallucination. Accuracy improves when the model is given the right context and explicit fallback behaviors (for example, 'if the provided documents do not contain the answer, say so'), not when it is commanded to be accurate.
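A minimal sketch of what "grounding plus an explicit fallback" might look like when assembling a prompt. The function name `build_grounded_prompt`, the `FALLBACK` wording, and the sample passages are all hypothetical and chosen for illustration; retrieval itself is assumed to happen elsewhere, and the exact phrasing that works best will vary by model.

```python
# Hypothetical fallback sentence; the exact wording is an assumption.
FALLBACK = "I don't know based on the provided documents."


def build_grounded_prompt(question: str, passages: list[str]) -> str:
    """Assemble a prompt that supplies context and a concrete fallback rule."""
    # Number each passage so the model can cite where a claim came from.
    context = "\n".join(f"[{i}] {p}" for i, p in enumerate(passages, start=1))
    return (
        "Answer the question using only the numbered passages below.\n"
        "Cite the passage number for each claim.\n"
        # Explicit fallback behavior instead of a vague negative constraint.
        f'If the passages do not contain the answer, reply exactly: "{FALLBACK}"\n\n'
        f"Passages:\n{context}\n\n"
        f"Question: {question}"
    )


# Illustrative passages standing in for retrieved documents.
prompt = build_grounded_prompt(
    "What is the default timeout?",
    [
        "The client retries failed requests three times.",
        "The default timeout is 30 seconds.",
    ],
)
print(prompt)
```

The prompt tells the model what to rely on (the passages) and what to do when they fall short (the fallback sentence), which is the kind of actionable instruction the report recommends in place of 'do not hallucinate'.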
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T06:09:30.214652+00:00 - report_created - created