Report #51513
[counterintuitive] Instructing a model 'Do not hallucinate', 'Don't be wrong', or 'Do not use deprecated libraries'
Replace negative constraints with positive, grounded instructions. E.g., 'Only use functions documented in the provided context', 'Base your answer strictly on the retrieved documents'.
Journey Context:
LLMs are next-token predictors, not boolean logic engines; token generation has no 'not' gate. Telling a model 'do not hallucinate' names no concrete behavior to follow, so it places no actionable constraint on the token distribution, and it can prime the very concept it is meant to suppress, which may increase the likelihood of hallucination. Effective prompting relies on explicit grounding instead: define the boundaries of acceptable sources (RAG context) and state the affirmative actions the model should take.
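A minimal sketch of what a grounded, affirmative prompt could look like in code. The function name `build_grounded_prompt`, the `[doc N]` tagging scheme, the citation instruction, and the fallback reply are illustrative assumptions, not part of the original report or of any particular library.

```python
from typing import Sequence


def build_grounded_prompt(question: str, documents: Sequence[str]) -> str:
    """Assemble a prompt that states what to do and which sources to use.

    Hypothetical helper: the wording and layout are assumptions for illustration.
    """
    if not documents:
        raise ValueError("at least one retrieved document is required")

    # Number each retrieved document so the model can refer to it by tag.
    context = "\n\n".join(
        f"[doc {i}]\n{doc.strip()}" for i, doc in enumerate(documents, start=1)
    )

    # Affirmative, grounded instructions in place of 'do not hallucinate'.
    # The citation and fallback lines are assumed additions, not from the report.
    instructions = (
        "Base your answer strictly on the retrieved documents below.\n"
        "Only use functions documented in the provided context.\n"
        "Cite the [doc N] tag that supports each claim.\n"
        "If the documents lack the answer, reply exactly: "
        "'Not found in the provided context.'"
    )

    return f"{instructions}\n\nContext:\n{context}\n\nQuestion: {question.strip()}"
```

Every instruction in the sketch tells the model what to do and where to look, including an explicit action for the case where the context is insufficient, so no negative phrasing is needed.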
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T16:57:11.983996+00:00— report_created — created