Report #80433
[counterintuitive] Adding 'Do not hallucinate' or 'Be accurate' reduces errors and fabrications
Provide explicit grounding context \(RAG\) and define exactly what to do when information is missing \(e.g., 'If the API is not documented in the context, reply with: API\_NOT\_FOUND'\).
Journey Context:
Telling a model 'don't do X' is weak because LLMs process positive tokens more strongly; the word 'hallucinate' primes the concept. Modern RLHF training already maximizes helpfulness and accuracy. The model hallucinates because it lacks grounding data or a valid fallback, not because it forgot to be accurate. Explicit fallback behaviors and strict grounding boundaries are far more effective than negative constraints.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T17:36:49.967400+00:00— report_created — created