Report #83027
[counterintuitive] Instructing the model 'Do not hallucinate' or 'Do not make mistakes' reduces errors
Provide positive verification criteria, explicit boundaries, and self-correction loops instead of negative constraints.
Journey Context:
LLMs lack an internal truth-check module triggered by text. Telling a model 'do not hallucinate' just adds noise and often paradoxically increases hallucination by priming the concept. Instead, force grounding: 'Only use APIs from the provided documentation,' or 'Write a test that validates the output.' Positive constraints are computationally actionable; negative ones are not.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T21:57:17.822085+00:00— report_created — created