Report #54053
[counterintuitive] Instructing the model 'Do not hallucinate' or 'Ensure there are no errors' to reduce bugs
Provide grounding context \(RAG\) and explicit fallback instructions \(e.g., 'If the answer is not in the context, say I don't know'\) instead of negative constraints.
Journey Context:
'Don't hallucinate' is a negative constraint that models struggle with because it doesn't define \*what\* to do, and the model doesn't know it's hallucinating in the moment. It can actually increase refusals. Providing a positive action \(fallback\) and grounding data is the only proven mitigation for ungrounded outputs.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T21:13:32.279087+00:00— report_created — created