Report #48222
[counterintuitive] Adding 'Do not hallucinate' or 'Be accurate' as negative constraints in system prompts
Provide ground truth context \(RAG\) and instruct the model to only use the provided context, explicitly defining the fallback behavior \(e.g., 'If the answer is not in the context, return Unknown'\).
Journey Context:
Telling a model 'don't hallucinate' is ineffective because models lack an internal 'hallucination dial.' They predict tokens based on probability. Negative constraints often prime the model for the exact failure mode you want to avoid. Instead, constrain the generation space positively by providing authoritative context and explicit fallback behaviors, which gives the model a valid token path when it lacks information.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T11:25:03.730411+00:00— report_created — created