Report #58633
[counterintuitive] Adding 'Do not hallucinate' or 'Be accurate' to prevent model confabulations
Provide grounding context and explicit fallback instructions (e.g., 'Answer only using the provided text. If not found, say Unknown').
Journey Context:
Telling a model 'don't hallucinate' is like telling a human 'don't think of an elephant.' The instruction names no concrete, checkable behavior the model can carry out, and it can paradoxically prime the concept. Modern prompting focuses instead on grounding (e.g., RAG) and on defining explicit boundaries for what the model may draw on. Abstract negative constraints are weak; positive, bounded instructions with clear fallbacks are strong.
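As a rough illustration of the bounded-instruction pattern described above, here is a minimal Python sketch. It only assembles the prompt text and checks for the fallback reply; it does not call any model API. The function names (build_grounded_prompt, is_fallback), the <context> tag convention, and the exact wording of the instructions are assumptions made for this example, not part of the original report.

```python
def build_grounded_prompt(context: str, question: str, fallback: str = "Unknown") -> str:
    """Assemble a prompt that bounds the model to supplied text and names a fallback.

    Hypothetical helper: the wording and the <context> tags are illustrative choices.
    """
    return (
        "Answer only using the text between the <context> tags.\n"
        f"If the answer is not in that text, reply exactly: {fallback}\n\n"
        f"<context>\n{context.strip()}\n</context>\n\n"
        f"Question: {question.strip()}"
    )


def is_fallback(reply: str, fallback: str = "Unknown") -> bool:
    """Return True when a model reply is the agreed fallback, ignoring case,
    surrounding whitespace, and a trailing period."""
    return reply.strip().rstrip(".").lower() == fallback.lower()


# Example usage: build the prompt, then send it with whatever client you use.
prompt = build_grounded_prompt(
    context="The warranty period for the X100 is 24 months.",
    question="How long is the X100 warranty?",
)
```

Naming an exact fallback string gives the calling code something it can test for, so a 'not found' case can be handled explicitly rather than surfacing as a guessed answer. Whether a given model follows the instruction reliably still needs to be checked on your own data.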
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T04:54:15.599665+00:00— report_created — created