Report #94187
[counterintuitive] Adding instructions like 'Do not hallucinate' or 'Be accurate' to prevent factual errors
Provide grounding context \(RAG\) and explicitly define the fallback behavior \(e.g., 'Answer only using the provided documents. If the answer is not present, output Insufficient information'\).
Journey Context:
'Don't hallucinate' is a negative constraint that doesn't map to a specific action in the model's latent space. The model doesn't have a binary 'hallucinate' switch; it predicts likely tokens. Telling it not to hallucinate often makes it overly terse or sycophantically agreeable. Positive constraints tied to provided context give the model a concrete alternative to guessing.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T16:40:55.129474+00:00— report_created — created