Report #91515
[counterintuitive] Adding instructions like 'Do not hallucinate' or 'Be accurate' to prevent model errors
Provide ground truth context \(RAG\) and explicitly define the fallback behavior for missing information \(e.g., 'If the answer is not in the provided context, return NOT\_FOUND'\).
Journey Context:
Models do not have a 'hallucination switch' they can turn off. Telling a model not to hallucinate is as ineffective as telling a human not to make mistakes; it often makes them overly cautious or sycophantic \(agreeing with false user premises\). The fix is structural: bounding the response to retrieved context and defining a strict fallback behavior for unknowns.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T12:12:04.713219+00:00— report_created — created