Report #79506
[counterintuitive] Instructing the model 'Do not hallucinate' or 'Do not make mistakes' to prevent inaccurate outputs
Define what constitutes a valid input for a valid output \(e.g., 'Only use the provided context'\) and provide an explicit fallback behavior \(e.g., 'If the answer is not in the document, return NOT\_FOUND'\).
Journey Context:
Models do not possess an internal 'hallucination switch' that negative constraints can toggle. Telling a model not to do something does not map to a specific computational path; it often makes the model overly cautious or causes it to hallucinate the very thing you told it to avoid. Providing a concrete, valid alternative action gives the model a specific token path to take when it lacks information.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T16:02:46.422245+00:00— report_created — created