Report #45043
[counterintuitive] Instructing the model 'Do not make things up,' 'Do not hallucinate,' or 'Only use the provided context'
Define exactly what must be present for an answer to be valid (e.g., 'If the text does not contain the answer, respond Insufficient information') and use grounded generation/RAG patterns.
Journey Context:
Models tend to struggle with negative constraints for two reasons. First, 'hallucination' is an abstract concept: the instruction names no observable condition the model can check its output against. Second, negation ('do not') places the tokens of the unwanted behavior into the context, which can prime the very behavior you want to avoid. Positive, actionable fallback instructions tend to work better: instead of telling the model what not to do, explicitly define the set of valid outputs and the exact fallback response.
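A minimal sketch of the positive-fallback pattern described above, in Python. Everything named here is hypothetical and for illustration only: the helper names build_grounded_prompt and is_valid_reply, the <context> tag convention, and the Answer/Quote reply format are assumptions, not part of the original report or of any library. The prompt states what a valid answer looks like and gives the exact fallback string; the checker accepts a reply only if it is the fallback or carries a quote found verbatim in the context.

```python
# Exact fallback string, taken from the report's example instruction.
FALLBACK = "Insufficient information"


def build_grounded_prompt(context: str, question: str) -> str:
    """Build a prompt that defines valid outputs instead of forbidding bad ones."""
    return (
        "Answer the question using the text between the <context> tags.\n"
        "A valid answer is supported by a sentence copied from the context.\n"
        f"If the text does not contain the answer, respond exactly: {FALLBACK}\n\n"
        f"<context>\n{context}\n</context>\n\n"
        f"Question: {question}\n"
        "Reply in this format:\n"
        "Answer: <answer or fallback>\n"
        "Quote: <supporting sentence copied from the context, or NONE>"
    )


def is_valid_reply(reply: str, context: str) -> bool:
    """Accept the fallback, or an answer whose quote appears verbatim in the context."""
    fields = {}
    for line in reply.splitlines():
        # Split each "Key: value" line at the first colon only.
        key, sep, value = line.partition(":")
        if sep:
            fields[key.strip().lower()] = value.strip()
    answer = fields.get("answer", "")
    quote = fields.get("quote", "")
    if answer == FALLBACK:
        return True
    # Hypothetical grounding check: a plain substring match against the context.
    return bool(answer) and bool(quote) and quote != "NONE" and quote in context
```

The substring check is deliberately crude: it only shows that a defined valid state can be verified mechanically, which a bare 'Do not hallucinate' instruction does not allow. A reply that fails the check can be retried or replaced with the fallback string.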
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T06:04:24.724108+00:00— report_created — created