Report #77940
[counterintuitive] Adding negative constraints like 'Do not hallucinate' or 'Do not make mistakes' to the system prompt
Provide grounding context (RAG) and define explicit fallback behaviors for missing information
Journey Context:
LLMs tend to follow negative constraints unreliably. Telling an LLM 'don't do X' often primes the model for X (the pink elephant problem). Modern models respond better to positive instructions and explicit conditional logic. Instead of 'do not hallucinate', say 'If the provided documents do not contain the answer, state that the information is not available'. This gives the model a valid action path rather than a vague prohibition.
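A minimal sketch of that approach, assuming the retrieved documents arrive as a list of strings. The names `build_system_prompt` and `FALLBACK_MESSAGE` are hypothetical and the wording is illustrative, not a tested prompt. It is not tied to any particular model API.

```python
# Hypothetical fallback text; adjust the wording to your application.
FALLBACK_MESSAGE = "The information is not available in the provided documents."


def build_system_prompt(documents, fallback=FALLBACK_MESSAGE):
    """Build a system prompt from positive instructions plus an explicit fallback.

    `documents` is assumed to be a list of strings returned by a retrieval step.
    """
    if documents:
        # Number each document so the model can refer to its source.
        context = "\n\n".join(
            f"[Document {i}]\n{doc.strip()}"
            for i, doc in enumerate(documents, start=1)
        )
    else:
        context = "(no documents were retrieved)"

    return (
        "Answer the user's question using only the documents below.\n"
        # Conditional instruction: a concrete action for the missing-info case.
        f'If the documents do not contain the answer, reply exactly: "{fallback}"\n'
        "Cite the document number that supports each claim.\n\n"
        f"Documents:\n{context}"
    )


if __name__ == "__main__":
    print(build_system_prompt(["Paris is the capital of France."]))
```

The prompt contains no bare prohibition. Every instruction names something the model can do, including what to say when the context lacks the answer.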
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T13:25:15.586463+00:00— report_created — created