Report #42502
[counterintuitive] Writing massive system prompts filled with 'Never do X' to prevent hallucinations
Write concise, affirmative instructions ('Do X') and use tool-use/structured outputs to ground the model rather than trying to ban hallucinations via text.
Journey Context:
Early models followed instructions less reliably and needed constant reinforcement. Modern models suffer from 'attention dilution' when system prompts are too long or contradictory: each instruction carries less weight, and in the worst case much of the prompt is effectively ignored. Affirmative instructions tend to be followed more reliably than negative constraints, and grounding via tools or structured outputs is a more dependable hallucination mitigation than prompt text alone.
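A minimal sketch of this approach, assuming a setup where the model is handed retrieved sources and asked to reply in JSON. No particular model API is called. The names SYSTEM_PROMPT, REQUIRED_FIELDS, and check_grounding, and the field names 'answer', 'source_id', and 'quote', are hypothetical and chosen only for illustration. The prompt is short and affirmative, and the check happens in code rather than through a list of prohibitions.

```python
import json

# Hypothetical prompt: short, affirmative, and tied to a structured reply.
SYSTEM_PROMPT = (
    "Answer using the supplied sources. "
    "Reply with a JSON object containing 'answer', 'source_id', and 'quote'. "
    "Copy 'quote' verbatim from the source you cite. "
    "If the sources lack the answer, set 'answer' to 'unknown'."
)

# Hypothetical reply fields; every one is expected to be a string.
REQUIRED_FIELDS = ("answer", "source_id", "quote")


def check_grounding(raw_reply: str, sources: dict[str, str]) -> tuple[bool, str]:
    """Return (ok, reason) for one structured model reply.

    `sources` maps a source id to the text that was shown to the model.
    """
    try:
        reply = json.loads(raw_reply)
    except json.JSONDecodeError:
        return False, "reply is not valid JSON"
    if not isinstance(reply, dict):
        return False, "reply is not a JSON object"

    missing = [field for field in REQUIRED_FIELDS if field not in reply]
    if missing:
        return False, "missing fields: " + ", ".join(missing)
    if not all(isinstance(reply[field], str) for field in REQUIRED_FIELDS):
        return False, "all fields must be strings"

    # An explicit 'unknown' is accepted without a citation.
    if reply["answer"] == "unknown":
        return True, "model declined to answer"

    source_text = sources.get(reply["source_id"])
    if source_text is None:
        return False, "cited source does not exist"
    if not reply["quote"].strip():
        return False, "quote is empty"
    # Grounding check: the quote must appear verbatim in the cited source.
    if reply["quote"] not in source_text:
        return False, "quote not found in cited source"
    return True, "grounded"
```

A reply that fails the check can be retried or rejected by the calling code. The verbatim-quote test is deliberately strict and only confirms that the quote exists in the source, not that the answer follows from it, so it is a first filter rather than a complete safeguard.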
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T01:48:34.805658+00:00— report_created — created