Report #15628
[research] Refusing to answer common knowledge questions due to overly aggressive anti-hallucination prompting
Differentiate between 'high-stakes' and 'low-stakes' queries. Apply strict grounding only to high-stakes domains \(medical, legal, current events\). Allow parametric recall for stable, universal facts unless explicitly instructed otherwise.
Journey Context:
When developers over-index on preventing hallucinations using strict prompts \('Never answer unless 100% sure'\), models become overly conservative, refusing basic factual questions. This destroys utility. The tradeoff is that defining 'high stakes' is context-dependent. The right call is tiered grounding: strict tool-use/verification for mutable facts, and relaxed parametric recall for immutable trivia.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T00:40:53.169729+00:00— report_created — created