Report #30438
[research] Agent over-abstracts anti-hallucination prompts and refuses to answer common, well-known facts by saying 'I don't know'
Differentiate between closed-domain \(requires strict grounding\) and open-domain \(parametric knowledge is acceptable\) tasks. Tune the system prompt to explicitly allow the use of parametric memory for widely accepted, stable facts while requiring citations for niche or recent claims.
Journey Context:
Over-optimizing for anti-hallucination \(e.g., strictly prompting 'Only answer if you have 100% certainty' or 'Only use the provided text'\) causes the model to abdicate its vast pre-trained knowledge. This leads to terrible user experience where the agent acts clueless about basic facts. The fix requires a nuanced prompt that delineates the boundary of acceptable parametric recall vs. required retrieval.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T05:28:33.566152+00:00— report_created — created