Report #55019
[research] Model hallucinates plausible-sounding details for obscure entities instead of admitting ignorance
Implement entity-frequency heuristics or use the model's self-knowledge capability to detect long-tail queries. Route these queries explicitly to external search tools rather than allowing parametric generation.
Journey Context:
LLMs are trained to minimize loss, which for long-tail facts means interpolating from similar, more frequent concepts (e.g., inventing a biography for a minor author by mixing traits of famous authors). For rare entities the model has little reliable signal about what it does not know, so its confidence cannot be trusted on its own. Detecting the long-tail nature of the query (e.g., via Wikipedia API hit count or low token probability on the entity name) and forcing a tool-use call is the most reliable mitigation identified here.
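The routing rule described above can be sketched in Python. This is a minimal illustration under stated assumptions, not a tested implementation: `route_query`, `RouteDecision`, the injected `hit_count` callable, and both thresholds (`min_hits`, `min_mean_logprob`) are hypothetical names and values chosen for the example. The hit-count source (a Wikipedia search, an internal index, etc.) is deliberately left to the caller, and the thresholds would need tuning against real data.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Sequence


@dataclass
class RouteDecision:
    route: str   # "search_tool" or "parametric"
    reason: str


def mean_logprob(token_logprobs: Sequence[float]) -> float:
    """Average log-probability over the entity-name tokens."""
    if not token_logprobs:
        raise ValueError("token_logprobs must be non-empty")
    return sum(token_logprobs) / len(token_logprobs)


def route_query(
    entity: str,
    hit_count: Callable[[str], int],
    entity_token_logprobs: Optional[Sequence[float]] = None,
    min_hits: int = 50,               # assumed threshold, tune on real data
    min_mean_logprob: float = -4.0,   # assumed threshold, tune on real data
) -> RouteDecision:
    """Send long-tail entities to an external search tool.

    `hit_count` is a caller-supplied lookup (hypothetical here) that returns
    how often the entity appears in some reference corpus or search index.
    """
    hits = hit_count(entity)
    if hits < min_hits:
        return RouteDecision("search_tool", f"only {hits} hits for {entity!r}")

    # Optional second signal: low model probability on the entity name itself.
    if entity_token_logprobs is not None:
        avg = mean_logprob(entity_token_logprobs)
        if avg < min_mean_logprob:
            return RouteDecision(
                "search_tool", f"mean logprob {avg:.2f} below threshold"
            )

    return RouteDecision("parametric", "entity looks frequent enough")


if __name__ == "__main__":
    # Toy frequency table standing in for a real hit-count source.
    counts = {"Jane Austen": 120_000, "Obscure Poet": 3}
    lookup = lambda name: counts.get(name, 0)
    print(route_query("Obscure Poet", lookup))
    print(route_query("Jane Austen", lookup, [-0.5, -1.0]))
```

Checking frequency first and token probability second keeps the cheap lookup on the common path; either signal alone is enough to force the tool call, which errs toward searching rather than generating.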
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T22:50:29.580041+00:00— report_created — created