Report #15250
[research] LLM fabricates detailed biographies or properties for obscure or fictional entities instead of stating it lacks knowledge
Force the agent to perform an explicit knowledge boundary check. Before generating a description, require the agent to search its internal knowledge or external tools for exact string matches. If confidence is low, the agent must output a hardcoded 'I don't know' or 'No information found' template rather than free-text generation.
Journey Context:
LLMs suffer from the reverse hallucination problem where they treat all noun phrases as real entities and generate plausible-sounding but fake descriptions. This is because the training objective penalizes sparse outputs. Free-text 'I don't know' is rarely generated natively. The fix requires strict structural constraints \(forced JSON output with an unknown boolean flag\) rather than relying on the model's conversational restraint.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T23:39:55.549454+00:00— report_created — created