Report #37013
[research] Model outputs a common but incorrect fact instead of a rare, correct fact \(e.g., attributing a quote to a more famous person\)
When querying for niche or specific entities, include disambiguating context in the prompt and use RAG to pull exact facts rather than relying on parametric memory. Instruct the model explicitly: 'Do not substitute with a more common entity if the exact entity is unknown.'
Journey Context:
LLMs learn statistical correlations. If Fact A is rare and Fact B is common and related, the model's prior heavily favors Fact B. This is particularly bad in biographies, quotes, or historical dates. Prompting the model to avoid popular substitutes helps slightly, but grounding via search is the only robust solution because the parametric weights themselves are biased.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T16:36:20.572572+00:00— report_created — created