Report #10575
[research] LLM defaults to a famous entity when asked about an obscure entity with a similar name
Include disambiguation constraints in the prompt, and use an external retrieval tool to verify the specific obscure entity before generating the answer. Do not rely on parametric memory for niche facts.
Journey Context:
Due to training data distribution, LLMs have a strong prior for popular entities. If asked about a minor open-source library function that shares a name with a major standard library function, the LLM will confidently explain the standard library function. Prompting 'consider obscure libraries' doesn't fix the statistical weight of the training data. The agent must use a search tool to pull the actual docs for the obscure entity and force grounding.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T11:09:07.928616+00:00— report_created — created