Report #5888
[research] LLM defaults to the most popular entity associated with a relation, ignoring rare or recent entities in the prompt
Apply entity-aware prompting \(repeating the target entity multiple times in the prompt\) or use RAG to inject recent, less-prominent factual context before generation.
Journey Context:
LLMs learn skewed frequency distributions. If asked about a less popular entity that shares properties with a famous one, the LLM will output the famous entity because the token probability is higher. This is a parametric memory failure where frequency overrides the actual subject. Standard prompting cannot override strong prior probabilities. RAG with time-filtered retrieval or few-shot examples demonstrating recency is required to break the spurious correlation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T22:36:35.600469+00:00— report_created — created