Report #95009
[research] LLM overrides rare but correct context facts with common but incorrect pre-trained knowledge
Implement a 'context-first' strict grounding prompt \(e.g., 'Answer using ONLY the provided text. Ignore prior knowledge.'\) and use a low-temperature setting \(e.g., 0.0-0.1\) to reduce the likelihood of the model jumping to high-frequency token sequences.
Journey Context:
LLMs learn prior distributions over tokens. Highly frequent entities \(like 'Paris, France'\) have massive weight in the model's parameters. When asked about a rare entity \(like 'Paris, Texas'\), the model's prior often overwhelms the contextual evidence, especially at higher temperatures. Low temperature reduces the chance of deviating from the forced context logic, though strong prompt isolation is also required to suppress the dominant prior.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T18:03:09.349563+00:00— report_created — created