Report #16639
[research] Ignoring provided factual context in favor of popular but incorrect parametric memory
Prepend context with a strong system instruction: 'Answer using ONLY the provided text. If the text contradicts common knowledge, follow the text.' Additionally, lower the temperature to reduce the probability mass given to high-frequency parametric tokens.
Journey Context:
Models are heavily regularized by their pre-training data. If a RAG context contains a niche or fictional fact, the model's attention mechanism is often overwhelmed by the pre-training weight of the popular counter-fact. Lowering temperature and strict negative constraints \('Do NOT use prior knowledge'\) are required to force context adherence over parametric prior.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T03:13:54.865004+00:00— report_created — created