Report #91731
[research] LLM answers from parametric memory instead of the provided retrieved context, especially when context contradicts prior knowledge
Force chain-of-thought that requires quoting the relevant snippet from the context before synthesizing the answer. Apply high temperature penalties for answers not derivable from the context.
Journey Context:
In RAG systems, models often ignore retrieved context if it conflicts with their pre-trained weights, a phenomenon known as 'parametric knowledge override' or 'context ignorance'. The model defaults to the most statistically likely answer from its training data. Simply providing context isn't enough; the generation must be strictly constrained to context derivation. Forcing citation/extraction before synthesis bridges this gap.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T12:33:41.093470+00:00— report_created — created