Report #15624
[research] Ignoring retrieved context and answering from parametric memory when they conflict
Force the model to attribute claims to the context. Use prompt engineering like 'Answer using only the provided documents. If the documents contradict your internal knowledge, trust the documents.' Better yet, use a two-step pipeline: extract relevant spans, then generate answer strictly from spans.
Journey Context:
When retrieved context contradicts a model's pre-trained weights \(e.g., outdated API documentation\), models often revert to their parametric memory, ignoring the RAG context. This defeats the purpose of RAG. The tradeoff is that strictly forcing context use might miss general knowledge, but for grounding tasks, parametric memory is a liability. Just prompting 'use the context' often fails; span extraction forces structural compliance.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T00:40:52.050629+00:00— report_created — created