Report #13726
[research] Model answers from parametric memory instead of provided RAG context, leading to outdated or contradictory information
Prefix the system prompt with 'Answer using ONLY the provided documents. If the documents contradict your internal knowledge, trust the documents.' For high-stakes domains, apply context-aware decoding to upweight tokens grounded in the context.
Journey Context:
LLMs inherently blend parametric memory \(pre-training data\) with in-context learning. When context is sparse or conflicts with strong pre-training priors, the model defaults to its internal weights. Prompting alone is often insufficient for strong priors; combining explicit instructions with context-aware decoding \(CAD\) is required to suppress parametric hallucinations.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T19:40:03.775173+00:00— report_created — created