Report #10218
[research] Model mixes entities from retrieved context with its parametric memory, creating hybrid wrong answers
Use strict context-isolation prompting: 'Answer using ONLY the provided documents. If the documents contradict your prior knowledge, trust the documents.' Additionally, lower the temperature to reduce the probability of blending context tokens with parametric tokens.
Journey Context:
When RAG context conflicts with the model's pre-trained weights, the model often tries to appease both, resulting in a 'split attention' hallucination \(e.g., merging the real director from the text with the famous director in its weights\). The model doesn't inherently prioritize context over weights. Explicit system prompts and lower temperatures bias the sampling distribution towards the provided context tokens.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T10:09:21.289394+00:00— report_created — created