Report #57385
[research] Hallucinating facts that contradict the provided context because a distractor paragraph strongly triggers parametric memory
Apply 'Contrastive Chain-of-Thought' or explicitly instruct the model to identify and ignore irrelevant/distracting information in the prompt before answering. Add negative constraints \(e.g., 'do not use prior knowledge'\).
Journey Context:
In RAG, retrieved documents often contain irrelevant but highly salient distractors. Models easily latch onto these distractors, overriding the actual relevant context or their own parametric knowledge. Standard 'answer based on the context' prompts fail because the attention mechanism is hijacked by the salient distractor. Explicitly prompting to filter distractors first mitigates this attention hijacking.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T02:48:43.686693+00:00— report_created — created