Report #11160
[research] LLM hallucinates or overrides correct internal knowledge when given irrelevant or low-quality retrieved documents
Implement a relevance classifier (e.g., a smaller cross-encoder or an LLM-as-a-judge call) between the retriever and the generator. If no document scores at or above the relevance threshold, answer from parametric memory or refuse, rather than forcing the generator to use the noisy context.
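A minimal sketch of such a gate, under stated assumptions: the names `relevance_gate`, `GateResult`, `build_prompt`, and `token_overlap_score` are hypothetical and not part of this report, the 0.5 threshold is an arbitrary placeholder, and the token-overlap scorer is only a toy stand-in for a real cross-encoder or LLM-as-a-judge call so that the example runs without external dependencies.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence

# Hypothetical scorer signature: (query, document) -> relevance score in [0, 1].
Scorer = Callable[[str, str], float]


@dataclass
class GateResult:
    kept: List[str]      # documents that passed the relevance threshold
    use_context: bool    # False => answer from parametric memory or refuse


def token_overlap_score(query: str, document: str) -> float:
    """Toy stand-in for a cross-encoder or LLM judge.

    Returns the fraction of query tokens that also appear in the document.
    """
    q_tokens = set(query.lower().split())
    d_tokens = set(document.lower().split())
    if not q_tokens:
        return 0.0
    return len(q_tokens & d_tokens) / len(q_tokens)


def relevance_gate(
    query: str,
    documents: Sequence[str],
    scorer: Scorer = token_overlap_score,
    threshold: float = 0.5,  # assumed value; tune on labeled data
) -> GateResult:
    """Keep only documents whose score meets the threshold."""
    kept = [doc for doc in documents if scorer(query, doc) >= threshold]
    return GateResult(kept=kept, use_context=bool(kept))


def build_prompt(query: str, gate: GateResult) -> str:
    """Build a grounded prompt if anything passed, else a fallback prompt."""
    if gate.use_context:
        context = "\n\n".join(gate.kept)
        return (
            "Answer using only the context below.\n\n"
            f"Context:\n{context}\n\nQuestion: {query}"
        )
    # Nothing passed the gate: do not force noisy context on the generator.
    return (
        "No relevant documents were retrieved. Answer from your own "
        f"knowledge, or say you do not know.\n\nQuestion: {query}"
    )


if __name__ == "__main__":
    docs = ["paris is the capital of france", "bananas are rich in potassium"]
    gate = relevance_gate("capital of france", docs)
    print(build_prompt("capital of france", gate))
```

In practice the `scorer` argument would wrap the actual relevance model; keeping it as a plain callable lets the threshold and the fallback behavior be tested independently of that model.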
Journey Context:
The standard RAG paradigm assumes retrieved context is always helpful. However, LLMs are highly susceptible to 'context poisoning': they will confidently generate answers based on irrelevant or misleading chunks, ignoring accurate knowledge already encoded in their pre-trained weights. For this failure mode, filtering out noise is more important than maximizing recall.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T12:42:15.267507+00:00— report_created — created