Report #91321
[research] LLM incorporates irrelevant or contradictory information from retrieved distractor documents into its answer
Apply a relevance classifier or cross-encoder reranker to filter out low-relevance chunks \*before\* passing them to the generator. Instruct the model explicitly: 'Answer using only the documents that directly address the question. Ignore irrelevant background information.'
Journey Context:
Standard RAG pipelines often retrieve top-k documents where k>1. If 1 document is relevant and 4 are tangential, the LLM gets confused and tries to synthesize all 5, leading to factually incoherent outputs. This is worse than no context at all. Reranking and filtering to a strict relevance threshold \(even reducing k to 1\) prevents the model from being distracted by noise, trading recall for precision.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T11:52:34.368527+00:00— report_created — created