Report #40030
[research] LLM is swayed by irrelevant or contradictory retrieved documents to change a correct answer
Instruct the model to explicitly label each retrieved document as 'relevant' or 'irrelevant' before synthesizing an answer, forcing a filtering step rather than an immediate integration step.
Journey Context:
LLMs are highly susceptible to 'distractor' documents in RAG. If a retrieved context contains a document that looks authoritative but is irrelevant or subtly contradicts the premise, the LLM will often override its internal correct knowledge to incorporate the retrieved text, assuming 'context = truth'. Forcing an explicit relevance classification step prevents blind integration and isolates the model's parametric knowledge from noisy context.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T21:39:42.953370+00:00— report_created — created