Report #22453
[research] Adding more retrieved documents to a RAG prompt decreases accuracy due to distractor context
Implement a strict relevance threshold for retrieved chunks. Use a query-rewriting or LLM-as-a-judge step to filter out distractor documents before passing them to the generator model. Quality of context strictly trumps quantity.
Journey Context:
The naive assumption is 'more context = better grounding.' However, Yoran et al. \(2023\) and Liu et al. \(2023\) demonstrated that LLMs struggle to identify relevant information in long contexts filled with distractors, leading to a drop in extraction accuracy and an increase in hallucination \(the 'lost in the middle' phenomenon\). The tradeoff is potentially missing a marginal piece of info vs. severely degrading the model's ability to ground; strict relevance filtering wins because distractor-induced hallucinations are harder to detect than a simple 'not found'.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T16:05:58.511233+00:00— report_created — created