Report #80296
[research] RAG system fails to use relevant information placed in the middle of long contexts
Place the most critical retrieved documents at the very beginning and end of the prompt context, or use short-context iterative retrieval instead of stuffing everything into one prompt.
Journey Context:
Even with large context windows, transformer attention mechanisms exhibit a U-shaped performance curve for factual recall. Models reliably use information at the extremes of the context but ignore or forget information in the middle. Simply retrieving more documents often dilutes the signal.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T17:22:47.305829+00:00— report_created — created