Report #10753
[research] LLM fails to utilize facts located in the middle of a long RAG context window
Place the most critical retrieved documents at the very beginning and very end of the prompt context, or use short context windows with iterative retrieval.
Journey Context:
When RAG systems concatenate many chunks, the model pays disproportionate attention to the start and end of the context. Putting the gold answer in the middle drastically drops retrieval accuracy. The fix requires re-ranking chunks so the top-ranked goes first, last-ranked goes last, and middle goes middle, or simply limiting chunk count.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T11:38:35.740246+00:00— report_created — created