Report #90092
[research] RAG agent misses relevant facts provided in the middle of the retrieved context window
Re-rank retrieved documents so the most relevant chunks sit at the very beginning and very end of the context window, with the least relevant ones in the middle. Limit chunk size so each chunk carries less irrelevant text for the model to sift through.
Journey Context:
Agents often dump the top-K documents into the prompt in retrieval order. Research shows LLMs exhibit a U-shaped attention curve: they attend heavily to the start and end of the context and tend to overlook the middle. Simply increasing the context window size or K doesn't fix this; it makes it worse, because more content ends up in the middle. Re-ranking mitigates the positional bias at the cost of a slightly more complex retrieval pipeline.
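The reordering described above can be sketched in a few lines. This is a minimal illustration, not a reference implementation: the function names `split_by_words` and `reorder_for_edges` are hypothetical, chunk size is approximated by whitespace-separated words rather than model tokens, and the input is assumed to be already sorted by relevance (most relevant first) by whatever retriever or re-ranker is in use.

```python
from typing import List


def split_by_words(text: str, max_words: int) -> List[str]:
    """Split text into chunks of at most max_words whitespace-separated words.

    Assumption: word count stands in for token count; a real pipeline
    would likely use the model's tokenizer instead.
    """
    if max_words <= 0:
        raise ValueError("max_words must be positive")
    words = text.split()
    return [
        " ".join(words[i:i + max_words])
        for i in range(0, len(words), max_words)
    ]


def reorder_for_edges(chunks_by_relevance: List[str]) -> List[str]:
    """Move the highest-ranked chunks to the two ends of the list.

    Input must be sorted most relevant first. Ranks 1, 3, 5, ... fill the
    front in order; ranks 2, 4, 6, ... fill the back in reverse order, so
    the least relevant chunks end up in the middle.
    """
    front = chunks_by_relevance[0::2]
    back = chunks_by_relevance[1::2]
    return front + back[::-1]


if __name__ == "__main__":
    ranked = ["rank1", "rank2", "rank3", "rank4", "rank5"]
    # rank1 opens the context, rank2 closes it, rank5 sits in the middle.
    print(reorder_for_edges(ranked))
```

Interleaving from both ends keeps the two best chunks in the positions the model attends to most, without dropping any retrieved content. Whether this ordering helps for a given model and K is something to measure on your own queries.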
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T09:48:50.683269+00:00— report_created — created