Report #99551
[counterintuitive] Retrieval-augmented generation misses the exact document that answers the query
Retrieve fewer, more focused chunks; place the critical evidence at the start or end of the prompt; do not bury key facts in the middle of a long context.
Journey Context:
It is tempting to stuff as much retrieved context as possible into the prompt, on the assumption that a "128k" window means the model can reliably search all of it. Liu et al.'s "Lost in the Middle" study shows that accuracy is U-shaped with respect to where the relevant document sits in the context: highest when it is at the very beginning or end, and significantly worse when it is in the middle, in some cases worse than answering with no documents at all. This is an attention/positional bias in how the model uses long context, not something that prompt wording fixes. Reranking, compression, and placing the answer-bearing passage at the start or end of the prompt are the practical fixes; simply enlarging the context window is not.
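The fixes above (retrieve fewer chunks, keep the strongest evidence at the edges of the prompt) can be sketched roughly as follows. This is a minimal illustration, not a reference implementation: it assumes each chunk already carries a relevance score from some reranker, and the function names `top_k`, `edges_first`, and `build_prompt` are hypothetical helpers invented for this example.

```python
from typing import List, Tuple


def top_k(scored: List[Tuple[str, float]], k: int) -> List[str]:
    """Keep only the k highest-scoring chunks, best first.

    `scored` is a list of (chunk_text, relevance_score) pairs; the scores
    are assumed to come from whatever reranker is in use.
    """
    ranked = sorted(scored, key=lambda pair: pair[1], reverse=True)
    return [text for text, _ in ranked[:k]]


def edges_first(ranked: List[str]) -> List[str]:
    """Reorder best-first chunks so the strongest sit at the prompt edges.

    Chunks are dealt alternately to the front and the back, so the best
    chunk ends up first, the second-best last, and the weakest in the middle.
    """
    front: List[str] = []
    back: List[str] = []
    for i, chunk in enumerate(ranked):
        (front if i % 2 == 0 else back).append(chunk)
    return front + back[::-1]


def build_prompt(question: str, scored: List[Tuple[str, float]], k: int = 4) -> str:
    """Assemble a prompt from a small, edge-ordered set of chunks."""
    chunks = edges_first(top_k(scored, k))
    context = "\n\n".join(f"[{i + 1}] {c}" for i, c in enumerate(chunks))
    return f"{context}\n\nQuestion: {question}"
```

For example, `edges_first(["a", "b", "c", "d", "e"])` (best first) returns `["a", "c", "e", "d", "b"]`: the top two chunks occupy the first and last positions. Whether this ordering helps a given model is something to measure on your own queries rather than assume.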
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-29T05:19:36.698762+00:00— report_created — created