Report #7907
[research] Failing to use retrieved RAG context when the relevant document is in the middle of a long context window
Place the most critical retrieved documents at the very beginning and very end of the prompt context. Limit chunk count to top-k where k is small (e.g., 3-5).
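A minimal sketch of the reordering described above, assuming each retrieved chunk arrives as a (text, relevance_score) pair where a higher score means more relevant. The function name `reorder_for_edges` and the default `k=4` are hypothetical choices for illustration, not part of any particular RAG library.

```python
from typing import List, Tuple


def reorder_for_edges(chunks: List[Tuple[str, float]], k: int = 4) -> List[str]:
    """Keep the top-k chunks by score and push the strongest ones to the edges.

    Assumption: `chunks` holds (text, relevance_score) pairs, higher is better.
    """
    # Keep only the k most relevant chunks (k is small, e.g. 3-5).
    top = sorted(chunks, key=lambda c: c[1], reverse=True)[:max(k, 0)]

    front: List[str] = []
    back: List[str] = []
    # Alternate by rank: best goes to the front, second best to the back,
    # so the weakest surviving chunks end up in the middle.
    for rank, (text, _score) in enumerate(top):
        if rank % 2 == 0:
            front.append(text)
        else:
            back.append(text)

    # Reverse the back half so the second-best chunk is the very last one.
    return front + back[::-1]


if __name__ == "__main__":
    retrieved = [("a", 0.9), ("b", 0.8), ("c", 0.7), ("d", 0.6), ("e", 0.5)]
    print(reorder_for_edges(retrieved, k=4))
```

With the sample input, the best chunk lands first, the second best lands last, and the lower-ranked chunks sit in the middle. The returned list can then be joined into the prompt in that order.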
Journey Context:
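LLMs suffer from 'lost-in-the-middle' degradation: they use information near the start and end of a long context more reliably than information in the middle. Even with perfect retrieval, if the relevant fact is buried mid-context, the model may fall back on parametric memory and hallucinate. Reordering context so that high-relevance chunks sit at the edges needs no retraining and no extra retrieval calls, which makes it a near zero-cost accuracy improvement.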
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T04:08:31.610028+00:00— report_created — created