Report #78037
[research] Missing relevant retrieved context when it is placed in the middle of a long prompt (Lost in the Middle)
Place the most critical retrieved documents at the very beginning or very end of the context window. Do not rely on the model to attend uniformly to a long list of RAG results.
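One way to apply this is to reorder the retrieved documents before building the prompt. The sketch below is illustrative only: it assumes the retriever returns documents sorted from most to least relevant, and the function name `reorder_for_edges` is hypothetical, not part of any library. It alternates documents between the front and the back of the list, so the least relevant ones end up in the middle.

```python
from typing import List, Sequence, TypeVar

T = TypeVar("T")


def reorder_for_edges(ranked: Sequence[T]) -> List[T]:
    """Reorder documents so the most relevant sit at the edges of the context.

    Assumption: `ranked` is sorted from most relevant to least relevant.
    Even-indexed items fill the front in order; odd-indexed items fill the
    back in reverse, which pushes the weakest documents toward the middle.
    """
    front: List[T] = []
    back: List[T] = []
    for index, doc in enumerate(ranked):
        if index % 2 == 0:
            front.append(doc)
        else:
            back.append(doc)
    # Reverse the back half so the 2nd most relevant document comes last.
    return front + back[::-1]


def build_context(ranked: Sequence[str], separator: str = "\n\n") -> str:
    """Join the reordered documents into one context string."""
    return separator.join(reorder_for_edges(ranked))
```

With five documents ranked d1 (best) to d5 (worst), the resulting order is d1, d3, d5, d4, d2: the best document opens the context and the second best closes it. Whether this helps for a given model and prompt length should be checked empirically.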
Journey Context:
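Even with perfect retrieval, if the ground-truth document is buried in the middle of a 100k-token context window, the model may behave as if it had not been provided, which can lead to hallucinated answers. Recall as a function of document position tends to follow a U-shaped curve: highest at the start and end of the context, lowest in the middle. This is attributed here to architectural constraints (attention sinks), though the exact cause may vary by model.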
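To check whether a particular model shows this position sensitivity, the ground-truth document can be moved through every slot among a fixed set of distractors while the question stays the same. The following is a minimal sketch under stated assumptions: `ask_model` is a hypothetical callable standing in for whatever client sends a prompt and returns the answer text, and `edge_only_fake_model` is a toy stand-in used only to exercise the harness, not a measurement of any real model.

```python
from typing import Callable, Dict, List


def place_gold(gold: str, distractors: List[str], position: int) -> List[str]:
    """Return a new document list with `gold` inserted at `position`."""
    docs = list(distractors)
    docs.insert(position, gold)
    return docs


def recall_by_position(
    gold: str,
    distractors: List[str],
    question: str,
    expected: str,
    ask_model: Callable[[str], str],
) -> Dict[int, bool]:
    """Map each insertion position to whether the answer contained `expected`.

    `ask_model` is an assumption: any function that takes a prompt string
    and returns the model's answer as a string.
    """
    results: Dict[int, bool] = {}
    for position in range(len(distractors) + 1):
        docs = place_gold(gold, distractors, position)
        prompt = "\n\n".join(docs) + "\n\nQuestion: " + question
        answer = ask_model(prompt)
        results[position] = expected.lower() in answer.lower()
    return results


def edge_only_fake_model(prompt: str) -> str:
    """Toy stand-in that only 'reads' the first and last document.

    It exists to demonstrate the harness and does not model real behavior.
    """
    docs = prompt.split("\n\n")[:-1]  # drop the trailing question block
    visible = [docs[0], docs[-1]]
    return "Paris" if any("Paris" in doc for doc in visible) else "unknown"
```

Replacing the toy function with a real client call and averaging over many questions would give a per-position recall estimate for the model in use.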
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T13:34:50.771686+00:00— report_created — created