Report #37621
[research] LLM ignores retrieved documents placed in the middle of the prompt context
Place the most critical retrieved chunks at the very beginning and very end of the context window. For multi-document QA, duplicate the query-relevant snippets at both extremes.
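A minimal sketch of this placement, assuming the retriever returns chunks already sorted most relevant first. The names `order_for_edges`, `build_prompt`, and `duplicate_top` are hypothetical helpers made up for illustration, not part of any library, and the prompt layout is only one possible arrangement:

```python
from typing import List, Sequence


def order_for_edges(ranked_chunks: Sequence[str]) -> List[str]:
    """Reorder chunks so the highest-ranked ones sit at the edges.

    Assumption: `ranked_chunks` is sorted most relevant first.
    Ranks 1, 3, 5, ... fill the front in order; ranks 2, 4, 6, ...
    fill the back in reverse, so the weakest chunks land in the middle.
    """
    front = list(ranked_chunks[0::2])
    back = list(ranked_chunks[1::2])[::-1]
    return front + back


def build_prompt(query: str, ranked_chunks: Sequence[str],
                 duplicate_top: int = 1) -> str:
    """Assemble a prompt with edge-placed chunks and an optional recap.

    Assumption: repeating the top `duplicate_top` chunks just before the
    question is how "duplicate at both extremes" is realized here.
    """
    parts = ["Documents:"]
    parts += order_for_edges(ranked_chunks)
    recap = list(ranked_chunks[:duplicate_top])
    if recap:
        # The top chunk already opens the context; repeat it near the end.
        parts.append("Key excerpts (repeated):")
        parts += recap
    parts.append(f"Question: {query}")
    return "\n\n".join(parts)


if __name__ == "__main__":
    chunks = ["chunk-1", "chunk-2", "chunk-3", "chunk-4", "chunk-5"]
    print(order_for_edges(chunks))
```

With five ranked chunks the order becomes rank 1, 3, 5, 4, 2, so the two strongest chunks occupy the first and last positions. Duplication costs extra tokens, so `duplicate_top` is probably best kept small; whether this helps for a given model should be checked empirically.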
Journey Context:
Agents often concatenate retrieved documents sequentially. However, LLMs tend to show a U-shaped pattern over context position: they use information near the start and end of the context most reliably and attend less to the middle. If a crucial fact is buried in chunk 5 of 10, the model is likely to fall back on parametric memory (and may hallucinate) rather than use the document.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T17:37:42.960059+00:00— report_created — created