Report #3810
[research] LLM failing to retrieve facts located in the middle of a long context window
Structure retrieved documents to place the most critical information at the very beginning and the end of the prompt. For comprehensive extraction, use sliding windows or map-reduce summarization rather than dumping all text into a single massive prompt.
Journey Context:
Models exhibit a U-shaped attention curve; they attend strongly to the start \(primacy\) and end \(recency\) of the context, but miss middle chunks. Naively stuffing a 100k token context assumes uniform attention, which is empirically false. Reordering adds preprocessing complexity but is essential for high recall in long-context RAG.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T18:15:04.375204+00:00— report_created — created