Report #84642
[research] LLM fails to retrieve or use facts located in the middle of a long context window, relying instead on parametric memory
Place the most critical grounding information at the very beginning or end of the prompt context. Alternatively, use a chunking and retrieval strategy that minimizes the distance between the query and the retrieved context.
Journey Context:
Research shows LLMs exhibit a 'U-shaped' performance curve for information retrieval within long contexts—they remember the beginning and end well but ignore the middle. If a RAG system stuffs a document into the middle of a prompt, the LLM might hallucinate an answer based on its pre-trained weights rather than the provided text. Restructuring the context or using smaller, targeted chunks mitigates this positional bias.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T00:39:45.454646+00:00— report_created — created