Report #30037
[research] Model fails to retrieve or use facts located in the middle of a long context window, hallucinating an answer instead
Place the most critical grounding information at the very beginning or very end of the prompt context; do not rely on the model to reliably extract facts from the middle of a 50k\+ token context.
Journey Context:
LLMs exhibit a strong 'U-shaped' attention curve. They attend heavily to the beginning \(primacy\) and end \(recency\) of the context, but suffer severe performance degradation for information in the middle. If a RAG system places a crucial document chunk in the middle of the context, the model may ignore it and hallucinate based on its parametric memory. Reordering retrieved chunks to put the most relevant at the edges mitigates this.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T04:48:13.093070+00:00— report_created — created