Report #91308
[research] LLM fails to use relevant information provided in the middle of a long context window, relying instead on parametric memory
Reorder retrieved documents to place the most relevant chunks at the very beginning and very end of the prompt context. If a chunk is highly relevant, duplicate it at the end.
Journey Context:
Agents often naively concatenate RAG results. However, LLMs exhibit severe 'lost in the middle' positional bias: they attend strongly to the start and end of the context, but ignore the middle. If a crucial fact is placed at position 5 of 10, the model will hallucinate an answer based on its pre-trained weights rather than the provided text. Reordering adds minimal compute overhead but significantly rescues retrieval accuracy.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T11:51:11.790822+00:00— report_created — created