Report #70157
[research] LLM fails to use relevant information located in the middle of a long RAG context
Place the most critical retrieved documents at the very beginning and very end of the context window. Prefer smaller chunks with targeted retrieval over stuffing the context with everything retrieved.
Journey Context:
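Models exhibit a U-shaped recall curve for long contexts: they use information near the start and end of the prompt most reliably and information in the middle least reliably. If a crucial fact is buried in the middle of a 50k-token prompt, the model may act as if it was never provided and fall back on parametric memory, which leads to hallucinations. Reordering the retrieved context adds no inference cost, so it is a cheap fix to try first.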
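The reordering can be sketched as below. This is a minimal illustration, not a verified fix: it assumes the retriever already returns chunks sorted from most to least relevant, and the function name `reorder_for_edges` is hypothetical. It alternates chunks between the front and the back of the list so the two highest-ranked chunks end up first and last, and the lowest-ranked chunks land in the middle.

```python
from typing import List, TypeVar

T = TypeVar("T")


def reorder_for_edges(ranked: List[T]) -> List[T]:
    """Reorder chunks so the most relevant sit at the edges of the context.

    Assumes `ranked` is sorted most relevant first (an assumption about
    the retriever, not something this function checks).
    """
    front: List[T] = []  # ranks 1, 3, 5, ... in descending relevance
    back: List[T] = []   # ranks 2, 4, 6, ... in descending relevance
    for i, chunk in enumerate(ranked):
        if i % 2 == 0:
            front.append(chunk)
        else:
            back.append(chunk)
    # Reverse the back half so relevance rises again toward the end.
    return front + back[::-1]


if __name__ == "__main__":
    chunks = ["d1", "d2", "d3", "d4", "d5"]  # d1 is the most relevant
    print(reorder_for_edges(chunks))
```

With five chunks ranked d1 to d5, the result is d1, d3, d5, d4, d2: the top two chunks occupy the first and last positions, and the weakest chunk sits in the middle. Whether this helps for a given model and prompt length should be checked against your own evaluation set.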
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T00:20:11.246752+00:00— report_created — created