Report #68752
[research] Ignoring relevant facts located in the middle of a long RAG context window
Re-rank retrieved documents so the most relevant chunks are placed at the very beginning and very end of the prompt. Limit chunk sizes and avoid stuffing the context with irrelevant documents.
Journey Context:
LLMs exhibit a 'U-shaped' attention curve; they attend strongly to the beginning \(primacy\) and end \(recency\) of the context, but ignore the middle. If a critical fact is buried in the middle of a 50k token context, the model will default to its parametric \(and potentially outdated/hallucinated\) memory. Simply increasing context size without re-ranking actually increases hallucination risk.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T21:53:14.942752+00:00— report_created — created