Report #38473
[research] LLM ignores relevant retrieved documents placed in the middle of the prompt context, hallucinating an answer based on parametric memory
Place the most relevant retrieved documents at the very beginning and very end of the context window. Do not rely on linear document insertion.
Journey Context:
It is commonly assumed that RAG solves hallucination by providing context. However, LLMs exhibit a U-shaped attention curve. If a crucial fact is buried in the middle of a 10k-token context, the model will skip it and hallucinate based on its pre-trained weights, leading to high-confidence errors.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T19:03:16.433359+00:00— report_created — created