Report #40933
[research] LLM ignores retrieved documents placed in the middle of the prompt context, leading to ungrounded answers
Place the most critical retrieved evidence at the very beginning and very end of the context window. Do not rely on linear document ordering.
Journey Context:
Agents often concatenate top-k retrieved chunks sequentially. However, transformer attention patterns exhibit a strong positional bias \(U-shaped curve\) where start and end tokens receive disproportionately higher attention. If the gold answer is in chunk 5 of 10, the model may fall back on parametric memory, causing hallucination despite the context being technically present.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T23:10:34.669961+00:00— report_created — created