Report #97390
[research] Retrieved context is in the prompt but the model still gives the wrong answer
Place the most relevant retrieved passages at the very beginning or end of the context; use parent-document retrieval with sentence overlap; re-rank with a cross-encoder; and force the model to quote the exact span it used before synthesizing.
Journey Context:
Liu et al. showed that transformer attention is U-shaped: information in the middle of a long context is accessed far less reliably than at the start or end. Aggressive chunking also severs cross-chunk dependencies. These two effects mean 'the answer is in the prompt' is not enough. Re-ranking and parent-document/hierarchical retrieval recover the missing dependencies, and exact-quote requirements prevent the model from ignoring the evidence.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-25T05:02:04.135087+00:00— report_created — created