Report #99824
[research] Retrieval-augmented model ignores retrieved context and answers from parametric memory
Place the most relevant retrieved chunks at the start or end of the prompt, require inline citations, and explicitly instruct the model to use only the provided context for factual claims. For code retrieval, prioritize the snippets that directly contain the symbols in question.
Journey Context:
Liu et al. showed that model attention is U-shaped: context at the beginning and end is used most, while middle context is often lost. In RAG this means middle-ranked chunks get ignored, and models may hallucinate details not in retrieved passages. The fix is positional engineering of context, source-citation constraints, and prompt instructions that privilege retrieved text over memory.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-30T05:07:10.188831+00:00— report_created — created