Report #96989
[research] Model ignores relevant retrieved context in the middle of a long prompt and falls back on parametric memory
Place the most critical instructions and the most relevant retrieved context at the very beginning or end of the prompt, not in the middle. Wrap each retrieved chunk in clear delimiters, and instruct the model to cite the specific chunk it relies on before answering.
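A minimal sketch of the layout described above, assuming the retriever returns chunks sorted most relevant first. The function names (`order_for_edges`, `build_prompt`), the `<chunk id="...">` delimiter format, and the instruction wording are all hypothetical choices for illustration, not part of any particular library; the sketch only builds the prompt string and does not call a model.

```python
from typing import List, Tuple


def order_for_edges(chunks_ranked: List[str]) -> List[Tuple[int, str]]:
    """Reorder chunks so the best-ranked ones sit at the start and end.

    Input is assumed to be sorted most relevant first. Even ranks fill the
    front, odd ranks fill the back in reverse, so the least relevant chunks
    end up in the middle. Returns (original_rank, text) pairs.
    """
    front: List[Tuple[int, str]] = []
    back: List[Tuple[int, str]] = []
    for rank, text in enumerate(chunks_ranked):
        (front if rank % 2 == 0 else back).append((rank, text))
    return front + back[::-1]


def build_prompt(question: str, chunks_ranked: List[str]) -> str:
    """Build a prompt with instructions at both ends and delimited chunks."""
    # Hypothetical instruction text; adjust to the model and task at hand.
    instructions = (
        "Answer using only the chunks below. "
        "First name the id of the chunk you rely on, e.g. [chunk-2], "
        "then answer. If no chunk contains the answer, say so."
    )
    parts = [instructions, ""]
    for rank, text in order_for_edges(chunks_ranked):
        # Each chunk gets an explicit delimiter and a citable id.
        parts.append(f'<chunk id="chunk-{rank}">\n{text}\n</chunk>')
    # Repeat the instructions after the context so they also sit at the end.
    parts += ["", f"Question: {question}", instructions]
    return "\n".join(parts)


if __name__ == "__main__":
    ranked = ["most relevant", "second", "third", "fourth", "least relevant"]
    print(build_prompt("What changed in the latest release?", ranked))
```

Repeating the instructions at both ends is one way to keep them out of the middle of the prompt; whether this or the citation requirement actually helps should be checked against the specific model and retrieval setup.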
Journey Context:
Retrieval-Augmented Generation (RAG) often fails when the prompt is long. Models tend to show U-shaped recall over the context window (the "lost in the middle" effect): they use facts near the start and end of the prompt but often miss those in the middle. When the model ignores the retrieved context, it falls back on what it learned in pre-training, which may be outdated and can lead to hallucinated answers.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T21:22:48.646066+00:00— report_created — created