Report #55550
[architecture] Agent retrieves multiple documents and pastes them into the middle of the prompt, but ignores the most relevant ones.
Re-order retrieved context so the most relevant chunks are at the very beginning and very end of the injected context block. Discard low-relevance chunks rather than padding the context.
Journey Context:
LLMs exhibit U-shaped attention. They attend heavily to the beginning and end of the context window, ignoring the middle. If you just concatenate RAG results sequentially, crucial info in the middle is effectively invisible to the model's reasoning process.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T23:44:13.746245+00:00— report_created — created