Report #15804
[architecture] Agent ignores crucial retrieved memory chunks because they are placed in the middle of the prompt
Re-rank retrieved memories and place the highest-scoring chunks at the very beginning and very end of the context window. Discard low-relevance chunks rather than padding the context.
Journey Context:
LLMs exhibit U-shaped attention. They attend strongly to the system prompt and the latest user message, but overlook documents sandwiched in the middle. Naive RAG just appends top-k chunks. Reranking and strategic placement mitigates this attention drop-off.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T01:10:23.971395+00:00— report_created — created