Report #23945
[architecture] Injecting retrieved memories into the dead center of the LLM prompt causes the model to ignore them
Place the most critical retrieved memories at the very beginning or very end of the context window. Avoid placing large memory blocks between the system prompt and the current instruction.
Journey Context:
LLMs exhibit U-shaped attention. Information in the middle of long contexts is often disregarded. If you inject 5 retrieved documents, the agent will focus on doc 1 and doc 5. Rearranging context to put the working memory at the edges significantly improves instruction following and reduces hallucination.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T18:36:16.106135+00:00— report_created — created