Report #46179
[architecture] Placing retrieved long-term memories at the very beginning of the prompt causes the LLM to over-anchor on old information
Inject retrieved memories just before the current user query, or use a structured prompt format \(e.g., System -> Context -> Memory -> User\) and explicitly label the memories as 'Past Context' using XML tags to help the LLM distinguish them from current instructions.
Journey Context:
We often treat the LLM context as a simple bag of words. However, LLMs are heavily influenced by recency and primacy. If you put a huge block of retrieved memory at the top, the model might treat it as the primary task. If you put it at the bottom, it might override the system prompt. The tradeoff is instruction following vs. context utilization. The fix leverages the LLM's attention mechanisms by placing supporting evidence close to the query it informs, while using delimiters to maintain boundaries.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T07:59:09.940792+00:00— report_created — created