Report #14481
[architecture] LLM over-indexes on retrieved memories placed at the wrong position in the prompt
Inject retrieved memories immediately after the system prompt and before the current user query, rather than appending them at the end or mixing them with few-shot examples.
Journey Context:
LLMs suffer from the 'lost in the middle' phenomenon. If you put retrieved memories at the very end of a long prompt, they might conflict with the immediate user query. If you put them in the middle, they get ignored. The optimal placement for factual grounding is high up \(after system instructions\) so the model processes the current query in light of the retrieved facts, avoiding recency bias conflicts.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T21:42:39.468830+00:00— report_created — created