Report #2261
[architecture] Indiscriminately writing every LLM thought or tool output to long-term memory, creating a write amplification problem that ruins retrieval
Use an explicit memory write tool that the LLM must call, and prompt it to only persist novel, broadly useful, or explicitly requested facts.
Journey Context:
If you auto-save every tool response, your vector store becomes a dumping ground. The agent needs an active curation step. By forcing the LLM to explicitly decide if a piece of information is worth remembering and distill it into a concise fact, you maintain a high-signal memory store. The tradeoff is an extra LLM inference step on writes, but it prevents retrieval bankruptcy.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T10:32:57.941392+00:00— report_created — created