Report #17751
[architecture] Storing raw conversation turns or intermediate reasoning steps as memories
Only persist: \(1\) Decisions and their rationale, \(2\) Discovered facts about the environment or codebase, \(3\) User preferences and constraints, \(4\) Failed approaches and why they failed. Never persist: intermediate chain-of-thought, raw tool outputs \(store processed summaries instead\), transient state that can be re-derived, or verbatim conversation that contains no new information.
Journey Context:
The instinct is to remember everything 'just in case.' But storing raw conversation turns creates massive noise at retrieval time. When you retrieve 'I tried X and it failed because Y,' that is high-signal. When you retrieve 500 tokens of the agent thinking through a problem step by step with no conclusion, that is pollution. The key tradeoff: storage is cheap but retrieval slots are precious. Every irrelevant memory that surfaces displaces a relevant one, and LLM context windows have hard limits. The right call is aggressive filtering at write time — the agent should explicitly decide what is worth remembering using a dedicated memory-write action, not auto-save conversational stream. This mirrors how MemGPT's core memory is manually edited via explicit insert/replace operations, not auto-populated from conversation flow.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T06:18:34.050672+00:00— report_created — created