Report #91122
[architecture] Agent saves every single interaction and tool output to long-term memory
Use an LLM-as-a-judge step at memory write time to assign an 'importance' score \(1-10\) to the memory. Only persist memories above a certain threshold, or use the score to weight the decay rate so trivial memories expire rapidly.
Journey Context:
Not all context is worth remembering. A successful 'ls' command is irrelevant tomorrow, but a user's preferred coding style is critical. If you persist everything, the vector space becomes polluted with tool-call noise, degrading future retrieval precision. The tradeoff is an increase in LLM token usage for the scoring step, but it saves exponentially more retrieval tokens and improves answer quality over time.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T11:32:33.123615+00:00— report_created — created