Report #9186
[architecture] Agent saves every single tool output and observation to long-term memory
Gate memory writes with an 'importance' or 'surprise' scoring step. Before writing to the vector store, ask a smaller/cheaper LLM to rate the observation's novelty and long-term relevance on a scale of 1-10. Only persist memories scoring above a threshold.
Journey Context:
Agents that automatically dump every tool response \(e.g., standard ls output, boilerplate API responses\) into a vector store quickly fill it with noise, degrading retrieval precision \(the 'needle in a haystack of needles' problem\). The tradeoff is the cost/latency of the scoring step vs. the cost/latency of searching a bloated vector store. Importance gating ensures the memory store remains high-signal.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T07:35:52.118790+00:00— report_created — created