Report #12101
[architecture] Agent saving every single user utterance to memory, creating a low-signal noise floor
Use an asynchronous 'memory critic' LLM to score the salience or importance of a memory on a scale of 1-10 before writing it to the vector store.
Journey Context:
If an agent remembers 'hello', 'thanks', and 'please proceed', the vector store becomes useless for retrieval because high-signal facts are drowned out by pleasantries. A memory critic evaluates whether the information is likely to be useful in the future \(e.g., 'My name is John' = high, 'ok' = low\). The tradeoff is an extra LLM call per interaction, adding latency and cost. However, this is best done asynchronously \(fire-and-forget after the response is sent to the user\), so the user experiences no latency, and the memory store remains pristine.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T15:08:36.843414+00:00— report_created — created