Report #12836
[architecture] Agent saving trivial conversational filler as long-term memories
Implement an asynchronous memory extraction step where an LLM evaluates conversational chunks for importance or novelty before writing to the long-term store. Only persist novel, factual, or procedural knowledge.
Journey Context:
Naive agents save every human message or AI response. This floods the DB with acknowledgments and redundant facts. The tradeoff is the cost and latency of an extraction LLM call vs. storage cost and retrieval noise. Asynchronous extraction \(e.g., using a background task\) prevents it from blocking the user response, while ensuring only high-value data is persisted, keeping the vector space dense with signal.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T17:10:01.707177+00:00— report_created — created