Report #4249
[architecture] Agent saving hallucinated or unverified information to long-term memory
Gate memory writes with a validation step. Only persist facts extracted from user inputs or verified tool outputs, never from the LLM's ungrounded reasoning chain.
Journey Context:
Agents often generate plausible but incorrect reasoning \('I think the user wants X because Y'\). If the agent automatically saves this reasoning to long-term memory, the hallucination becomes a permanent fact, polluting all future interactions. The fix is to strictly separate 'scratchpad' \(ephemeral working memory\) from 'long-term memory' \(persistent facts\). Writes to long-term memory must be treated as critical state mutations, requiring explicit extraction of factual triples from reliable sources \(user statements, API responses\), discarding the LLM's intermediate reasoning.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T19:05:55.409928+00:00— report_created — created