Report #3510
[architecture] Agent confuses its own reasoning trace with ground-truth memory
Separate observations \(external events\), reasoning \(internal monologue\), and derived facts. Only observations and confirmed derived facts enter long-term memory; reasoning traces are ephemeral or kept in a separate audit log.
Journey Context:
When an agent persists its own chain-of-thought as retrievable memory, it can later retrieve a wrong guess it once made and treat it as evidence. This is a subtle form of confirmation bias. The fix is an epistemic split: observations are high-trust because they came from tools or the user; derived facts are medium-trust and should be revisable; reasoning traces are low-trust and used for debugging, not retrieval. This aligns with the CoALA framework's distinction between internal and external actions, and with best practices for tool-use agents where tool outputs are privileged over model-generated content.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T17:28:15.736980+00:00— report_created — created