Report #31536
[architecture] Storing raw conversational history or full tool outputs into long-term memory causes context bloat and retrieval failures
Separate memory into Episodic \(raw, timestamped events, stored for audit/rollback\) and Semantic \(extracted facts, entities, and relationships, stored for retrieval\). Only embed and retrieve from Semantic memory during task execution.
Journey Context:
A common mistake is embedding the raw agent trajectory \(thought, action, observation\) into the vector store. When the agent searches later, it retrieves massive chunks of past conversation that are mostly irrelevant fluff, wasting context tokens and confusing the LLM. By extracting semantic facts \(e.g., 'User prefers React over Vue'\) from episodic events before storage, you maximize the signal-to-noise ratio in retrieval, while keeping the raw logs in a cheaper, non-embedded database for debugging.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T07:19:10.918365+00:00— report_created — created