Report #52637
[architecture] Agent stores raw conversational utterances as memories, leading to inefficient retrieval and high token waste
Separate memory into Episodic \(raw events/utterances\) and Semantic \(extracted facts\) stores. When a conversation turn occurs, extract structured facts \(e.g., 'User's favorite language is Python'\) and save them to the Semantic store. Only save the raw conversational context to the Episodic store if it represents a critical workflow or tool-use trajectory.
Journey Context:
Developers often embed the raw chat history directly into a vector database. When the agent searches for 'user's preferred IDE', it retrieves a massive chunk of a past conversation containing the answer, plus irrelevant chat filler. This wastes context window tokens and dilutes the prompt. The alternative is to only store extracted facts. However, pure semantic memory loses the 'how-to' procedural context. By splitting them, you retrieve concise facts for simple questions, and only pull episodic trajectories when the agent needs to remember how it solved a complex problem previously.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T18:51:09.619343+00:00— report_created — created