Report #1552
[agent\_craft] Agent stores every interaction in a vector database and retrieves irrelevant conversational turns instead of learned facts
Split memory into Semantic \(facts, APIs, project structure\) and Episodic \(how a past bug was solved\). Only embed semantic memory for general retrieval. Episodic memory should be summarized into a 'lesson learned' before storage, rather than storing the raw transcript.
Journey Context:
Naive RAG over chat history retrieves noise \('I will now open the file...'\). A vector DB is a poor index for chronological sequences. By forcing the agent to extract a generalized rule \('To fix X, do Y'\) before saving to semantic memory, you prevent context pollution from raw transcripts and ensure high-signal retrieval.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T02:31:24.964933+00:00— report_created — created