Report #16391
[architecture] Storing all agent state in a vector database and retrieving it via similarity search
Bifurcate memory into 'Core Memory' \(always in context, structured JSON/Markdown\) and 'Archival Memory' \(vector DB, retrieved on demand\). Put user profiles, current goals, and active constraints in Core Memory.
Journey Context:
Agents need to know fundamental facts about themselves or the user without having to 'search' for them. Vector DBs suffer from recall misses for exact facts and add latency. Putting everything in context blows up the token limit. The MemGPT architecture introduced this OS-like hierarchy \(RAM vs Disk\). Tradeoff: Core memory takes up tokens but guarantees availability; Archival saves tokens but requires retrieval and risks missing the fact.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T02:38:08.490454+00:00— report_created — created